{"id":457,"date":"2021-05-13T09:03:05","date_gmt":"2021-05-13T09:03:05","guid":{"rendered":"https:\/\/multisite.korebots.com\/SearchAssist\/?p=457"},"modified":"2021-05-13T09:03:05","modified_gmt":"2021-05-13T09:03:05","slug":"manage-content-of-a-web-page","status":"publish","type":"post","link":"https:\/\/multisite.korebots.com\/SearchAssist\/source-management\/content\/manage-content-of-a-web-page\/","title":{"rendered":"Manage Content of a Web Page"},"content":{"rendered":"<section class=\"l-section wpb_row height_auto\"><div class=\"l-section-h i-cf\"><div class=\"g-cols vc_row via_grid cols_1 laptops-cols_inherit tablets-cols_inherit mobiles-cols_1 valign_top type_default stacking_default\"><div class=\"wpb_column vc_column_container\"><div class=\"vc_column-inner\"><div class=\"wpb_text_column\"><div class=\"wpb_wrapper\"><p><span style=\"font-weight: 400\">Once you add content to the application, it needs to be updated as the content from websites may not be static. You can manage (schedule periodic web crawling and edit crawling) and ensure that the content is in sync with the data on the website.<\/span><\/p>\n<\/div><\/div><div class=\"w-separator size_medium with_line width_default thick_1 style_solid color_border align_center\"><div class=\"w-separator-h\"><\/div><\/div><div class=\"wpb_text_column\"><div class=\"wpb_wrapper\"><h2><span class=\"ez-toc-section\" id=\"Schedule_Web_Crawling\"><\/span>Schedule Web Crawling<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><span style=\"font-weight: 400\">The scheduler allows you to schedule a job to re-crawl the configured website periodically. To schedule a web crawling job, follow the below steps:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">On the Indices page, click <\/span><b>Content <\/b><span style=\"font-weight: 400\">on the left pane.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">On the Content list view page, select the respective source from the list.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">On the source dialog box, click the <\/span><b>Configuration <\/b><span style=\"font-weight: 400\">tab.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">On the Configuration tab, turn on the <\/span><b>Schedule <\/b><span style=\"font-weight: 400\">toggle.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Set the <\/span><b>Date<\/b><span style=\"font-weight: 400\">, <\/span><b>Time<\/b><span style=\"font-weight: 400\">, and <\/span><b>Frequency<\/b><span style=\"font-weight: 400\">.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Turn on the <\/span><b>Crawl Everything<\/b><span style=\"font-weight: 400\"> toggle to crawl all the domains.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">If you wish to crawl only selected domains, then turn off the <\/span><b>Crawl Everything<\/b><span style=\"font-weight: 400\"> toggle.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">After you turn off the <\/span><b>Crawl Everything<\/b><span style=\"font-weight: 400\"> toggle, the <\/span><b>Allow List<\/b><span style=\"font-weight: 400\"> toggle is turned on automatically. You can enter the allowed list of URLs in the <\/span><b>Allow URLs<\/b><span style=\"font-weight: 400\"> field.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">If you wish to block URLs, then turn off the <\/span><b>Allow List<\/b><span style=\"font-weight: 400\"> toggle.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">After you turn off the <\/span><b>Allow List<\/b><span style=\"font-weight: 400\"> toggle, the <\/span><b>Block List<\/b><span style=\"font-weight: 400\"> toggle is turned on automatically. You can enter URLs to block in the <\/span><b>Block URLs<\/b><span style=\"font-weight: 400\"> field.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Select <\/span><b>Crawl Settings<\/b><span style=\"font-weight: 400\">:\u00a0<\/span><\/li>\n<\/ol>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">JavaScript-rendered<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Use Cookies<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Respect robots.txt<\/span><\/li>\n<\/ul>\n<ol>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Click <\/span><b>Save<\/b><span style=\"font-weight: 400\">.<\/span><\/li>\n<\/ol>\n<\/div><\/div><div class=\"w-separator size_medium with_line width_default thick_1 style_solid color_border align_center\"><div class=\"w-separator-h\"><\/div><\/div><div class=\"wpb_text_column\"><div class=\"wpb_wrapper\"><h2><span class=\"ez-toc-section\" id=\"Edit_Crawler_Configuration\"><\/span>Edit Crawler Configuration<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><span style=\"font-weight: 400\">To edit a web crawling source, follow the below steps:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">On the Indices page, click <\/span><b>Content <\/b><span style=\"font-weight: 400\">on the left pane.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">On the Content list view page, select the respective source from the list.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">On the source dialog box, make the required changes.<\/span><\/li>\n<li style=\"font-weight: 400\">Click <b>Save<\/b>.<\/li>\n<\/ol>\n<\/div><\/div><\/div><\/div><\/div><\/div><\/section>\n","protected":false},"excerpt":{"rendered":"Once you add content to the application, it needs to be updated as the content from websites may not be static. You can manage (schedule periodic web crawling and edit crawling) and ensure that the content is in sync with the data on the website. Schedule Web Crawling The scheduler allows you to schedule a...","protected":false},"author":12,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts\/457"}],"collection":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/comments?post=457"}],"version-history":[{"count":1,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts\/457\/revisions"}],"predecessor-version":[{"id":458,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts\/457\/revisions\/458"}],"wp:attachment":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/media?parent=457"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/categories?post=457"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/tags?post=457"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}