{"id":3691,"date":"2022-04-25T10:35:46","date_gmt":"2022-04-25T10:35:46","guid":{"rendered":"https:\/\/multisite.korebots.com\/SearchAssist\/?p=3691"},"modified":"2022-08-17T11:21:32","modified_gmt":"2022-08-17T11:21:32","slug":"crawling-web-pages","status":"publish","type":"post","link":"https:\/\/multisite.korebots.com\/SearchAssist\/concepts\/managing-content\/crawling-web-pages\/","title":{"rendered":"Crawling Web Pages"},"content":{"rendered":"<section class=\"l-section wpb_row height_auto width_full\">\n<div class=\"l-section-h i-cf\">\n<div class=\"g-cols vc_row type_default valign_top\">\n<div class=\"vc_col-sm-12 wpb_column vc_column_container\">\n<div class=\"vc_column-inner\">\n<div class=\"wpb_wrapper\">\n<div class=\"wpb_text_column\">\n<div class=\"wpb_wrapper\">\n<p>Organizations usually have web pages which a user can query, such as product information or process knowledge pages. You can leverage these pages by mapping your SearchAssist app to the content.<\/p>\n<p>SearchAssist enables you to ingest content through web crawling. For example, consider a banking website. Its pages contain information that can answer most user queries. SearchAssist can crawl the bank\u2019s website and index all the content. When a user submits a query, the SearchAssist app retrieves and displays the correct content.<\/p>\n<p>To maintain an up-to-date index, schedule automated web crawl sessions (e.g. time and frequency) as required. 
After you customize crawl settings, click Proceed at the bottom of the page.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/section>\n<section class=\"l-section wpb_row height_auto width_full\">\n<div class=\"l-section-h i-cf\">\n<div class=\"g-cols vc_row type_default valign_top\">\n<div class=\"vc_col-sm-12 wpb_column vc_column_container\">\n<div class=\"vc_column-inner\">\n<div class=\"wpb_wrapper\">\n<div class=\"wpb_text_column\">\n<div class=\"wpb_wrapper\">\n<h2><span class=\"ez-toc-section\" id=\"Adding_Content_by_Web_Crawling\"><\/span><span id=\"Adding_Content_by_Web_Crawling\" class=\"ez-toc-section\"><\/span><em><br \/>\n<\/em>Adding Content by Web Crawling<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<ol>\n<li aria-level=\"1\">Follow these steps to crawl web domains:\n<ol>\n<li aria-level=\"1\">Log in to SearchAssist.<\/li>\n<li aria-level=\"1\">Go to the All Apps heading.<\/li>\n<li aria-level=\"1\">Select the correct SearchAssist app.<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl3_select-app.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-3711 size-full aligncenter\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl3_select-app.png\" alt=\"\" width=\"433\" height=\"141\" data-pagespeed-url-hash=\"3933570652\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl3_select-app.png 433w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl3_select-app-300x98.png 300w\" sizes=\"(max-width: 433px) 100vw, 433px\" \/><\/a><\/li>\n<li aria-level=\"1\">Click the Sources menu tab.<\/li>\n<li aria-level=\"1\">In the left pane, click Content.<a ref=\"magnificPopup\" 
href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_sources.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3712 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_sources.png\" alt=\"\" width=\"385\" height=\"244\" data-pagespeed-url-hash=\"1989680817\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_sources.png 385w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_sources-300x190.png 300w\" sizes=\"(max-width: 385px) 100vw, 385px\" \/><\/a><\/li>\n<li aria-level=\"1\">Click the +Add Content drop-down menu and select Crawl Web Domain. (Or click the + Add Content button in the top right corner of the page.)\n<p><a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3741 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl-1.png\" alt=\"\" width=\"612\" height=\"183\" data-pagespeed-url-hash=\"973210816\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl-1.png 612w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl-1-300x90.png 300w\" sizes=\"(max-width: 612px) 100vw, 612px\" \/><\/a><\/li>\n<li aria-level=\"1\">On the Crawl Web Domain page, fill in the Source Title and Description fields.<\/li>\n<li aria-level=\"1\">Enter the domain address in the Source URL field.<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/Enter-crawling-source-URL.png\"><img 
loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-4083 size-large\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/Enter-crawling-source-URL-1024x557.png\" alt=\"\" width=\"640\" height=\"348\" data-pagespeed-url-hash=\"1424674604\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/Enter-crawling-source-URL-1024x557.png 1024w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/Enter-crawling-source-URL-300x163.png 300w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/Enter-crawling-source-URL.png 1484w\" sizes=\"(max-width: 640px) 100vw, 640px\" \/><\/a><\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div class=\"wpb_text_column\">\n<div class=\"wpb_wrapper\">\n<h3><span class=\"ez-toc-section\" id=\"Scheduling_Crawls\"><\/span><span id=\"Scheduling_Crawls\" class=\"ez-toc-section\"><\/span>Scheduling Crawls<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Activate this optional feature to schedule regular crawls on your site. This feature is well-suited for organizations that regularly publish new website content and want to keep their search index up to date.<\/p>\n<p><b>Note<\/b>:\u00a0If you do not schedule auto-crawls, you need to manually launch new crawls to keep the index up to date.<\/p>\n<p>&nbsp;<\/p>\n<ol>\n<li aria-level=\"1\">Click the toggle switch to the ON position. 
The default setting is OFF.<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_scheduleronoff_toggle-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3735 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_scheduleronoff_toggle-1.png\" alt=\"\" width=\"578\" height=\"125\" data-pagespeed-url-hash=\"1513216943\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_scheduleronoff_toggle-1.png 578w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_scheduleronoff_toggle-1-300x65.png 300w\" sizes=\"(max-width: 578px) 100vw, 578px\" \/><\/a><\/li>\n<li aria-level=\"1\">Click the Select Date field.<\/li>\n<li aria-level=\"1\">Set the day and date of the first crawl on the calendar.<\/li>\n<li aria-level=\"1\">Enter the time to launch the crawl (use 24-hour format).<\/li>\n<li aria-level=\"1\">Click the Time Zone field and select an option. 
SearchAssist currently supports three time zones: IST, EST, and UTC.<br \/>\n<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_timezone-1.png\"><br \/>\n<\/a><a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_timezone-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3739 size-medium\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_timezone-1-300x135.png\" alt=\"\" width=\"300\" height=\"135\" data-pagespeed-url-hash=\"2713410751\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_timezone-1-300x135.png 300w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_timezone-1.png 528w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><\/li>\n<li aria-level=\"1\">\u00a0Click the Frequency field.<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/frequency12.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3912 size-medium\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/frequency12-300x252.png\" alt=\"\" width=\"300\" height=\"252\" data-pagespeed-url-hash=\"2508140531\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/frequency12-300x252.png 300w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/frequency12.png 389w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><\/li>\n<li aria-level=\"1\">Select an option.<\/li>\n<\/ol>\n<p>Select the Custom option if you want granular control over index updates and then click 
Ok.<\/p>\n<ul>\n<li aria-level=\"1\">Set the frequency of crawls (e.g. 1 time a week).<\/li>\n<li aria-level=\"1\">Choose a specific day of the week.<\/li>\n<li aria-level=\"1\">Choose when to end the crawls. The default setting is no end (Never). You can stop future crawls by selecting a specific date (choose On) or after SearchAssist records a set number of crawl occurrences (choose At).<\/li>\n<\/ul>\n<p><a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/scheduler.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3717 size-medium\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/scheduler-300x297.png\" alt=\"\" width=\"300\" height=\"297\" data-pagespeed-url-hash=\"3356476586\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/scheduler-300x297.png 300w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/scheduler-150x150.png 150w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/scheduler.png 338w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><\/p>\n<\/div>\n<\/div>\n<div class=\"wpb_text_column\">\n<div class=\"wpb_wrapper\">\n<h3><span class=\"ez-toc-section\" id=\"Setting_Crawl_Options\"><\/span><span id=\"Setting_Crawl_Options\" class=\"ez-toc-section\"><\/span>Setting Crawl Options<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Click the Crawls options field to change the default setting (Crawl Everything). 
The options are:<\/p>\n<p><a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3718 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options.png\" alt=\"\" width=\"544\" height=\"144\" data-pagespeed-url-hash=\"1087541135\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options.png 544w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options-300x79.png 300w\" sizes=\"(max-width: 544px) 100vw, 544px\" \/><\/a><\/p>\n<ul>\n<li aria-level=\"1\">Crawl Everything: Crawl all URLs that belong to the domain.<\/li>\n<li>Crawl Everything Except Specific URLs: Create rules to stop the crawl from indexing specific URLs.<\/li>\n<li>Crawl Only Specific URLs: Use rules to crawl specific URLs.<br \/>\n<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options2-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3738 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options2-1.png\" alt=\"\" width=\"545\" height=\"166\" data-pagespeed-url-hash=\"1854501553\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options2-1.png 545w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options2-1-300x91.png 300w\" sizes=\"(max-width: 545px) 100vw, 545px\" \/><\/a><\/li>\n<\/ul>\n<h3><span class=\"ez-toc-section\" 
id=\"Apply_Crawl_Settings\"><\/span><span id=\"Apply_Crawl_Settings\" class=\"ez-toc-section\"><\/span><i><a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options3-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3737 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options3-1.png\" alt=\"\" width=\"543\" height=\"140\" data-pagespeed-url-hash=\"591138442\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options3-1.png 543w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options3-1-300x77.png 300w\" sizes=\"(max-width: 543px) 100vw, 543px\" \/><\/a><\/i><br \/>\nApply Crawl Settings<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<h4>Customize Crawl Settings.<\/h4>\n<p><a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options_apply.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-3721 size-full aligncenter\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options_apply.png\" alt=\"\" width=\"599\" height=\"165\" data-pagespeed-url-hash=\"2550959802\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options_apply.png 599w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_crawl_options_apply-300x83.png 300w\" sizes=\"(max-width: 599px) 100vw, 599px\" \/><\/a><\/p>\n<ul>\n<li aria-level=\"1\"><em><strong>Java Script-Rendered<\/strong><\/em><br \/>\nCheck this box to crawl content rendered 
through JavaScript code. The default setting is unchecked. SearchAssist crawls website pages if JavaScript is enabled for those pages. Leave the Java Script-Rendered box unchecked if you want to ignore those pages.<\/li>\n<\/ul>\n<ul>\n<li aria-level=\"1\"><strong>Crawl Beyond Sitemap<\/strong><br \/>\nCheck the box to crawl web pages beyond the URLs in the sitemap file of the target website. The default setting is unchecked, which limits the crawl to sitemap content.<\/li>\n<li aria-level=\"1\"><i><strong>Respect Robots.txt Directives<\/strong><br \/>\n<\/i>This feature forces crawlers to honor any directives in the robots.txt file for the web domain. The default setting is a checked box.<\/li>\n<li aria-level=\"1\"><i>Use Cookies<br \/>\n<\/i>The default setting is a checked box, which means SearchAssist crawls web pages that require cookie acceptance. Uncheck the box to ignore web pages that require cookie acceptance.<\/li>\n<li aria-level=\"1\"><i><strong>Crawl Depth<\/strong><br \/>\n<\/i>Most commercial or enterprise websites contain multiple levels of hierarchy created by pages and subpages. The homepage is at the top of the site page hierarchy (level 0). Pages linked from the homepage form deeper, nested levels. Crawl depth specifies how deep into those nested levels the crawler should reach. Set the maximum depth the crawler is allowed to reach. The value 0 indicates no limit.<\/li>\n<li aria-level=\"1\"><i><strong>Max URL Limit<\/strong><br \/>\n<\/i>Set the maximum number of URLs to crawl. The value 0 indicates no limit.<\/li>\n<\/ul>\n<h3><span class=\"ez-toc-section\" id=\"Launching_A_Web_Crawl\"><\/span><span id=\"Launching_A_Web_Crawl\" class=\"ez-toc-section\"><\/span>Launching A Web Crawl<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>After customizing the crawl options, click Proceed.<\/p>\n<ul>\n<li>The first step is to validate the URL. If successful, SearchAssist displays a success pop-up window. 
Click Crawl to start indexing now, or click Close to launch the crawl at another time.<img loading=\"lazy\" decoding=\"async\" class=\"size-medium wp-image-3722 aligncenter\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_validation-successful-300x240.png\" alt=\"\" width=\"300\" height=\"240\" data-pagespeed-url-hash=\"1325953543\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_validation-successful-300x240.png 300w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/add_content_webcrawl_validation-successful.png 326w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/li>\n<\/ul>\n<p>The crawl takes time to complete. Click Ok to close the window and wait for the crawl to finish.<\/p>\n<ul>\n<li aria-level=\"1\">To cancel the crawl, click the Abort button.<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/Crawling-in-progress.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3723 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/Crawling-in-progress.png\" alt=\"\" width=\"337\" height=\"210\" data-pagespeed-url-hash=\"2854614636\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/Crawling-in-progress.png 337w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/Crawling-in-progress-300x187.png 300w\" sizes=\"(max-width: 337px) 100vw, 337px\" \/><\/a><\/li>\n<\/ul>\n<h4><i>Editing Crawl Settings<\/i><\/h4>\n<p>You can edit, update, or remove the crawler settings.<\/p>\n<ol>\n<li aria-level=\"1\">Click the Sources menu tab.<\/li>\n<li aria-level=\"1\">Click Content in the left pane.<\/li>\n<li aria-level=\"1\">Click the crawl you want to edit.<br 
\/>\n<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/content-management.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3724 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/content-management.png\" alt=\"\" width=\"484\" height=\"184\" data-pagespeed-url-hash=\"1336517248\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/content-management.png 484w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/content-management-300x114.png 300w\" sizes=\"(max-width: 484px) 100vw, 484px\" \/><\/a><\/li>\n<li aria-level=\"1\">On the next page, click the Configuration menu tab.<br \/>\n<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/configuration.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3725 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/configuration.png\" alt=\"\" width=\"326\" height=\"189\" data-pagespeed-url-hash=\"1585831755\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/configuration.png 326w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/configuration-300x174.png 300w\" sizes=\"(max-width: 326px) 100vw, 326px\" \/><\/a><\/li>\n<li aria-level=\"1\">Scroll down to the bottom of the page.<\/li>\n<li aria-level=\"1\">Edit schedule and crawl settings as required.<\/li>\n<li aria-level=\"1\">Click Save.<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/click_save.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3726 size-full\" 
src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/click_save.png\" alt=\"\" width=\"491\" height=\"271\" data-pagespeed-url-hash=\"227315681\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/click_save.png 491w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/click_save-300x166.png 300w\" sizes=\"(max-width: 491px) 100vw, 491px\" \/><\/a><\/li>\n<\/ol>\n<h4>Checking The Results<\/h4>\n<p>When the crawl completes, SearchAssist changes the status to Success.<\/p>\n<p><a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/results.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3727 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/results.png\" alt=\"\" width=\"635\" height=\"133\" data-pagespeed-url-hash=\"2510502663\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/results.png 635w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/results-300x63.png 300w\" sizes=\"(max-width: 635px) 100vw, 635px\" \/><\/a><\/p>\n<p>To review the URLs in the file:<\/p>\n<ol>\n<li aria-level=\"1\">Click anywhere on the crawl row.<\/li>\n<li aria-level=\"1\">Scroll through the list or use the Search tool to find a specific URL.<\/li>\n<li aria-level=\"1\">Click the X icon to close the window.<\/li>\n<\/ol>\n<h4><i><a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/urls.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3728 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/urls.png\" alt=\"\" width=\"600\" height=\"201\" data-pagespeed-url-hash=\"216455149\" 
srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/urls.png 600w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/urls-300x101.png 300w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><br \/>\nNext Steps<\/i><\/h4>\n<p>Now that you have successfully crawled one or more websites, you can add more content (e.g. documents, FAQs). After that, you are ready to index the content.<\/p>\n<h4><i>Returning To An Unfinished Crawl<\/i><\/h4>\n<p>If you are unable to complete a crawl in one session, it is possible to re-start a crawl.<\/p>\n<ol>\n<li aria-level=\"1\">Log in to SearchAssist.<\/li>\n<li aria-level=\"1\">Go to the All Apps heading.<\/li>\n<li aria-level=\"1\">Select a SearchAssist app.<\/li>\n<li aria-level=\"1\">Click the Sources menu tab.<\/li>\n<li aria-level=\"1\">In the left pane, click Content.<\/li>\n<li aria-level=\"1\">Click the unfinished crawl in the large pane. SearchAssist starts the crawl and changes the status to In Progress.<\/li>\n<\/ol>\n<h5><i><a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/return.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3729 size-full\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/return.png\" alt=\"\" width=\"538\" height=\"172\" data-pagespeed-url-hash=\"3846648885\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/return.png 538w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/return-300x96.png 300w\" sizes=\"(max-width: 538px) 100vw, 538px\" \/><\/a><\/i><\/h5>\n<h2><span class=\"ez-toc-section\" id=\"Web_Crawling_Error_Messages\"><\/span><span id=\"Web_Crawling_Error_Messages\" class=\"ez-toc-section\"><\/span>Web Crawling Error Messages<span 
class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Most web crawls fail for one of two reasons.<\/p>\n<ul>\n<li aria-level=\"1\"><i>Invalid URL\u00a0<\/i>URL validation fails because of connectivity issues or a misspelled URL. Check the URL and click\u00a0Retry, or click Edit Configuration\u00a0to change the URL.<\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-3914 aligncenter\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/validation-failed-1.png\" alt=\"\" width=\"298\" height=\"209\" data-pagespeed-url-hash=\"4022449386\" \/><\/p>\n<ul>\n<li aria-level=\"1\">URL validation succeeds for the given website, but the crawl fails while it is in progress.<a ref=\"magnificPopup\" href=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/failed-to-add-source-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-4085 size-full aligncenter\" src=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/failed-to-add-source-1.png\" alt=\"\" width=\"308\" height=\"185\" data-pagespeed-url-hash=\"4060016998\" srcset=\"https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/failed-to-add-source-1.png 308w, https:\/\/multisite.korebots.com\/SearchAssist\/wp-content\/uploads\/sites\/18\/2022\/04\/failed-to-add-source-1-300x180.png 300w\" sizes=\"(max-width: 308px) 100vw, 308px\" \/><\/a><\/li>\n<\/ul>\n<p><b><br \/>\nNote<\/b>:\u00a0If you attempt to crawl the same website again without turning on the Frequent Scheduling toggle, a duplication warning pops up that reads \u201cweb crawling cannot be duplicated.\u201d Instead, try crawling by schedule.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/section>\n","protected":false},"excerpt":{"rendered":"<p>Organizations usually have web pages which a user can query, such as product information or 
process knowledge pages. You can leverage these pages by mapping your SearchAssist app to the content. SearchAssist enables you to ingest content through web crawling. For example, consider a banking website. Its pages contain information that can answer most user&#8230;<\/p>\n","protected":false},"author":18,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[95],"tags":[],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts\/3691"}],"collection":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/users\/18"}],"replies":[{"embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/comments?post=3691"}],"version-history":[{"count":14,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts\/3691\/revisions"}],"predecessor-version":[{"id":4428,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts\/3691\/revisions\/4428"}],"wp:attachment":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/media?parent=3691"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/categories?post=3691"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/tags?post=3691"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}