{"id":130,"date":"2021-01-19T10:11:58","date_gmt":"2021-01-19T10:11:58","guid":{"rendered":"https:\/\/multisite.korebots.com\/SearchAssist\/?p=130"},"modified":"2021-06-28T07:44:05","modified_gmt":"2021-06-28T07:44:05","slug":"keyword-extraction","status":"publish","type":"post","link":"https:\/\/multisite.korebots.com\/SearchAssist\/manage-index\/keyword-extraction\/","title":{"rendered":"Keyword Extraction"},"content":{"rendered":"<section class=\"l-section wpb_row height_auto\"><div class=\"l-section-h i-cf\"><div class=\"g-cols vc_row via_grid cols_1 laptops-cols_inherit tablets-cols_inherit mobiles-cols_1 valign_top type_default stacking_default\"><div class=\"wpb_column vc_column_container\"><div class=\"vc_column-inner\"><div class=\"wpb_text_column\"><div class=\"wpb_wrapper\"><p><span style=\"font-weight: 400;\">Keyword Extraction is a technique to automatically detect important words from the text stored in a field. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">The Keyword Extraction stage in <a href=\"\/SearchAssist\/lifecycle\/manage-index\/workbench\/\" target=\"_blank\" rel=\"noopener noreferrer\">Index Pipeline<\/a> allows you to identify a set of keywords from a source field and save the identified keywords in a target field so that it can be used to identify the intention of the search user better. SearchAssist supports various NLP algorithms to extract keywords.<\/span><\/p>\n<div class=\"wpb_text_column \">\n<div class=\"wpb_wrapper\">\n<p>You can:<\/p>\n<ul>\n<li>Define a condition for the keyword extraction stage. The keywords will be extracted only from the documents that satisfy the given condition.<\/li>\n<li>Re-order or delete keyword extractions.<\/li>\n<li>Simulate the changes before saving them.<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Ensure to <strong>Train your app<\/strong> each time you make changes to any index configuration. This builds the index based on the updated configurations.<\/span><\/p>\n<\/div>\n<\/div>\n<\/div><\/div><div class=\"w-separator size_small with_line width_default thick_1 style_solid color_border align_center\"><div class=\"w-separator-h\"><\/div><\/div><div class=\"wpb_text_column\"><div class=\"wpb_wrapper\"><h2><span class=\"ez-toc-section\" id=\"Configuration\"><\/span>Configuration<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><span style=\"font-weight: 400;\">To configure keyword extraction, follow the below steps:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Click the <\/span><b>Indices <\/b><span style=\"font-weight: 400;\">tab on the top.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">On the left pane, under the <\/span><b>Index Configuration<\/b><span style=\"font-weight: 400;\"> section, click <\/span><b>Workbench<\/b><span style=\"font-weight: 400;\">.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">On the Workbench (Index Configuration) page, on the <\/span><b>Stages <\/b><span style=\"font-weight: 400;\">column, click the <\/span><b>+<\/b><span style=\"font-weight: 400;\"> icon.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">On the right column, select <\/span><b>Keyword Extraction<\/b><span style=\"font-weight: 400;\"> from the <\/span><b>Stage Type<\/b><span style=\"font-weight: 400;\"> drop-down list.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Enter a name in the <\/span><b>Stage Name<\/b><span style=\"font-weight: 400;\"> field.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Enter a condition in the <\/span><b>Condition <\/b><span style=\"font-weight: 400;\">field.\u00a0 You can add multiple conditions using the AND\/OR connectors. Documents that satisfy the condition will be executed as part of the stage. <a href=\"#Conditions\">See below for details<\/a>.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Select the field you want to extract keyword from as <\/span><b>Source Field <\/b><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Define where you want to store the extracted keyword as <\/span><b>Target<\/b> <strong>Field<\/strong>. This field is created by the application<span style=\"font-weight: 400;\">.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Choose a model from the <\/span><b>Choose Model<\/b><span style=\"font-weight: 400;\"> drop-down list. <a href=\"#Models\">See below for details<\/a>.<\/span><\/li>\n<li>Click\u00a0<b>Simulate<\/b> to verify the configurations. The simulator displays the Source and the number of documents to which the mapping was applied, and the result. You can change the Source (if not mentioned in the condition) and the number of documents.<\/li>\n<li>Once done, click\u00a0<b>Save Configuration<\/b>\u00a0on the top-right.<\/li>\n<\/ol>\n<p><a ref=\"magnificPopup\" href=\"http:\/\/docs.kore.ai\/searchassist\/wp-content\/uploads\/sites\/4\/2021\/01\/indices_keyword.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1269\" src=\"http:\/\/docs.kore.ai\/searchassist\/wp-content\/uploads\/sites\/4\/2021\/01\/indices_keyword.png\" alt=\"\" width=\"1112\" height=\"428\" \/><\/a><\/p>\n<\/div><\/div><div class=\"w-separator size_small with_line width_default thick_1 style_solid color_border align_center\"><div class=\"w-separator-h\"><\/div><\/div><div class=\"wpb_text_column\"><div class=\"wpb_wrapper\"><h3><span class=\"ez-toc-section\" id=\"Models\"><\/span><span id=\"Actions\" class=\"ez-toc-section\">Models<\/span><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>The following models are supported:<\/p>\n<ul>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\"><strong>Topic Rank<\/strong> &#8211; It is a method to extract keyphrases from the most important topics of a document.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\"><strong>Position Rank<\/strong> &#8211; It is a method to capture both highly frequent words or phrases and their position in a document.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\"><strong>Multi-partite Rank<\/strong> &#8211; It is a keyphrase extraction method that encodes topical information within a multi-partite graph structure.<\/span><\/li>\n<\/ul>\n<\/div><\/div><div class=\"wpb_text_column\"><div class=\"wpb_wrapper\"><h3><span class=\"ez-toc-section\" id=\"Conditions\"><\/span><span id=\"Conditions\" class=\"ez-toc-section\">Conditions<\/span><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Condition is of the following format:\u00a0<code>ctx.field_name==value<\/code>\u00a0or\u00a0<code>ctx.field_name!=value<\/code>. The <em>field_name<\/em>\u00a0can be obtained from the Fields table under Index Configuration.<\/p>\n<p>For example,\u00a0<code>ctx.contentType==\"web\"<\/code> to restrict the extraction from the content from a web source.<\/p>\n<\/div><\/div><div class=\"w-separator size_small with_line width_default thick_1 style_solid color_border align_center\"><div class=\"w-separator-h\"><\/div><\/div>[vc_raw_html]JTNDZGl2JTIwY2xhc3MlM0QlMjJ3LXBvc3QtZWxtJTIwcG9zdF9uYXZpZ2F0aW9uJTIwbGF5b3V0X3NpbXBsZSUyMGludl9mYWxzZSUyMiUzRSUwQSUwOSUzQ2ElMjBjbGFzcyUzRCUyMnBvc3RfbmF2aWdhdGlvbi1pdGVtJTIwb3JkZXJfZmlyc3QlMjB0b19wcmV2JTIyJTIwaHJlZiUzRCUyMiUyRlNlYXJjaEFzc2lzdCUyRm1hbmFnZS1pbmRleCUyRmZpZWxkLW1hcHBpbmclMkYlMjIlM0UlM0MlMkZwJTNFJTBBJTNDZGl2JTIwY2xhc3MlM0QlMjJwb3N0X25hdmlnYXRpb24taXRlbS1hcnJvdyUyMiUzRSUzQyUyRmRpdiUzRSUwQSUzQ2RpdiUyMGNsYXNzJTNEJTIycG9zdF9uYXZpZ2F0aW9uLWl0ZW0tbWV0YSUyMiUzRVByZXZpb3VzJTNDJTJGZGl2JTNFJTBBJTNDZGl2JTIwY2xhc3MlM0QlMjJwb3N0X25hdmlnYXRpb24taXRlbS10aXRsZSUyMiUzRSUzQ3NwYW4lM0VGaWVsZCUyME1hcHBpbmclM0MlMkZzcGFuJTNFJTNDJTJGZGl2JTNFJTBBJTNDcCUzRSUzQyUyRmElM0UlM0NiciUyMCUyRiUzRSUwQSUzQ2ElMjBjbGFzcyUzRCUyMnBvc3RfbmF2aWdhdGlvbi1pdGVtJTIwb3JkZXJfc2Vjb25kJTIwdG9fbmV4dCUyMiUyMGhyZWYlM0QlMjIlMkZTZWFyY2hBc3Npc3QlMkZtYW5hZ2UtaW5kZXglMkZ0cmFpdHMtZXh0cmFjdGlvbiUyRiUyMiUzRSUzQyUyRnAlM0UlMEElM0NkaXYlMjBjbGFzcyUzRCUyMnBvc3RfbmF2aWdhdGlvbi1pdGVtLWFycm93JTIyJTNFJTNDJTJGZGl2JTNFJTBBJTNDZGl2JTIwY2xhc3MlM0QlMjJwb3N0X25hdmlnYXRpb24taXRlbS1tZXRhJTIyJTNFTmV4dCUzQyUyRmRpdiUzRSUwQSUzQ2RpdiUyMGNsYXNzJTNEJTIycG9zdF9uYXZpZ2F0aW9uLWl0ZW0tdGl0bGUlMjIlM0UlM0NzcGFuJTNFVHJhaXRzJTIwRXh0cmFjdGlvbiUzQyUyRnNwYW4lM0UlM0MlMkZkaXYlM0UlMEElM0NwJTNFJTNDJTJGYSUzRSUzQyUyRmRpdiUzRSUwQQ==[\/vc_raw_html]<div class=\"w-separator size_small with_line width_default thick_1 style_solid color_border align_center\"><div class=\"w-separator-h\"><\/div><\/div><\/div><\/div><\/div><\/div><\/section>\n","protected":false},"excerpt":{"rendered":"Keyword Extraction is a technique to automatically detect important words from the text stored in a field. The Keyword Extraction stage in Index Pipeline allows you to identify a set of keywords from a source field and save the identified keywords in a target field so that it can be used to identify the intention...","protected":false},"author":12,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts\/130"}],"collection":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/comments?post=130"}],"version-history":[{"count":22,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts\/130\/revisions"}],"predecessor-version":[{"id":1035,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/posts\/130\/revisions\/1035"}],"wp:attachment":[{"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/media?parent=130"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/categories?post=130"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/multisite.korebots.com\/SearchAssist\/wp-json\/wp\/v2\/tags?post=130"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}