Systems and methods for identifying similarities using unstructured text analysis
First Claim
1. A system to perform unstructured text analysis comprising:
- processing circuitry communicatively coupled to a memory, the memory including instructions stored thereon which, when executed by processing circuitry, cause the processing circuitry to implement modules comprising;
a user interface module to receive, through a user interface, structured data and deconstruct the structured data to create unstructured text, the structured data comprising user responses input into respective predefined prompts on the user interface and metadata for each response of the user responses, the unstructured text comprising only text corresponding to the user responses input into the respective predefined prompts;
an application server module to receive the unstructured text, create a first word cloud using only the unstructured text, create a query based on the first word cloud, and create a boosted query by increasing a weight of a word in the query based on the metadata;
a search platform module to execute the boosted query on a database, analyze a specified number of records returned from executing the boosted query, and provide data from the specified number of records determined to include data most similar to the first word cloud based on the boosted query; and
wherein the application server module is further to create a plurality of second word clouds, one second word cloud for each of the specified number of records, determine a similarity value indicating how similar the first word cloud is to each of the second word clouds, and provide a similarity indicator to the user interface that indicates how similar a particular record of the specified number of records is to the text received at the user interface based on the determined similarity value.
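The deconstruction and boosting steps recited in this claim can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the record layout (`prompt`, `response`, `metadata`), the hypothetical `important` metadata flag, the stop-word list, the use of plain word counts as the word cloud, and the `term^weight` notation, which is borrowed from Lucene-style query strings. The claim itself does not specify any of these.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the claim does not define one.
STOP_WORDS = {"a", "an", "and", "in", "near", "of", "the", "to"}

# Hypothetical structured input: one entry per predefined prompt.
SAMPLE = [
    {"prompt": "What failed?", "response": "pump seal leak",
     "metadata": {"important": True}},
    {"prompt": "Where?", "response": "leak near the valve",
     "metadata": {"important": False}},
]

def tokenize(text):
    """Lowercase the text and keep alphanumeric words that are not stop words."""
    return [w for w in re.findall(r"[a-z0-9]+", text.lower())
            if w not in STOP_WORDS]

def deconstruct(structured):
    """Keep only the response text, dropping prompts and metadata."""
    return " ".join(entry["response"] for entry in structured)

def word_cloud(text):
    """Represent a word cloud as word -> occurrence count (an assumption)."""
    return Counter(tokenize(text))

def build_boosted_query(structured, boost=2.0):
    """Build a query from the word cloud, then raise the weight of words
    whose source response carries the (hypothetical) 'important' flag."""
    cloud = word_cloud(deconstruct(structured))
    boosted = set()
    for entry in structured:
        if entry["metadata"].get("important"):
            boosted.update(tokenize(entry["response"]))
    terms = []
    # Most frequent words first; ties broken alphabetically for stable output.
    for word, count in sorted(cloud.items(), key=lambda kv: (-kv[1], kv[0])):
        weight = count * boost if word in boosted else count
        terms.append(f"{word}^{weight:g}")
    return " OR ".join(terms)
```

Note that the word cloud is built from the response text alone, while the metadata only influences the weights afterwards, mirroring the separation between "using only the unstructured text" and "based on the metadata" in the claim language.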
Abstract
Generally discussed herein are systems, devices, and methods for unstructured text analysis. A method can include deconstructing structured data to create unstructured text, creating a first word cloud using the unstructured text, creating a query based on the first word cloud, receiving data corresponding to contents of a specified number of records determined to include data most similar to the first word cloud in a database of records, creating a second word cloud for each of the specified number of records using the data from the specified number of records, determining similarity values indicating how similar the first word cloud is to each of the second word clouds, and providing a similarity indicator for each record of the specified number of records to a user interface, the similarity indicator representing a relative magnitude of the determined similarity values of the specified number of records.
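The abstract does not say how a similarity value between two word clouds is computed. As one possible reading, the sketch below treats each word cloud as a bag of word counts and uses cosine similarity, a common measure chosen here as an assumption and not taken from the patent. The function names are hypothetical.

```python
import math
from collections import Counter

def cloud_from_text(text):
    """Build a simple word cloud (word -> count) by whitespace splitting."""
    return Counter(text.lower().split())

def cosine_similarity(first, second):
    """Cosine of the angle between two word-count vectors, in [0, 1]."""
    shared = first.keys() & second.keys()
    dot = sum(first[w] * second[w] for w in shared)
    norm_first = math.sqrt(sum(c * c for c in first.values()))
    norm_second = math.sqrt(sum(c * c for c in second.values()))
    if norm_first == 0 or norm_second == 0:
        return 0.0  # an empty cloud is treated as similar to nothing
    return dot / (norm_first * norm_second)

def similarity_values(first_cloud, record_texts):
    """Compare the first word cloud against one second cloud per record."""
    return {record_id: cosine_similarity(first_cloud, cloud_from_text(text))
            for record_id, text in record_texts.items()}
```

For example, `similarity_values(cloud_from_text("pump leak leak"), {"r1": "leak valve", "r2": "motor noise"})` yields a positive value for `r1`, which shares the word "leak", and 0.0 for `r2`, which shares no words.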
76 Citations
18 Claims
1. A system to perform unstructured text analysis, recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A method for unstructured text analysis comprising:
- deconstructing structured data provided through a user interface to create unstructured text, the structured data comprising user responses input into respective predefined prompts on the user interface and metadata for each response of the user responses, the unstructured text comprising only text corresponding to the user responses input into the respective predefined prompts;
creating, using an application server, a first word cloud using only the unstructured text;
creating, using the application server, a query based on the first word cloud;
creating a boosted query by increasing a weight of a word in the query based on the metadata;
receiving, in response to issuing the boosted query, data corresponding to contents of a specified number of records determined to include data most similar to the first word cloud in a database of records, the database of records including records of product defects, customer complaints, and part defects;
creating a second word cloud for each of the specified number of records using the data from the specified number of records;
determining similarity values indicating how similar the first word cloud is to each of the second word clouds; and
providing a similarity indicator for each record of the specified number of records to a user interface, the similarity indicator representing a relative magnitude of the determined similarity values of the specified number of records.
Dependent claims: 12, 13, 14
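The final step of this method provides an indicator that represents the relative magnitude of the similarity values across the returned records. One way this could work, sketched under assumptions, is to scale every value against the largest one and map it onto a small number of display levels. The five-level scale, the rounding rule, and the function name are hypothetical choices and do not come from the claim.

```python
import math

def similarity_indicators(values, levels=5):
    """Map each record's similarity value to an integer level in 0..levels,
    scaled relative to the highest value among the returned records."""
    top = max(values.values(), default=0.0)
    if top <= 0:
        # No record resembles the input at all; show the lowest level.
        return {record_id: 0 for record_id in values}
    # Round up so any nonzero similarity shows at least one level.
    return {record_id: math.ceil(value / top * levels)
            for record_id, value in values.items()}
```

With values of 0.8, 0.4, 0.2, and 0.0, the records map to levels 5, 3, 2, and 0, so the best match always receives the full indicator regardless of its absolute score.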
15. A machine-readable storage device including instructions stored thereon which, when executed by a machine, cause the machine to perform operations for unstructured text analysis, the operations comprising:
- creating a first word cloud using only unstructured text of structured data provided by a user through a user interface, the structured data comprising user responses input into respective predefined prompts on the user interface and metadata for each response of the user responses, the unstructured text comprising only text corresponding to the user responses input into the respective predefined prompts;
creating a query based on the first word cloud;
creating a boosted query by increasing a weight of a word in the query based on the metadata;
receiving, in response to issuing the boosted query, data corresponding to the contents of a specified number of records determined to include data most similar to the first word cloud in a database of records;
creating second word clouds using the data from the specified number of records;
determining similarity values indicating how similar the first word cloud is to each of the second word clouds; and
providing a similarity indicator for each record of the specified number of records to a user interface, the similarity indicator representing a relative magnitude of the determined similarity values of the specified number of records.
Dependent claims: 16, 17, 18
Specification