System and method for natural language processing and using ontological searches
1 Assignment
0 Petitions
Abstract
A search engine is disclosed that utilizes natural language processing (NLP) techniques. The search engine utilizes meaning-based natural language processing using ontological semantics in analyzing the meaning of queries and the searched text. This system analyzes Web pages and queries. The NLP method produces equivalent meanings for a sequence of user-initiated words, wherein relevance parsing of the original query produces a display of queries/questions as hot links to the next round of searching without additional typing by the user.
62 Citations
3 Claims
1. A method performed on a computer for data searching, comprising:
collecting data utilizing a computer network;
parsing the data by language type;
expanding the data using off-line ontological semantics;
categorizing the data and expanded data by anticipated queries;
a processor scoring the data and the expanded data based on anticipated queries;
receiving a query;
parsing the query for question type and expanding the query using ontological semantics;
comparing the parsed and expanded user query with the anticipated queries; and
providing the matching data and the expanded data based in part on the comparison by adding a weight for each word and dividing by a total number of words, wherein the scoring is calculated such that
[equation not reproduced in this text]
wherein M is a number of significant words in the data, R is 1 for each word found in the data, and 0 otherwise, W is the weight for each word and Sx is a presence score;
storing the data and the expanded data based on anticipated queries;
receiving a query;
parsing the query for question type and expanding the query using ontological semantics;
the processor comparing the parsed and expanded user query with the anticipated queries; and
providing the matching data and the expanded data based in part on the comparison.
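Claim 1 describes its presence score in words as adding a weight for each word and dividing by a total number of words, but the claim's own equation is not reproduced in this text. The following is only a minimal sketch of one possible reading, assuming Sx = (1/M) * sum(R * W) taken over a list of M significant words. The function name `presence_score`, the default weight, and the sample words and weights are hypothetical and do not come from the patent.

```python
def presence_score(significant_words, weights, data_words):
    """Assumed reading of the claim 1 wording: Sx = (1/M) * sum(R * W).

    significant_words: the words being checked (M is taken as their count,
                       which is an assumption; the claim wording is ambiguous)
    weights:           mapping word -> W, the weight for each word
    data_words:        the words of the data being scored
    """
    m = len(significant_words)  # M
    if m == 0:
        return 0.0
    found = set(data_words)
    total = 0.0
    for word in significant_words:
        r = 1 if word in found else 0  # R: 1 if the word is found, else 0
        w = weights.get(word, 1.0)     # W: hypothetical default of 1.0
        total += r * w
    return total / m


# Hypothetical example: two of three weighted words appear in the data.
example = presence_score(
    ["ontology", "search", "parser"],
    {"ontology": 2.0, "search": 1.0, "parser": 3.0},
    "a search engine built on an ontology".split(),
)
```

Under this reading, a word that is absent from the data contributes nothing to the sum but still counts toward M, so missing words lower the score.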
2. A method performed on a computer for data searching, comprising:
collecting data utilizing a computer network;
parsing the data by language type;
expanding the data using off-line ontological semantics;
categorizing the data and expanded data by anticipated queries;
scoring the data based on the parsed data;
a processor scoring the data based on a universal resource locator and title of the data, wherein the universal resource locator scoring is calculated such that:
[equation not reproduced in this text]
wherein J is a number of characters of query words found in a domain in the universal resource locator, Z is a total number of characters in the domain, M is a length of the query, c is a credibility factor, Ss is a sentence score and Sdom is the universal resource locator scoring;
scoring the data based on the query;
storing the data and the expanded data based on anticipated queries;
receiving a query;
parsing the query for question type and expanding the query using ontological semantics;
the processor comparing the parsed and expanded user query with the anticipated queries; and
providing the matching data and the expanded data based in part on the comparison.
3. A method performed on a computer for data searching, comprising:
collecting data utilizing a computer network;
parsing the data by language type;
expanding the data using off-line ontological semantics;
categorizing the data and expanded data by anticipated queries;
scoring the data based on the parsed data;
scoring the data based on a universal resource locator and title of the data;
a processor scoring the data based on the query, wherein the score is degraded by distance between data words and an order of the data words such that:
[equation not reproduced in this text]
wherein M is a length of the query, Sd is a pairwise distance, So is a pairwise order score, Sw is a pairwise weighting score, and St is a pairwise total score;
storing the data and the expanded data based on anticipated queries;
receiving a query;
parsing the query for question type and expanding the query using ontological semantics;
the processor comparing the parsed and expanded user query with the anticipated queries; and
providing the matching data and the expanded data based in part on the comparison.
Specification