Search System
0 Assignments
0 Petitions
Accused Products
Abstract
A search engine and system for data, such as Internet web pages, including a query analyser for processing a query to assign respective weights to terms of the query and to generate a query vector including the weights, and an index network responsive to the query vector to output at least one index to data in response to the query. The index network is a self-generating neural network built using training examples derived from a feature extractor. The feature extractor is used during both the search and training phase. A clusterer is used to group search results.
54 Citations
99 Claims
-
1-54. -54. (canceled)
-
55. A search system, including:
-
query means for processing a query to assign respective weights to terms of said query based on the grammatical structure of the query and the meaning of the terms of the query and to generate a query vector including said weights; index means responsive to said query vector to output at least one index to data in response to said query, said index means being a self generating neural network having nodes of weight vectors representing categories and terms of said data, said nodes further including pointers to other nodes, and leaf nodes of said network including an index to said data; feature extraction means for extracting indices and respective terms of said data as term weight pairs, the weights of the pairs being based on the importance and uniqueness of component ngrams of the terms of an indexed document and the terms being extracted on the basis of the distribution of ngrams in a document space of indexed documents of said data; and wherein said neural network is generated on the basis of training examples including said term weight pairs, and the format of said query vectors and said weight vectors of said network is generated on the basis of said training examples. - View Dependent Claims (56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 98)
-
-
76. A search engine stored on a computer readable storage medium, including:
-
a query analyser for processing a query to assign respective weights to terms of said query based on the grammatical structure of the query and the meaning of the terms of the query and to generate a query vector including said weights; an index network responsive to said query vector to output at least one index to data in response to said query, said index network being a self generating neural network having nodes of weight vectors representing categories and terms of said data, said nodes further including pointers to other nodes, and leaf nodes of said network including an index to said data; and a feature extractor for extracting indices and respective terms of said data as term weight pairs, the weights of the pairs being based on the importance and uniqueness of component ngrams of the terms of an indexed document and the terms being extracted on the basis of the distribution of ngrams in a document space of index documents of said data; and wherein said neural network is generated on the basis of training examples including said term weight pairs, and the format of said query vectors and weights vectors of said network is generated on the basis of said training examples. - View Dependent Claims (77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 99)
-
Specification