Search system
First Claim
Patent Images
1. A search system, including:
- query means in a search computer for processing a query to assign respective weights to terms of said query based on the grammatical structure of the query and the meaning of the terms of the query and to generate a query vector including said weights;
index means in the search computer responsive to said query vector to output indices to data in response to said query, said index means being a self generating neural network having nodes of weight vectors representing categories and terms of said data, said nodes further including pointers to other nodes, and leaf nodes of said network each including an index to said data;
feature extraction means in the search computer for extracting said indices and respective terms of said data as term weight pairs, the weights of the pairs being based on the importance and uniqueness of component ngrams of the terms of an indexed document and the terms being extracted on the basis of the distribution of ngrams in a document space of indexed documents of said data; and
wherein said neural network is generated on the basis of training examples including said term weight pairs, and the format of said query vectors and said weight vectors of said network is generated on the basis of said training examples.
0 Assignments
0 Petitions
Accused Products
Abstract
A search engine and system for data, such as Internet web pages, including a query analyser for processing a query to assign respective weights to terms of the query and to generate a query vector including the weights, and an index network responsive to the query vector to output at least one index to data in response to the query. The index network is a self-generating neural network built using training examples derived from a feature extractor. The feature extractor is used during both the search and training phase. A clusterer is used to group search results.
310 Citations
45 Claims
-
1. A search system, including:
-
query means in a search computer for processing a query to assign respective weights to terms of said query based on the grammatical structure of the query and the meaning of the terms of the query and to generate a query vector including said weights; index means in the search computer responsive to said query vector to output indices to data in response to said query, said index means being a self generating neural network having nodes of weight vectors representing categories and terms of said data, said nodes further including pointers to other nodes, and leaf nodes of said network each including an index to said data; feature extraction means in the search computer for extracting said indices and respective terms of said data as term weight pairs, the weights of the pairs being based on the importance and uniqueness of component ngrams of the terms of an indexed document and the terms being extracted on the basis of the distribution of ngrams in a document space of indexed documents of said data; and wherein said neural network is generated on the basis of training examples including said term weight pairs, and the format of said query vectors and said weight vectors of said network is generated on the basis of said training examples. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A search engine stored on a computer readable storage medium, having stored thereon instructions for extracting terms from a query, comprising machine executable code which when executed by at least one machine, causes the machine to:
-
generate a query analyser for processing a query to assign respective weights to terms of said query based on the grammatical structure of the query and the meaning of the terms of the query and to generate a query vector including said weights; generate an index network responsive to said query vector to output indices to data in response to said query, said index network being a self generating neural network having nodes of weight vectors representing categories and terms of said data, said nodes further including pointers to other nodes, and leaf nodes of said network each including an index to said data; and generate a feature extractor for extracting said indices and respective terms of said data as term weight pairs, the weights of the pairs being based on the importance and uniqueness of component ngrams of the terms of an indexed document and the terms being extracted on the basis of the distribution of ngrams in a document space of index documents of said data; and wherein said neural network is generated on the basis of training examples including said term weight pairs, and the format of said query vectors and weights vectors of said network is generated on the basis of said training examples. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45)
-
Specification