Dynamic comparison of search systems in a controlled environment
First Claim
1. A software program on a computer usable storage medium for performing a computer controlled method of determining the effectiveness of a search engine in performing searches in a search area comprising:
- computer code generating controlled relevant documents by manipulation of relevant search terms of documents in the search area based on a relevancy algorithm;
wherein the relevancy algorithm relates to positions of relevant search terms in documents and the generation of the controlled relevant documents is done by placing search terms into various positions in the document in accordance with the relevancy algorithm and wherein at least 3 different controlled relevant documents are produced for a given relevancy algorithm with search terms placed in different positions in different documents with each document compared with the document relevant to the search area with absence of search terms in that document to generate the relevancy vector chart;
using the controlled relevant documents to create a relevancy vector chart; and
comparing results obtained from the search engine to the relevancy vector chart to provide a relevancy rating for the search engine with computer code using the relevancy vector chart to obtain a relative ranking of a plurality of search engines.
1 Assignment
0 Petitions
Accused Products
Abstract
A random document is stripped of the relevant search terms to generate a non-relevant document. The relevant search terms are formed into grammatically correct but not necessarily technically correct sentences. The grammatically correct sentences are placed at the beginning of the random document in one pass through the system in the middle of a document in a second pass through the system and at the end of the document in a third pass through the system. A relevancy vector chart is computed using the references documents and a known relevancy algorithm relating to position of search terms. The results obtained from search engines are compared to the relevancy vector chart to determine the relative relevancy of the returned search results from the search engines.
-
Citations
11 Claims
-
1. A software program on a computer usable storage medium for performing a computer controlled method of determining the effectiveness of a search engine in performing searches in a search area comprising:
-
computer code generating controlled relevant documents by manipulation of relevant search terms of documents in the search area based on a relevancy algorithm;
wherein the relevancy algorithm relates to positions of relevant search terms in documents and the generation of the controlled relevant documents is done by placing search terms into various positions in the document in accordance with the relevancy algorithm and wherein at least 3 different controlled relevant documents are produced for a given relevancy algorithm with search terms placed in different positions in different documents with each document compared with the document relevant to the search area with absence of search terms in that document to generate the relevancy vector chart;using the controlled relevant documents to create a relevancy vector chart; and comparing results obtained from the search engine to the relevancy vector chart to provide a relevancy rating for the search engine with computer code using the relevancy vector chart to obtain a relative ranking of a plurality of search engines. - View Dependent Claims (2, 3)
-
-
4. A method of determining the relative effectiveness of search engines comprising the steps of:
-
selecting documents from a database; generating from the selected documents controlled relevant documents by manipulation of relevant terms in the selected documents based on a relevancy algorithm;
wherein the relevancy algorithm relates to positions of the relevant terms in documents and the generation of the controlled relevant documents is done by placing search terms into various positions in the controlled relevant documents in accordance with the relevant algorithm;using the generated controlled relevant documents to create relevancy reference vectors; searching the database using each of the search engines and generating performance vectors with the search results for each engine; and comparing the performance vectors for each engine to the relevancy reference vectors to provide a relevancy rating for the search engines based on the relevancy algorithm using a relevancy vector chart made from the relevancy reference vectors to obtain a relative ranking of each of the search engines wherein at least 3 different controlled relevant documents are produced for each selected document using a given relevancy algorithm with search terms placed in 3 different positions in different documents with each document compared with the document relevant to the search area with an absence of search terms in the document to generate the relevancy reference vectors of the relevancy vector chart. - View Dependent Claims (5)
-
-
6. A method of determining the relative effectiveness of search engines in performing searches in a search area comprising:
-
generating a controlled relevant document by the arrangement of relevant search terms of a selected document in the search area in accordance with a relevancy algorithm; wherein the relevancy algorithm relates to positions of relevant search terms in documents and the generation of the controlled relevant documents is done by placing search terms into various positions in the document in accordance with the relevancy algorithm and wherein at least 3 different controlled relevant documents are produced for a given relevancy algorithm with search terms placed in different positions in different documents with each document compared with the document relevant to the search area with absence of search terms in that document to generate the relevancy vector chart; interrogating a database containing the search area with the search engines; comparing by vector analysis, references obtained from the search engines with the controlled relevant document to provide a relevancy rating for the search engine to determine the relative effectiveness of the search engines; generating vectors Vr for the control relevant document and VSEj for search results of each search engine SEj (J=1, 2, . . . n); computing the distance d(VSEj Vr) between VSEj and Vr for each search engine; computing a score Sj=100/1+d(VSEj, Vr) for each search engine; assigning the score Sj to each search engine SEj; and comparing the search engines SE1, SE2, . . . SEn based on their scores Si, S2, . . . Sn. - View Dependent Claims (7, 8, 9, 10, 11)
-
Specification