DISCOVERY ENGINE
0 Assignments
0 Petitions
Accused Products
Abstract
A method that is relatively inexpensive to implement and that permits a user to conduct searches of electronically stored documents using an entire document, multiple documents or portions of a document as the search criteria and to collect, store and to share the relevant documents from the search.
-
Citations
41 Claims
-
1-26. -26. (canceled)
-
27. :
- A method of semantically searching documents in a way that improves the efficiency of computer resources, comprising;
indexing by a processor a data set of documents having words by counting the words in the entire data set and determining a first frequency score and a first uniqueness score for each word of the data set of documents; receiving a user input by the processor of a document of interest; determining by the processor a second frequency score and a second uniqueness score for each word in the document of interest; generating by the processor a respective similarity score for the document of interest compared to each of the documents in the data set of documents in a flat manner by comparing the second frequency score and the second uniqueness score for each word in the document of interest to the first frequency score and the first uniqueness score for each word of the data set of documents; and presenting by the processor the most similar documents from the data set to the document of interest using the respective similarity score for the document of interest compared to each of the documents in the data set of documents. - View Dependent Claims (28, 29, 30, 31)
- A method of semantically searching documents in a way that improves the efficiency of computer resources, comprising;
-
32. :
- A system for semantically searching documents to improve efficiency of computer resources, comprising;
a memory containing a set of instructions; and a processor for processing the set of instructions, wherein the instructions cause the processor to perform a method comprising; indexing a data set of documents having words by counting the words in the entire data set and determining a first frequency score and a first uniqueness score for each word of the data set of documents; receiving a user input of a document of interest; determining a second frequency score and a second uniqueness score for each word in the document of interest; generating a respective similarity score for the document of interest compared to each of the documents in the data set of documents in a flat manner by comparing the second frequency score and the second uniqueness score for each word in the document of interest to the first frequency score and the first uniqueness score for each word of the data set of documents; and presenting the most similar documents from the data set to the document of interest using the respective similarity score for the document of interest compared to each of the documents in the data set of documents. - View Dependent Claims (33, 34, 35, 36)
- A system for semantically searching documents to improve efficiency of computer resources, comprising;
-
37. :
- A non-transitory computer-readable medium having tangibly embodied thereon and accessible therefrom processor-executable instructions that, when executed by at least one data processing device of at least one computer, causes said at least one data processing device to perform a method comprising;
indexing a data set of documents having words by counting the words in the entire data set and determining a first frequency score and a first uniqueness score for each word of the data set of documents; receiving a user input of a document of interest; determining a second frequency score and a second uniqueness score for each word in the document of interest; generating a respective similarity score for the document of interest compared to each of the documents in the data set of documents in a flat manner by comparing the second frequency score and the second uniqueness score for each word in the document of interest to the first frequency score and the first uniqueness score for each word of the data set of documents; and presenting the most similar documents from the data set to the document of interest using the respective similarity score for the document of interest compared to each of the documents in the data set of documents. - View Dependent Claims (38, 39, 40, 41)
- A non-transitory computer-readable medium having tangibly embodied thereon and accessible therefrom processor-executable instructions that, when executed by at least one data processing device of at least one computer, causes said at least one data processing device to perform a method comprising;
Specification