FULL TEXT QUERY AND SEARCH SYSTEMS AND METHODS OF USE
0 Assignments
0 Petitions
Accused Products
Abstract
The invention is a method for textual searching of text-based databases including databases of compiled internet content, scientific literature, abstracts for books and articles, newspapers, journals, and the like. Specifically, the algorithm supports searches using full-text or webpage as query and keyword searches allowing multiple entries and an information-content based ranking system (Shannon Information score) that uses p-values to represent the likelihood that a hit is due to random matches. Additionally, users can specify the parameters that determine hits and their ranking with scoring based on phrase matches and sentence similarities.
64 Citations
35 Claims
-
1-28. -28. (canceled)
-
29. A data processing system comprisinga database of string entries,a routine for processing the string entries, the routine selected from the group consisting of calculating a frequency distribution of string entries, associating an external frequency distribution with string entries in the database, and associating an external probability distribution with a collection of string entries in the database,
and 3) a routine for analyzing the database using the distribution.
Specification