User-context-based search engine
First Claim
1. A method for creating a universal vocabulary, the method comprising:
- selecting a finite number of domains;
creating domain lists for each domain, the domain lists each comprising selected terms substantially unique to the associated domain; and
building a vocabulary list by identifying a corpus of information organized by topical entries, counting occurrences of terms from the domain list for each topical entry, and calculating a macro-context for each topical entry by generating a vector associating occurrences of domain list terms with the corresponding domain.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for determining contexts of information analyzed. Contexts may be determined for words, expressions, and other combinations of words in bodies of knowledge such as encyclopedias. Analysis of use provides a division of the universe of communication or information into domains, and selects words or expressions unique to those domains of subject matter as an aid in classifying information. A vocabulary list is created with a macro-context (context vector) for each, dependent upon the number of occurrences of unique terms from a domain, over each of the domains. This system may be used to find information or classify information by subsequent inputs of text, in calculation of macro-contexts, with ultimate determination of lists of micro-contests including terms closely aligned with the subject matter.
108 Citations
11 Claims
-
1. A method for creating a universal vocabulary, the method comprising:
-
selecting a finite number of domains;
creating domain lists for each domain, the domain lists each comprising selected terms substantially unique to the associated domain; and
building a vocabulary list by identifying a corpus of information organized by topical entries, counting occurrences of terms from the domain list for each topical entry, and calculating a macro-context for each topical entry by generating a vector associating occurrences of domain list terms with the corresponding domain.
-
-
2. A method for classifying information, the method comprising:
-
providing input text;
identifying a vocabulary list independent from the input text, the vocabulary list comprising a plurality of entries each associated with a macro-context;
counting occurrences of each term from the vocabulary list found;
calculating a macro-context representing summations of the macro-contexts associated with the terms from the vocabulary list found within the input text; and
determining a micro-context comprising a list of terms selected from the list of vocabulary that correspond to the input text. - View Dependent Claims (3)
-
-
4. A method for searching comprising:
-
mining a repository or information to determine macro and micro-contexts for elements of the database;
indexing the database content according to the macro and micro-contexts determined;
receiving a query from a user;
determining macro and micro-contexts associated with the query;
locating in a database information having contexts related to contexts associated with a query; and
presenting the information located to a user. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11)
-
Specification