User-context-based search engine
First Claim
1. A method for classifying information, the method comprising:
- providing input text;
identifying a vocabulary list independent from the input text, the vocabulary list comprising a plurality of entries, each of the plurality of entries associated with a macro-context, wherein the macro-context comprises a vector characterizing the context of the entry by mapping a plurality of subject matters, each unique, to a corresponding plurality of weights, each weight reflecting a contribution of a corresponding subject matter of the plurality of subject matters to the entry;
counting occurrences of each term from the vocabulary list found within the input text;
calculating a macro-context representing summations of the macro-contexts associated with the terms from the vocabulary list found within the input text to characterize the context of the input text by mapping the plurality of subject matters to corresponding weights reflecting contributions of corresponding subject matters of the plurality of subject matters to the input text; and
determining a micro-context comprising a list of terms selected from the list of vocabulary that correspond to the input text.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for determining contexts of information analyzed. Contexts may be determined for words, expressions, and other combinations of words in bodies of knowledge such as encyclopedias. Analysis of use provides a division of the universe of communication or information into domains, and selects words or expressions unique to those domains of subject matter as an aid in classifying information. A vocabulary list is created with a macro-context (context vector) for each, dependent upon the number of occurrences of unique terms from a domain, over each of the domains. This system may be used to find information or classify information by subsequent inputs of text, in calculation of macro-contexts, with ultimate determination of lists of micro-contests including terms closely aligned with the subject matter.
106 Citations
20 Claims
-
1. A method for classifying information, the method comprising:
-
providing input text; identifying a vocabulary list independent from the input text, the vocabulary list comprising a plurality of entries, each of the plurality of entries associated with a macro-context, wherein the macro-context comprises a vector characterizing the context of the entry by mapping a plurality of subject matters, each unique, to a corresponding plurality of weights, each weight reflecting a contribution of a corresponding subject matter of the plurality of subject matters to the entry; counting occurrences of each term from the vocabulary list found within the input text; calculating a macro-context representing summations of the macro-contexts associated with the terms from the vocabulary list found within the input text to characterize the context of the input text by mapping the plurality of subject matters to corresponding weights reflecting contributions of corresponding subject matters of the plurality of subject matters to the input text; and determining a micro-context comprising a list of terms selected from the list of vocabulary that correspond to the input text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for searching comprising:
-
mining a repository of information to determine macro and micro-contexts for elements of a database, each macro and micro-contexts characterizing the context of an element of the database by mapping a plurality of subject matters, each unique, to a corresponding plurality of weights, each weight reflecting a contribution of a corresponding subject matter of the plurality of subject matters to the element of the database; indexing the database content according to the macro and micro-contexts determined; receiving a query from a user; determining macro and micro-contexts associated with the query, the macro and micro-contexts characterizing the context of the query by mapping the plurality of subject matters to corresponding weights reflecting contributions of corresponding subject matters of the plurality of subject matters to the query; locating in a database information having contexts related to contexts associated with a query; and presenting the information located to a user. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification