Lexicon-based new idea detector
First Claim
1. A computer-implemented method for detecting new ideas within symbolic representations pertaining to a domain of endeavor, comprising:
- accessing the symbolic representations pertaining to the domain of endeavor to detect a symbol contained within the symbolic representations that had been previously identified as not being found within a base lexicon of symbols associated with the domain of endeavor;
retrieving a symbol from the symbolic representations;
searching a base lexicon of symbols associated with the domain of endeavor for an instance of the symbol;
if the instance of the symbol is not found in the base lexicon of symbols associated with the domain of endeavor, then performing the steps of;
presenting the symbol to a user as a new symbol;
receiving input from the user indicative of whether the new symbol should be tracked;
if the input received from the user indicates that the new symbol should not be tracked, then adding the symbol to the base lexicon of symbols associated with the domain of endeavor; and
if the input received from the user indicates that the new symbol should be tracked, then performing the steps of;
accumulating data indicative of a spread of multiple instances of the symbol throughout the domain of endeavor;
determining whether the spread of multiple instances of the symbol throughout the domain of endeavor exceeds a threshold; and
if the spread of multiple instances of the symbol throughout the domain of endeavor exceeds a threshold, then outputting an indication based on the symbol to a user that a new idea within the domain of endeavor has been detected.
0 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for detecting the occurrence of new ideas in documents or communications. The method is comprised of three processes. The first process lexiconizes all words or symbols in a set of documents. The second process compares all words in a second set of documents to the words in the lexicon. Words not already in the lexicon are presented to a user who takes one of two courses of action, 1) lexiconizes the word, or, 2) declares it a “fad” indicating that the word is to be further analyzed. The third process measures the spatial and temporal spread of said fad by searching a third set of documents and computing metrics based on additional occurrences of said fad, said metrics being used to determine when a fad has achieved a level of interest denoted as a category. When a category is detected, a user is notified.
48 Citations
18 Claims
-
1. A computer-implemented method for detecting new ideas within symbolic representations pertaining to a domain of endeavor, comprising:
-
accessing the symbolic representations pertaining to the domain of endeavor to detect a symbol contained within the symbolic representations that had been previously identified as not being found within a base lexicon of symbols associated with the domain of endeavor; retrieving a symbol from the symbolic representations; searching a base lexicon of symbols associated with the domain of endeavor for an instance of the symbol; if the instance of the symbol is not found in the base lexicon of symbols associated with the domain of endeavor, then performing the steps of; presenting the symbol to a user as a new symbol; receiving input from the user indicative of whether the new symbol should be tracked; if the input received from the user indicates that the new symbol should not be tracked, then adding the symbol to the base lexicon of symbols associated with the domain of endeavor; and if the input received from the user indicates that the new symbol should be tracked, then performing the steps of; accumulating data indicative of a spread of multiple instances of the symbol throughout the domain of endeavor; determining whether the spread of multiple instances of the symbol throughout the domain of endeavor exceeds a threshold; and if the spread of multiple instances of the symbol throughout the domain of endeavor exceeds a threshold, then outputting an indication based on the symbol to a user that a new idea within the domain of endeavor has been detected. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer-readable medium bearing instructions for detecting new ideas within symbolic representations pertaining to a domain of endeavor, said instructions, when executed, arrange to cause a computer to perform the steps of:
-
accessing the symbolic representations pertaining to the domain of endeavor to detect a symbol contained within the symbolic representations that had been previously identified as not being found within a base lexicon of symbols associated with the domain of endeavor; retrieving a symbol from the symbolic representations; searching a base lexicon of symbols associated with the domain of endeavor for an instance of the symbol; if the instance of the symbol is not found in the base lexicon of symbols associated with the domain of endeavor, then performing the steps of; presenting the symbol to a user as a new symbol; receiving input from the user indicative of whether the new symbol should be tracked; if the input received from the user indicates that the new symbol should not be tracked, then adding the symbol to the base lexicon of symbols associated with the domain of endeavor; and if the input received from the user indicates that the new symbol should be tracked, then performing the steps of; accumulating data indicative of a spread of multiple instances of the symbol throughout the domain of endeavor; determining whether the spread of multiple instances of the symbol throughout the domain of endeavor exceeds a threshold; and if the spread of multiple instances of the symbol throughout the domain of endeavor exceeds a threshold, then outputting an indication based on the symbol to a user that a new idea within the domain of endeavor has been detected. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer-implemented method for detecting new ideas within symbolic representations pertaining to a domain of endeavor, comprising:
-
accessing the symbolic representations pertaining to the domain of endeavor, wherein the symbolic representations pertaining to the domain of endeavor include contents of an internet web site reachable within a specified number of indirections from an Internet Protocol (IP) address, contents of transcripts of verbal communications, or electronic representations of written communications; retrieving a symbol from the symbolic representations;
wherein the symbol includes a word, a neologism, an acronym, an abbreviation, or a string of words with a separator;searching a base lexicon of symbols associated with the domain of endeavor for an instance of the symbol; if the instance of the symbol is not found in the base lexicon of symbols associated with the domain of endeavor, then performing the steps of; presenting the symbol to a user as a new symbol; receiving input from the user indicative of whether the new symbol should be tracked; if the input received from the user indicates that the new symbol should not be tracked, then adding the symbol to the base lexicon of symbols associated with the domain of endeavor; and if the input received from the user indicates that the new symbol should be tracked, then performing the steps of; accumulating data indicative of a spread of multiple instances of the symbol throughout the domain of endeavor; determining whether the spread of multiple instances of the symbol throughout the domain of endeavor exceeds a threshold; and if the spread of multiple instances of the symbol throughout the domain of endeavor exceeds a threshold, then outputting an indication based on the symbol to a user that a new idea within the domain of endeavor has been detected. - View Dependent Claims (18)
-
Specification