System for identifying word patterns in text
First Claim
1. A system for identifying objects referenced in a stream of text, the system comprising:
- an input pipeline configured to receive an incoming stream of text comprised of words;
- a text analysis module configured to consult a semantic network to automatically identify one or more word patterns in the incoming stream of text, such that each word in the incoming stream is searched once in the semantic network, to break the stream of text into individual words, to analyze each word in an order of occurrence of the word in the stream of text by comparing the individual words to identified words in the semantic network, and to, upon finding a match between an individual word in the stream of text and an identified word in the semantic network, compare the individual word and an adjacent word of the stream of text to a word pattern in the semantic network; and
- an object association module configured to reference a known object within the semantic network, the known object identified by a word pattern of the semantic network.
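The three elements of the claim can be illustrated with a minimal sketch. It is not the patented implementation: the dictionary standing in for the semantic network, the object identifiers, and the function names (receive_text, analyze, associate) are all hypothetical, and patterns are limited to one or two words for brevity. Each word is looked up once, in order of occurrence, and a match triggers a comparison against the adjacent word.

```python
from typing import Dict, Iterable, Iterator, List, Optional, Tuple

# Hypothetical stand-in for the semantic network: an identified word maps to
# the patterns it starts, as (adjacent word or None, object id) pairs.
# None means the word alone is a complete pattern.
NETWORK: Dict[str, List[Tuple[Optional[str], str]]] = {
    "new": [("york", "obj:new_york_city")],
    "paris": [(None, "obj:paris")],
}

# Hypothetical table of known objects, each stored once.
OBJECTS: Dict[str, str] = {
    "obj:new_york_city": "New York City",
    "obj:paris": "Paris",
}


def receive_text(stream: Iterable[str]) -> Iterator[str]:
    """Input pipeline: pass incoming chunks of text through unchanged."""
    for chunk in stream:
        yield chunk


def analyze(chunks: Iterable[str]) -> Iterator[str]:
    """Text analysis: split into words and match them in order of occurrence."""
    pending: List[Tuple[str, str]] = []  # patterns waiting on the adjacent word
    for chunk in chunks:
        for raw in chunk.split():
            word = raw.strip(".,;:!?").lower()
            if not word:
                continue
            # Complete any two-word pattern begun by the previous word.
            for adjacent, obj_id in pending:
                if adjacent == word:
                    yield obj_id
            # Single lookup of the current word in the network.
            pending = []
            for adjacent, obj_id in NETWORK.get(word, []):
                if adjacent is None:
                    yield obj_id
                else:
                    pending.append((adjacent, obj_id))


def associate(obj_id: str) -> str:
    """Object association: resolve an identified pattern to its known object."""
    return OBJECTS[obj_id]


found = [associate(o) for o in analyze(receive_text(["I flew from Paris", "to New York."]))]
```

Because analyze carries the pending patterns across chunks, a pattern split over two chunks of the stream ("to New" followed by "York.") would still be matched.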
Abstract
A system for identifying word patterns in text operates in real time and is highly suitable for network and Internet use. The system comprises a semantic network, which may be compiled on a local computer or at a remote host, and a software text analysis module for receiving the text to be analyzed, parsing the text, submitting the text to the semantic network, and receiving the results. Recognized words are then examined, together with surrounding words in the text, to determine whether the words are part of a word pattern. Word patterns are located at nodes in the semantic network in a hierarchical structure, and certain word patterns correspond to objects of the semantic network. When all word patterns involving a word are located, links are followed to objects corresponding to the word patterns. Several nodes may point to a single object, but each object is represented only once in the semantic network. Objects may thus be identified in real time, as the text streams through the text analysis module.
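One way to picture the hierarchical structure described in the abstract is a word-level prefix tree whose nodes link to shared objects. The sketch below is an assumption-laden illustration, not the structure disclosed in the specification: the Node and KnownObject classes, the add_pattern and identify functions, and the sample patterns are all hypothetical. It shows several nodes pointing to one object that is stored only once, and objects being emitted as the words stream past.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, Iterator, List, Optional


@dataclass
class KnownObject:
    name: str


@dataclass
class Node:
    # Child nodes keyed by the next word of a pattern.
    children: Dict[str, "Node"] = field(default_factory=dict)
    # Link to an object when the path from the root spells a complete pattern.
    obj: Optional[KnownObject] = None


def add_pattern(root: Node, pattern: str, obj: KnownObject) -> None:
    """Store a word pattern as a path of nodes ending in a link to obj."""
    node = root
    for word in pattern.lower().split():
        node = node.children.setdefault(word, Node())
    node.obj = obj


def identify(root: Node, words: Iterable[str]) -> Iterator[KnownObject]:
    """Yield objects as soon as a pattern completes, while words stream in."""
    active: List[Node] = []  # partially matched patterns
    for word in words:
        word = word.lower()
        next_active: List[Node] = []
        # Try to extend each partial match, and also start a new one at root.
        for node in active + [root]:
            child = node.children.get(word)
            if child is not None:
                if child.obj is not None:
                    yield child.obj
                next_active.append(child)
        active = next_active


root = Node()
nyc = KnownObject("New York City")
nyt = KnownObject("The New York Times")
add_pattern(root, "New York City", nyc)
add_pattern(root, "Big Apple", nyc)  # a second node linked to the same object
add_pattern(root, "New York Times", nyt)

results = list(identify(root, "the big apple and the new york times".split()))
```

Both "New York City" and "Big Apple" end at nodes that reference the same nyc instance, so the object exists once however many patterns lead to it. Because identify is a generator, each object is available to the caller as soon as its pattern completes, without waiting for the end of the text.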
45 Citations
8 Claims
Claim 1, recited in full under First Claim above, is the only independent claim; claims 2 through 8 depend on it.
Specification