Text analysis using phrase definitions and containers
First Claim
Patent Images
1. One or more computer-readable storage devices that store executable instructions that, when executed by a computer, cause the computer to perform acts comprising:
- receiving a specification of a text pattern to be matched in an input stream, said specification comprising one or more data expressions and one or more code expressions, the data expressions defining one or more phrase sets, a first one of the code expressions comprising a first container that comprises a first phrase set, a second phrase set, and a second container that comprises a third phrase set and a fourth phrase set, said specification further comprising a map set and an equivalence set that maps a plurality of phrases to a first phrase based on misspellings and equivalent phrases, said first container comprising a switch that branches between a plurality of match labels;
comparing words in said input stream with said first container, said comparing act converting, based on said map set, any of said plurality of phrases to said first phrase when any of said plurality of phrases is encountered in said input stream;
determining that a context in said input stream matches said first container based on a finding that said context matches either said first phrase set or said second phrase set, said first phrase set, said context being labeled as a first label if said context matches said first phrase set, or being labeled as a second label if said context matches said second phrase set; and
generating a report indicating that said container matches a sequence of words in said input stream.wherein a container represents the various ways in which possible phrase sets may be combined as part of a text pattern to be matched such that the context moves forward after a match has been found between the container and a sequence of words in the input stream and a next container is processed after a location of the matching sequence of words.
2 Assignments
0 Petitions
Accused Products
Abstract
In one example, a phrase analyzer may analyze a text input stream to identify phrases contained in the text input stream. The phrase analyzer may receive a specification, which includes dictionaries of phrases and synonyms, and a specification of the phrases, or sequences of phrases to be matched. The phrase analyzer may compare the input stream to the specification and may produce, as output, an identification of which phrases appear in the input stream, and where in the input stream those phrases occur.
-
Citations
20 Claims
-
1. One or more computer-readable storage devices that store executable instructions that, when executed by a computer, cause the computer to perform acts comprising:
-
receiving a specification of a text pattern to be matched in an input stream, said specification comprising one or more data expressions and one or more code expressions, the data expressions defining one or more phrase sets, a first one of the code expressions comprising a first container that comprises a first phrase set, a second phrase set, and a second container that comprises a third phrase set and a fourth phrase set, said specification further comprising a map set and an equivalence set that maps a plurality of phrases to a first phrase based on misspellings and equivalent phrases, said first container comprising a switch that branches between a plurality of match labels; comparing words in said input stream with said first container, said comparing act converting, based on said map set, any of said plurality of phrases to said first phrase when any of said plurality of phrases is encountered in said input stream; determining that a context in said input stream matches said first container based on a finding that said context matches either said first phrase set or said second phrase set, said first phrase set, said context being labeled as a first label if said context matches said first phrase set, or being labeled as a second label if said context matches said second phrase set; and generating a report indicating that said container matches a sequence of words in said input stream. wherein a container represents the various ways in which possible phrase sets may be combined as part of a text pattern to be matched such that the context moves forward after a match has been found between the container and a sequence of words in the input stream and a next container is processed after a location of the matching sequence of words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 20)
-
-
8. A system comprising:
-
one or more processors; one or more data remembrance components; a specification that is stored in said one or more data remembrance components, said specification comprising; a data expression that defines a plurality of phrase sets; a code expression that defines a first container that specifies a combination that comprises a first one of said phrase sets, a second one of said phrase sets, and a second container that comprises a third one of said phrase sets and a fourth one of said phrase sets; and a map set and an equivalence set that maps a plurality of phrases to a first phrase based on misspellings and equivalent phrases, said phrase analyzer encountering one of said plurality of phrases in said input stream and, based on said map set, converting said one of said plurality of phrases to said first phrase when determining whether said input stream matches said first container; and a phrase analyzer that executes on said one or more processors, that receives said specification and an input stream, that compares words in said input stream to said first container, and that generates a report that identifies where, in said input stream, a match between said first container and said input stream is found, said first container comprising a switch that branches between a plurality of match labels, said phrase analyzer labeling a portion of said input stream that matches said first container as a first label if said portion matches said first phrase set, or being labeled as a second match if said context matches said second phrase set, wherein a container represents the various ways in which possible phrase sets may be combined as part of a text pattern to be matched such that the context moves forward after a match has been found between the container and a sequence of words in the input stream and a next container is processed after a location of the matching sequence of words. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method of performing an action based on input text, the method comprising:
-
using a processor to perform acts comprising; receiving a map set and an equivalence set that maps a first plurality of phrases to a first phrase based on misspellings and equivalent phrases; converting occurrence of any one of said first plurality of phrases in an input text to said first phrase; comparing words in said input text to a first container that describes a combination that comprises a first phrase set, a second phrase set, and a second container, said first phrase set comprising a first plurality of phrases, said second phrase set comprising a second plurality of phrases, said second container comprising a third phrase set and a fourth phrase set; determining that a sequence of words in said input text matches said first container, said match being found based on a match between said sequence of words and said first phrase, said first phrase occurring in said text due to having been converted from one of said first plurality of phrases by said converting act, said first container comprising a switch that branches between a plurality of match labels, said sequence of words being labeled as a first label if said context matches said first phrase set, or being labeled as a second label if said context matches said second phrase set; generating a report that indicates where, in said input text, said sequence of words occurs; and taking a tangible action based on said report, wherein a container represents the various ways in which possible phrase sets may be combined as part of a text pattern to be matched such that the context moves forward after a match has been found between the container and a sequence of words in the input stream and a next container is processed after a location of the matching sequence of words. - View Dependent Claims (16, 17, 18, 19)
and wherein the method further comprises; using a processor to perform acts comprising; indicating that all of said second plurality of phrases have occurred in said input text when any one of said second plurality of phrases occurs in said input text.
-
Specification