ANAPHORA RESOLUTION FOR SEMANTIC TAGGING
First Claim
1. A method comprising:
- tokenizing, by a computer-based system, a body of text by splitting the body of text into individual tokens;
weighting, by the computer-based system and based on the tokenizing, the individual tokens having a pronoun grammatical role based on structured contextual information;
analyzing, by the computer-based system, structured contextual information to facilitate a homophora resolution;
integrating, by the computer-based system and in response to the analyzing and in response to the weighting of the individual tokens, the homophora resolution into an anaphora resolution algorithm by substituting the structured contextual information into the body of text to create a substituted body of text;
translating, by the computer-based system and based on the integrating, semantic concepts of the substituted body of text into one or more semantic tags;
analyzing, by the computer-based system and based on semantic reasoning and using the one or more semantic tags, implied relationships of the text within a group of documents to identify a specific topic; and
displaying, by the computer-based system, in response to the conducting and to a user interface, the specific identified topic of the substituted body of text.
2 Assignments
0 Petitions
Accused Products
Abstract
A semantic tagging method may add context to a sentence in order to increase search efficiency. Regardless of an author'"'"'s writing style, translating semantic concepts into tags may increase search efficiency. Automatic semantic tagging of documents may allow semantic search and reasoning. Text for semantic tagging may include an email, a website chat room, an internet forum, or a text message. Additional texts may include aggregating general consensus of an emailed topic across multiple emails, whether in the same email chain or separate emails. To increase search efficiency, the analysis of prior communications within the body of text may comprise analyzing structured contextual information to facilitate with homophora resolution. The structured contextual information may include at least one of a sender email address, one or more recipient email addresses, a subject field, a message date and time stamp, and an attachment title.
-
Citations
20 Claims
-
1. A method comprising:
-
tokenizing, by a computer-based system, a body of text by splitting the body of text into individual tokens; weighting, by the computer-based system and based on the tokenizing, the individual tokens having a pronoun grammatical role based on structured contextual information; analyzing, by the computer-based system, structured contextual information to facilitate a homophora resolution; integrating, by the computer-based system and in response to the analyzing and in response to the weighting of the individual tokens, the homophora resolution into an anaphora resolution algorithm by substituting the structured contextual information into the body of text to create a substituted body of text; translating, by the computer-based system and based on the integrating, semantic concepts of the substituted body of text into one or more semantic tags; analyzing, by the computer-based system and based on semantic reasoning and using the one or more semantic tags, implied relationships of the text within a group of documents to identify a specific topic; and displaying, by the computer-based system, in response to the conducting and to a user interface, the specific identified topic of the substituted body of text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. An article of manufacture including a non-transitory, tangible computer readable storage medium having instructions stored thereon that, in response to execution by a computer-based system, cause the computer-based system to perform operations comprising:
-
tokenizing, by the computer-based system, a body of text by splitting the body of text into individual tokens; weighting, by the computer-based system and based on the tokenizing, the individual tokens having a pronoun grammatical role based on structured contextual information; analyzing, by the computer-based system, structured contextual information to facilitate a homophora resolution; integrating, by the computer-based system and in response to the analyzing and in response to the weighting of the individual tokens, the homophora resolution into an anaphora resolution algorithm by substituting the structured contextual information into the body of text to create a substituted body of text; translating, by the computer-based system and based on the integrating, semantic concepts of the substituted body of text into one or more semantic tags; analyzing, by the computer-based system and based on semantic reasoning and using the one or more semantic tags, implied relationships of the text within a group of documents to identify a specific topic; and displaying, by the computer-based system, in response to the conducting and to a user interface, the specific identified topic of the substituted body of text.
-
-
20. A system comprising:
-
a tangible, non-transitory memory communicating with a processor, the tangible, non-transitory memory having instructions stored thereon that, in response to execution by the processor, cause the processor to perform operations comprising; tokenizing, by the processor, a body of text by splitting the body of text into individual tokens; weighting, by the processor and based on the tokenizing, the individual tokens having a pronoun grammatical role based on structured contextual information; analyzing, by the processor, structured contextual information to facilitate a homophora resolution; integrating, by the processor and in response to the analyzing and in response to the weighting of the individual tokens, the homophora resolution into an anaphora resolution algorithm by substituting the structured contextual information into the body of text to create a substituted body of text; translating, by the processor and based on the integrating, semantic concepts of the substituted body of text into one or more semantic tags; analyzing, by the processor and based on semantic reasoning and using the one or more semantic tags, implied relationships of the text within a group of documents to identify a specific topic; and displaying, by the processor, in response to the conducting and to a user interface, the specific identified topic of the substituted body of text.
-
Specification