×

Tagging text snippets

  • US 10,331,768 B2
  • Filed: 09/20/2016
  • Issued: 06/25/2019
  • Est. Priority Date: 09/21/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method of tagging a set of text snippets with a tag organized in a taxonomy, the method comprising:

  • receiving, by a processor, the set of text snippets and the tag of a set of tags, wherein the tag of the set of tags comprises a set of words;

    parsing, by the processor, the set of words in the tag into a parse tree with a root node and one or more levels to determine one or more types of phrases belonging to one or more parts of speech;

    determining by the processor,a frequency of one or more words present in the tag of the set of tags, anda headword from the one or more words present in the tag of the set of tags;

    assigning, by the processor, a numeric weight to the one or more words of the tag based on the frequency, parse tree, headword of the set of words associated with the tag, and the parts of speech category of words, wherein the numeric weight indicates relative importance of the one or more words in the tag with respect to other words present in other tags;

    determining, by the processor, correspondences between the one or more words of the tag and words present in the set of text snippets, wherein the determining of the correspondences results in identification of a same word or a similar meaning word present in the tag;

    computing, by the processor, a belief factor for the tag by applying a certainty factor algebra (CFA) based upon the numeric weight of the same word and the similar meaning word; and

    identifying, dynamically a text, from the set of text snippets, for which a feedback is sought from a user, and wherein the text is identified using an active learning approach;

    receiving, dynamically a feedback about the tag assigned to the text snippet from the user, wherein the feedback received is used for historical learning by updating a knowledge base referred for tagging the set of text snippets and future set of text snippets using the set of tags present in the taxonomy; and

    updating the dynamic feedback in the knowledge base using the active learning approach assigned to the set of text snippets, wherein the active learning approach optimizes an amount of feedback received from the user, wherein the knowledge base comprises a word sense-importance database and a lexical resource, and wherein the word sense-importance database discriminates between one or more different meanings of a word and a relative importance of the set of words associated with a tag in context.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×