Text sample entry group formulation

  • US 9,535,983 B2
  • Filed: 10/29/2013
  • Issued: 01/03/2017
  • Est. Priority Date: 10/29/2013
  • Status: Active Grant
First Claim

1. A method comprising:

  • an act of accessing a set of text samples, each having a corresponding text sample identifier;

    for each of at least some of the set of text samples, an act of preparing the text sample, the act of preparing the text sample comprising:

    an act of parsing a plurality of text components from the text sample; and

    for each of at least some of the parsed plurality of text components, an act of identifying the text component, the act of identifying the text component comprising:

    an act of determining if the text component is already correlated to a text component identifier, the text component identifier representing the content while being distinguished from the content;

    if the text component is already correlated to a text component identifier, assigning the text component identifier to the text component and such that when two text components are the same then the two text components will be assigned a same text component identifier;

    if the text component is not already correlated to a text component identifier, assigning a new text component identifier to the text component; and

    an act of creating a text component entry comprising a) the text sample identifier for the text sample from which the text component was parsed, and b) the assigned text component identifier;

    an act of creating a text sample entry group comprising a plurality of text component entries corresponding to text components parsed from the text sample, and such that the plurality of text component entries are sorted by sequence of the corresponding text component within the text sample; and

    an act of storing a plurality of text sample entry groups created by performance of the act of preparing the text sample for each of the at least some of the set of text samples, wherein the plurality of text samples entries are stored in a text component entry table that includes a duplicate set of text component entries having a same text sample identifier and component identifier pairing.
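
The steps recited in the claim can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the function name `build_entry_table`, whitespace tokenization, integer identifiers, and the in-memory list standing in for the text component entry table are all hypothetical choices not taken from the claim.

```python
from typing import Dict, List, Tuple

# (text sample identifier, text component identifier)
Entry = Tuple[int, int]


def build_entry_table(samples: Dict[int, str]) -> Tuple[List[Entry], Dict[str, int]]:
    """Build a flat entry table from text samples keyed by sample identifier.

    Assumption: components are whitespace-separated tokens, and identifiers
    are integers handed out in order of first appearance.
    """
    component_ids: Dict[str, int] = {}  # component content -> identifier
    table: List[Entry] = []             # stands in for the entry table

    for sample_id, text in samples.items():
        group: List[Entry] = []         # one text sample entry group
        for component in text.split():  # parse components in sequence
            if component not in component_ids:
                # Not yet correlated: assign a new identifier.
                component_ids[component] = len(component_ids) + 1
            # Identical components always receive the same identifier.
            group.append((sample_id, component_ids[component]))
        # The group is already ordered by position within the sample.
        table.extend(group)

    return table, component_ids


samples = {101: "the cat saw the dog", 102: "the dog ran"}
table, ids = build_entry_table(samples)
print(table)
```

In this example the word "the" occurs twice in sample 101, so the pairing `(101, 1)` appears twice in the table. That corresponds to the duplicate entries with the same text sample identifier and component identifier pairing described in the final clause of the claim.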
