Methods and systems for improving text segmentation

  • US 8,078,633 B2
  • Filed: 03/15/2010
  • Issued: 12/13/2011
  • Est. Priority Date: 09/30/2004
  • Status: Active Grant
First Claim

1. A computer-implemented method, comprising:

    receiving, at a computer system, a string of characters that includes no word-delineating breaks;

    generating, by the computer system from the string of characters, a plurality of candidate word groups that are portions of the string of characters;

    determining, by the computer system, frequencies with which all or a portion of each of the candidate word groups occur in a corpus; and

    selecting, by the computer system using the determined frequencies, one or more of the candidate word groups for submission to an entity, wherein the one or more candidate word groups are selected based on each of the one or more candidate word groups having a determined frequency that is greater than determined frequencies for at least a threshold number of other candidate word groups.
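
The four steps of claim 1 (receive, generate candidates, determine frequencies, select) can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names, the toy vocabulary, the toy corpus, and the choice to measure frequency as contiguous phrase occurrences are all assumptions made for illustration; the claim itself does not prescribe them.

```python
def candidate_word_groups(text, vocabulary):
    """Enumerate splits of `text` into words drawn from `vocabulary`.

    Assumption: a "candidate word group" is modeled here as a full
    segmentation of the unbroken string into known words.
    """
    if not text:
        return [[]]
    results = []
    for end in range(1, len(text) + 1):
        head = text[:end]
        if head in vocabulary:
            for rest in candidate_word_groups(text[end:], vocabulary):
                results.append([head] + rest)
    return results


def phrase_frequency(group, corpus_tokens):
    """Count how often `group` occurs as a contiguous run in the corpus.

    Assumption: "frequency in a corpus" is modeled as an exact phrase count
    over a pre-tokenized list of words.
    """
    n = len(group)
    return sum(
        1
        for i in range(len(corpus_tokens) - n + 1)
        if corpus_tokens[i:i + n] == group
    )


def select_groups(groups, frequencies, threshold):
    """Keep each group whose frequency is strictly greater than the
    frequencies of at least `threshold` other candidate groups."""
    selected = []
    for i, group in enumerate(groups):
        beaten = sum(
            1
            for j, freq in enumerate(frequencies)
            if j != i and freq < frequencies[i]
        )
        if beaten >= threshold:
            selected.append(group)
    return selected


# Hypothetical inputs: a string with no word breaks, a small vocabulary,
# and a tiny corpus standing in for a real document collection.
text = "nowhere"
vocabulary = {"no", "now", "here", "where", "nowhere"}
corpus_tokens = (
    "we are now here and now here again but no where else nowhere".split()
)

groups = candidate_word_groups(text, vocabulary)
frequencies = [phrase_frequency(g, corpus_tokens) for g in groups]
selected = select_groups(groups, frequencies, threshold=2)
```

With these toy inputs, three candidate groups are generated ("no where", "now here", "nowhere"), and only "now here" has a frequency greater than that of at least two other candidates, so it is the one selected for submission. The recursive enumeration is exponential in the worst case; a practical system would likely prune or memoize candidates, which this sketch omits for brevity.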

  • 2 Assignments