Proofing of word collocation errors based on a comparison with collocations in a corpus

  • US 7,774,193 B2
  • Filed: 12/05/2006
  • Issued: 08/10/2010
  • Est. Priority Date: 12/05/2006
  • Status: Active Grant
First Claim

1. A method, implemented by a computing system comprising one or more processors, the method comprising:

    comparing, using one or more of the processors, one or more collocations from a text sample with a corpus;

    identifying, using one or more of the processors, whether the collocations are disfavored in the corpus; and

    providing indications of whether the collocations are disfavored via an output device;

    in which comparing the collocations with the corpus comprises performing one or more searches of the World Wide Web using one or more query terms that comprise each of one or more of the collocations; and

    in which for each of one or more of the collocations for which searches are performed, a search is performed for each of the one or more query terms that comprise the collocation until either one of the query terms provides search results that meet a preselected threshold for matching the collocation, or all the query terms that comprise the collocation are used without meeting the preselected threshold, and further comprising:

    composing one or more query terms with a wild card replacing a word in one of the disfavored collocations;

    searching a word collocation reference for the query terms;

    identifying results of the search having a relatively high proportion of a candidate word replacing the wild card; and

    providing the results of the search having the candidate word via the output device as potentially proper word collocations.
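
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the in-memory `CORPUS` list stands in for searches of the World Wide Web and for the word collocation reference, and the names `to_regex`, `count_hits`, `is_disfavored`, and `suggest`, along with the threshold and proportion values, are hypothetical choices made only for illustration.

```python
import re
from collections import Counter

# Assumption: a tiny in-memory corpus stands in for Web search results
# and for the "word collocation reference" named in the claim.
CORPUS = [
    "she drank strong tea every morning",
    "a cup of strong tea helps",
    "he prefers strong tea with milk",
    "they served weak tea",
    "strong tea and biscuits",
]


def to_regex(query):
    """Turn a query into a regex; a '*' token captures exactly one word."""
    parts = [r"(\w+)" if tok == "*" else re.escape(tok) for tok in query.split()]
    return r"\b" + r"\s+".join(parts) + r"\b"


def count_hits(query, corpus):
    """Count the corpus entries that contain the query at least once."""
    pattern = to_regex(query)
    return sum(1 for sentence in corpus if re.search(pattern, sentence))


def is_disfavored(query_terms, corpus, threshold):
    """Try each query term in turn and stop at the first one that meets the
    threshold; the collocation is disfavored only if every term falls short."""
    for query in query_terms:
        if count_hits(query, corpus) >= threshold:
            return False
    return True


def suggest(collocation, position, corpus, min_share):
    """Replace the word at `position` with a wild card, search the corpus,
    and return the collocations whose filler word accounts for at least
    `min_share` of all wild-card matches."""
    words = collocation.split()
    words[position] = "*"
    pattern = to_regex(" ".join(words))
    fillers = Counter()
    for sentence in corpus:
        for match in re.finditer(pattern, sentence):
            fillers[match.group(1)] += 1
    total = sum(fillers.values())
    suggestions = []
    for word, count in fillers.most_common():
        if total and count / total >= min_share:
            words[position] = word
            suggestions.append(" ".join(words))
    return suggestions


if __name__ == "__main__":
    # Two hypothetical query terms for the collocation under test.
    queries = ["powerful tea", "powerful teas"]
    if is_disfavored(queries, CORPUS, threshold=2):
        print("disfavored:", queries[0])
        print("candidates:", suggest("powerful tea", 0, CORPUS, min_share=0.5))
```

In this sketch the early return in `is_disfavored` mirrors the claim's stopping condition: searching ends as soon as one query term meets the preselected threshold, or after all query terms are exhausted. A real system would replace `count_hits` with calls to a search service and would need its own policy for choosing the threshold.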
