×

Methods and system for fast, adaptive correction of misspells

  • US 10,372,814 B2
  • Filed: 10/18/2016
  • Issued: 08/06/2019
  • Est. Priority Date: 10/18/2016
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for adaptive correction of misspelling, the method comprising:

  • pre-training, by a processor, a pre-trained word vector;

    receiving, at the processor from a user device connected to the processor, a text for spelling analysis;

    creating, by the processor, a table of entries for each correctly spelled word in a corpus, wherein each table of entries includes alternative words of a particular correctly spelled word, each alternative word having one or more characters less than the particular correctly spelled word, the number of occurrences of each alternative word in the corpus, and links of the alternative words to the particular correctly spelled word;

    comparing, by the processor, a particular misspelled word in the text to the table of entries having an edit distance from the particular misspelled word and a minimum frequency of occurrence in the corpus to form a candidate set of entries;

    mapping, by the processor, each word in the text to the pre-trained word vector;

    obtaining, by the processor, a first vector representing a left context of the particular misspelled word and a second vector representing a right context of the particular misspelled word using a recurrent neural network (RNN);

    inputting, by the processor, the first vector and the second vector to a fully connected layer through the RNN, and inputting, by the processor, a third vector representing the particular misspelled word directly to the fully connected layer;

    replacing, by the processor, the particular misspelled word with each candidate in the candidate set of entries;

    outputting, by the processor, a context sensitive score from a logistic unit for each candidate, wherein the logistic unit is connected to the fully connected layer;

    ranking, by the processor, the candidate set of entries utilizing the context sensitive score so that each candidate has a ranking;

    ordering, by the processor, at least some of the candidates based on the ranking to identify corrections to the particular misspelled word; and

    displaying, to a user, the corrections to the particular misspelled word.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×