Spell checker with arbitrary length string-to-string transformations to improve noisy channel spelling correction
Abstract
A spell checker based on the noisy channel model has a source model and an error model. The source model determines how likely a word w in a dictionary is to have been generated. The error model determines how likely the word w was to have been incorrectly entered as the string s (e.g., mistyped or incorrectly interpreted by a speech recognition system) according to the probabilities of string-to-string edits. The string-to-string edits allow conversion of one arbitrary length character sequence to another arbitrary length character sequence.
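To illustrate how the two models in the abstract combine, the following is a minimal sketch of noisy channel ranking: each dictionary word w is scored by the product of a source probability P(w) and an error probability P(s | w), and the highest-scoring word is proposed. The tables SOURCE_MODEL and ERROR_MODEL, the function name correct, and all numbers are hypothetical values chosen for illustration; they do not come from the patent.

```python
# Hypothetical source model P(w): how likely each dictionary word is
# to have been generated. The numbers are invented for illustration.
SOURCE_MODEL = {"phone": 0.6, "fine": 0.3, "bone": 0.1}

# Hypothetical error model P(s | w): how likely word w is to have been
# entered as string s. A real error model would derive these values
# from string-to-string edit probabilities; here they are a fixed table.
ERROR_MODEL = {
    ("fone", "phone"): 0.05,
    ("fone", "fine"): 0.02,
    ("fone", "bone"): 0.03,
}


def correct(entered):
    """Return the dictionary word w that maximizes P(w) * P(entered | w)."""
    def score(word):
        # Unknown (string, word) pairs get probability 0 in this toy table.
        return SOURCE_MODEL[word] * ERROR_MODEL.get((entered, word), 0.0)

    return max(SOURCE_MODEL, key=score)


if __name__ == "__main__":
    print(correct("fone"))
```

In this toy setup "phone" wins for the entered string "fone" because its combined score (0.6 × 0.05) exceeds that of "fine" (0.3 × 0.02) and "bone" (0.1 × 0.03).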
10 Claims
1. A method comprising:
- receiving an entered string; and
- determining how likely a word was to have been entered as the string based on partitioning the word w and the string s and computing probabilities for various partitionings to determine a highest likelihood of at least one edit operation that converts a first character sequence of arbitrary length in the word to a second character sequence of arbitrary length in the string.
Dependent claims: 2, 3, 4, 5, 6
7. A program embodied on a computer readable medium, which when executed, directs a computer to perform spell checking, including:
- a source model component of the program to determine how likely a word w in a dictionary is to have been generated; and
- an error model component of the program to determine how likely the word w was to have been incorrectly entered as the string s based on arbitrary length string-to-string transformations, wherein the error model partitions the word w and the string s and computes probabilities for various partitionings.
Dependent claims: 8, 9, 10
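The partitioning step recited in claims 1 and 7 can be sketched as a small dynamic program. This is one possible reading, not the patented implementation: it assumes the score of a partitioning is the product of per-fragment edit probabilities and that the error model keeps the highest-scoring partitioning. EDIT_PROB, IDENTITY_PROB, MAX_FRAGMENT, and both function names are hypothetical, and the probabilities are invented.

```python
from functools import lru_cache

# Hypothetical edit probabilities P(beta | alpha): the chance that the word
# fragment alpha is entered as the string fragment beta. Invented values.
EDIT_PROB = {("ph", "f"): 0.1}

IDENTITY_PROB = 0.9  # assumed probability of copying one character unchanged
MAX_FRAGMENT = 3     # assumed cap on fragment length to bound the search


def edit_prob(alpha, beta):
    """Look up P(beta | alpha); unlisted non-identity edits get 0 here."""
    if (alpha, beta) in EDIT_PROB:
        return EDIT_PROB[(alpha, beta)]
    if alpha == beta and len(alpha) == 1:
        return IDENTITY_PROB
    return 0.0


def best_partition_prob(word, string):
    """Highest product of edit probabilities over all joint partitionings."""

    @lru_cache(maxsize=None)
    def best(i, j):
        # best(i, j): best score for turning word[i:] into string[j:].
        if i == len(word) and j == len(string):
            return 1.0
        top = 0.0
        for di in range(MAX_FRAGMENT + 1):
            for dj in range(MAX_FRAGMENT + 1):
                if di == 0 and dj == 0:
                    continue  # a fragment pair must consume something
                if i + di > len(word) or j + dj > len(string):
                    continue
                p = edit_prob(word[i:i + di], string[j:j + dj])
                if p > 0.0:
                    top = max(top, p * best(i + di, j + dj))
        return top

    return best(0, 0)


if __name__ == "__main__":
    print(best_partition_prob("phone", "fone"))
```

With these assumed values, the only partitioning of "phone" and "fone" with nonzero probability is ph→f, o→o, n→n, e→e, giving 0.1 × 0.9³. Fragments of length zero on either side are permitted by the loops, so insertions and deletions could be modeled by adding entries such as ("", "x") to the table.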
Specification