Automated error checking system and method
First Claim
Patent Images
1. A computer implemented error checking database, comprising:
- a data subset derived in part from past user input data, the data subset is captured in a data log that records the past user input data from 1 to N users via at least one of a network and a local data source, N being a natural number;
a subset of expressions automatically filtered from the data subset, the subset of expressions employed as a reference to facilitate error checking in accordance with current user input data;
an automated mining component to determine 1 to M (X,Y) pairs of potential correct and incorrect expressions,wherein X is expected to be an incorrect formulation of Y and M a natural number, the (X,Y) pairs are selected in part based upon a predetermined frequency threshold (FT),FT describes a frequency of occurrence of at least one of the X and Y expressions, the automated mining component creates a right/wrong list denoted as (W,R) to describe hypothetical pairings of expressions such that W describes a potential incorrect expression and R describes a potential correct expression that has been associated with the W expression; and
an error model that is formulated from the right/wrong list of expressions such that the error model learns probabilities for different types of expression errors.
3 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to a system and methodology to facilitate automated error correction of user input data via an analysis of the input data in accordance with an automatically generated and filtered database of processed structural groupings or formulations selected and filtered from past user activities. The filtered database provides a relevant foundation of potential phrases, topics, symbols, speech and/or colloquial structures of interest to users—which are automatically determined from previous user activity, and employed to facilitate automated error checking in accordance with the user'"'"'s current input, command and/or request for information.
53 Citations
32 Claims
-
1. A computer implemented error checking database, comprising:
-
a data subset derived in part from past user input data, the data subset is captured in a data log that records the past user input data from 1 to N users via at least one of a network and a local data source, N being a natural number; a subset of expressions automatically filtered from the data subset, the subset of expressions employed as a reference to facilitate error checking in accordance with current user input data; an automated mining component to determine 1 to M (X,Y) pairs of potential correct and incorrect expressions, wherein X is expected to be an incorrect formulation of Y and M a natural number, the (X,Y) pairs are selected in part based upon a predetermined frequency threshold (FT), FT describes a frequency of occurrence of at least one of the X and Y expressions, the automated mining component creates a right/wrong list denoted as (W,R) to describe hypothetical pairings of expressions such that W describes a potential incorrect expression and R describes a potential correct expression that has been associated with the W expression; and an error model that is formulated from the right/wrong list of expressions such that the error model learns probabilities for different types of expression errors. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer implemented automated error checking method, comprising:
-
modeling past user input data relating to at least one query;
automatically creating an error checking database by filtering the past user input data in accordance with the modeling;automatically mining 1 to M (X,Y) pairs of potential correct and incorrect expressions, wherein X is expected to be an incorrect formulation of Y and M is a natural number, the (X,Y) pairs are selected in part based upon a predetermined frequency threshold (FT), FT describes a frequency of occurrence of at least one of the X and Y expressions; creating a right/wrong list denoted as (W,R) to describe hypothetical pairings of expressions such that W describes a potential incorrect expression and R describes a potential correct expression that has been associated with the W expression; and formulating from the right/wrong list of expressions such that probabilities for different types of expression errors can be learned. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A computer implemented automated error checking system, comprising:
-
means for at least one of accessing and processing past user input data; means for modeling the past user input data; means for determining current user input data; means for providing at least one error suggestion in response to the current user input data based at least in part on the modeling of the past user input data; means for automatically mining 1 to M (X,Y) pairs of potential correct and incorrect expressions, wherein X is expected to be an incorrect formulation of Y and M is a natural number, the (X,Y) pairs are selected in part based upon a predetermined frequency threshold (FT), FT describes a frequency of occurrence of at least one of the X and Y expressions; means for creating a right/wrong list denoted as (W,R) to describe hypothetical pairings of expressions such that W describes a potential incorrect expression and R describes a potential correct expression that has been associated with the W expression; and means for formulating an error model from the right/wrong list of expressions such that the error model learns probabilities for different types of expression errors.
-
-
26. A computer-readable medium having stored thereon a data structure, comprising:
-
a first data field containing application data associated with a user interface; a second data field containing error correction data derived from an error model that is trained in accordance with previous user inputs; a third data field containing an automated mining component to determine 1 to M (X,Y) pairs of potential correct and incorrect expressions, wherein X is expected to be an incorrect formulation of Y and M is a natural number, the (X,Y) pairs are selected in part based upon a predetermined frequency threshold (FT), FT describes a frequency of occurrence of at least one of the X and Y expressions, the automated mining component creates a right/wrong list denoted as (W,R) to describe hypothetical pairings of expressions such that W describes a potential incorrect expression and R describes a potential correct expression that has been associated with the W expression; and a fourth data field containing an error model that is formulated from the right/wrong list of expressions such that the error model learns probabilities for different types of expression errors. - View Dependent Claims (27, 28, 29)
-
-
30. A computer implemented method to automatically create a dictionary, comprising:
-
creating a dictionary having a structure wherein at least one correct string is marked as correct and at least one incorrect string is marked as incorrect, the correct and incorrect strings employable in a chain; determining a subsequent string that points to at least one of another incorrect string in the chain and to another correct string at an end of the chain to facilitate error checking in accordance with the dictionary; and determining that a string x is an error of string y and string y is an error of string z and if string z is correct, then performing at least one of the following; pruning determined string x and determined string y from the dictionary; maintaining pruned string x and pruned string y in the dictionary, but marking the pruned strings as errors; maintaining pruned string x and pruned string y in the dictionary; causing both maintained string x and maintained string y to point to determined string z; and maintaining pruned string x and pruned string y in the dictionary and having pruned string x point to pruned string y and pruned string y point to determined string z. - View Dependent Claims (31, 32)
-
Specification