Common phrase identification and language dictation recognition systems and methods for using the same
Abstract
In at least one exemplary embodiment of the common phrase identification and language dictation recognition systems and methods for using the same, the system comprises a database capable of receiving a plurality of verbal records, each verbal record comprising at least one identifier and at least one verbal feature, and a processor operably coupled to the database, where the processor has and executes a software program. The processor is operational to identify a subset of the plurality of verbal records from the database, extract at least one verbal feature from the identified records, analyze the at least one verbal feature of the subset of the plurality of verbal records, process the subset of the plurality of records using the analyzed feature according to at least one reasoning approach, generate a processed verbal record using the processed subset of the plurality of records, and deliver the processed verbal record to a recipient. The processor is further operational to identify common phrases in parts of the verbal record, identify a body of work for building a set of common phrases, analyze documents in a training set to find common phrases, and replace phrases with the common phrases.
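The final step of the abstract, replacing phrases in a record with known common phrases, might be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how a transcribed phrase is matched to a common phrase, so the sketch assumes a character-level similarity score from Python's standard difflib module, and the function name, the threshold value, and the sample phrases are all hypothetical.

```python
from difflib import SequenceMatcher


def replace_with_common_phrases(transcript, common_phrases, threshold=0.85):
    """Replace near-miss word windows in a transcript with a known common phrase.

    Assumption: similarity is a difflib ratio between the window text and the
    phrase text. The source does not specify a matching rule.
    """
    words = transcript.split()
    output = []
    i = 0
    while i < len(words):
        replaced = False
        for phrase in common_phrases:
            n = len(phrase.split())
            window = words[i:i + n]
            if len(window) < n:
                continue  # not enough words left to hold this phrase
            score = SequenceMatcher(
                None, " ".join(window).lower(), phrase.lower()
            ).ratio()
            if score >= threshold:
                output.append(phrase)  # substitute the canonical phrase
                i += n
                replaced = True
                break
        if not replaced:
            output.append(words[i])  # keep the original word unchanged
            i += 1
    return " ".join(output)


# Hypothetical usage: a misrecognized word inside a frequently dictated phrase.
corrected = replace_with_common_phrases(
    "patient denys chest pain today", ["denies chest pain"]
)
```

A lower threshold replaces more aggressively and raises the risk of false positives, which is the concern the claims address with a per-phrase accuracy check.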
15 Citations
23 Claims
1. A computerized method for analyzing verbal records to improve a textual transcript, the method comprising the steps of:
identifying a training set and a test set of transcribed verbal records, the training set comprising a first subset of a plurality of transcribed verbal records, and the test set comprising a different second subset of the plurality of transcribed verbal records;
for the each transcribed verbal record in the training set, determine a plurality of possible common phrases comprising a plurality of sequences of words appearing in the each verbal record in the training set, the each of the plurality of possible common phrases further having a minimum word length;
for each of the plurality of possible common phrases, determine a best parameter;
for each of the plurality of possible common phrases, finding a phrase accuracy based at least in part on a test for false positives;
saving the best parameter for the each of the plurality of possible common phrases; and
applying the each of the plurality of possible common phrases to the transcribed verbal records, using the phrase accuracy, to create the textual transcript.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
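The training steps of claim 1 can be illustrated with a minimal sketch. Several points here are assumptions rather than anything stated in the claim: candidate phrases are taken to be word n-grams that recur across training records, the "best parameter" is assumed to be the number of leading trigger words used to recognize a phrase, and a false positive is assumed to be a trigger match that does not continue into the full phrase. All function and parameter names are hypothetical.

```python
from collections import Counter


def ngrams(words, n):
    """All runs of n consecutive words, as tuples."""
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


def candidate_phrases(training_records, min_words=3, max_words=5, min_records=2):
    """Word sequences of at least min_words that recur across training records."""
    seen_in = Counter()
    for record in training_records:
        words = record.lower().split()
        phrases = set()
        for n in range(min_words, max_words + 1):
            phrases.update(ngrams(words, n))
        seen_in.update(phrases)  # each phrase counts once per record
    return [p for p, count in seen_in.items() if count >= min_records]


def phrase_accuracy(phrase, records, trigger_words=2):
    """Share of trigger matches that continue into the full phrase.

    Assumption: a trigger match that does not continue into the full phrase
    is counted as a false positive.
    """
    trigger = phrase[:trigger_words]
    hits = false_positives = 0
    for record in records:
        words = record.lower().split()
        for i in range(len(words) - trigger_words + 1):
            if tuple(words[i:i + trigger_words]) == trigger:
                if tuple(words[i:i + len(phrase)]) == phrase:
                    hits += 1
                else:
                    false_positives += 1
    total = hits + false_positives
    return hits / total if total else 0.0


def best_trigger_length(phrase, records):
    """Assumed 'best parameter': the trigger length with the highest accuracy."""
    return max(range(1, len(phrase)),
               key=lambda k: phrase_accuracy(phrase, records, k))


# Hypothetical records, split into a training set and a disjoint test set.
training_set = ["the patient denies chest pain today",
                "the patient denies chest pain and fever"]
test_set = ["the patient denies chest pain",
            "the patient reports chest pain"]

candidates = candidate_phrases(training_set, min_words=3, max_words=4)
accuracy = phrase_accuracy(("the", "patient", "denies"), test_set)
```

Under these assumptions, a phrase with a low accuracy on the test set would be applied cautiously or not at all when producing the final transcript.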
12. A system for analyzing verbal records to improve a textual transcript, the system comprising:
a database configured to receive a plurality of transcribed verbal records;
a processor operably connected to the database, and configured to:
identify a training set and a test set of the transcribed verbal records, the training set comprising a first subset of the plurality of transcribed verbal records, and the test set comprising a different second subset of the plurality of transcribed verbal records;
determine a plurality of possible common phrases for the each verbal record in the training set, the plurality of possible common phrases comprising a plurality of sequences of words appearing in the each verbal record in the training set, the each of the plurality of possible common phrases further having a minimum word length;
determine a best parameter for each of the plurality of possible common phrases;
determine a phrase accuracy based at least in part on a test for false positives;
save the best parameter for the each of the plurality of possible common phrases; and
apply the each of the plurality of possible common phrases to the transcribed verbal records, using the phrase accuracy to create the textual transcript.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23
Specification