Automated voice and speech labeling
First Claim
1. A method for converting speech to text, comprising the steps of:
receiving a digital signal comprising a recorded spoken input;
obtaining at least one measurement of said digital signal, the measurement comprising a first measured portion of said recorded spoken input and a second measured portion of said recorded spoken input;
identifying at least one characteristic of said digital signal by comparing said first measured portion of said recorded spoken input to a first database of digital audio signal characteristics;
transcribing said first measured portion of said recorded spoken input using said at least one characteristic of said digital signal to create an initial transcription;
backfilling said first database of digital audio signal characteristics with at least one characteristic from a second database of digital audio signal characteristics;
identifying a second characteristic of said digital signal by comparing said second measured portion of said digital signal to said backfilled first database of digital audio signal characteristics;
transcribing said second measured portion of said recorded spoken input using said second characteristic of said digital signal.
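The steps of claim 1 can be read as a two-pass pipeline, sketched below under loud assumptions. The claim does not say how the signal is measured, how a portion is compared to a database, which characteristics are backfilled, or how transcription is performed. The names `CharacteristicDB`, `measure`, `transcribe`, and `convert`, the two toy features (mean absolute amplitude and zero-crossing rate), the nearest-neighbour comparison, the half-and-half split, and the `accent_*` labels are all hypothetical choices made only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class CharacteristicDB:
    """Toy stand-in for a database of digital audio signal characteristics."""
    entries: dict = field(default_factory=dict)  # name -> feature vector

    def best_match(self, features):
        # Assumption: "comparing" means nearest entry by squared distance.
        def dist(name):
            return sum((a - b) ** 2 for a, b in zip(self.entries[name], features))
        return min(self.entries, key=dist)

    def backfill(self, other, names):
        # Copy the named characteristics from a second database into this one.
        for name in names:
            self.entries[name] = other.entries[name]


def measure(samples):
    """Hypothetical measurement: [mean absolute amplitude, zero-crossing rate]."""
    energy = sum(abs(s) for s in samples) / len(samples)
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))
    return [energy, crossings / (len(samples) - 1)]


def transcribe(samples, characteristic):
    # Placeholder only: a real system would run a recognizer tuned to the
    # identified characteristic; here we just record which one was used.
    return f"<{len(samples)} samples decoded with '{characteristic}'>"


def convert(signal, first_db, second_db, backfill_names):
    # Obtain a measurement with a first and a second measured portion
    # (assumption: the two portions are simply the two halves of the signal).
    mid = len(signal) // 2
    first, second = signal[:mid], signal[mid:]

    # Identify a characteristic from the first portion and transcribe it.
    c1 = first_db.best_match(measure(first))
    initial = transcribe(first, c1)

    # Backfill the first database from the second database.
    first_db.backfill(second_db, backfill_names)

    # Identify a second characteristic against the backfilled database
    # and transcribe the second portion with it.
    c2 = first_db.best_match(measure(second))
    return c1, initial, c2, transcribe(second, c2)


first_db = CharacteristicDB({"accent_a": [0.5, 0.1], "accent_b": [0.5, 0.3]})
second_db = CharacteristicDB({"accent_c": [0.5, 0.9]})
signal = [0.5, 0.5, 0.5, 0.5, -0.5, -0.5, -0.5, -0.5,
          0.5, -0.5, 0.5, -0.5, 0.5, -0.5, 0.5, -0.5]
c1, initial, c2, final = convert(signal, first_db, second_db, ["accent_c"])
print(c1, c2)
```

In this toy run the second portion matches an entry that exists only because of the backfill step, which is the part of the claim the sketch is meant to make concrete.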
Abstract
A system and method for voice and speech analysis which correlates a speaker signal source and a normalized signal comprising measurements of input acoustic data to a database of language, dialect, accent, and/or speaker attributes in order to create a transcription of the input acoustic data.
20 Claims
1. A method for converting speech to text, comprising the steps of:
receiving a digital signal comprising a recorded spoken input;
obtaining at least one measurement of said digital signal, the measurement comprising a first measured portion of said recorded spoken input and a second measured portion of said recorded spoken input;
identifying at least one characteristic of said digital signal by comparing said first measured portion of said recorded spoken input to a first database of digital audio signal characteristics;
transcribing said first measured portion of said recorded spoken input using said at least one characteristic of said digital signal to create an initial transcription;
backfilling said first database of digital audio signal characteristics with at least one characteristic from a second database of digital audio signal characteristics;
identifying a second characteristic of said digital signal by comparing said second measured portion of said digital signal to said backfilled first database of digital audio signal characteristics;
transcribing said second measured portion of said recorded spoken input using said second characteristic of said digital signal.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 19
9. A system for converting speech to text, the system comprising:
a digital audio signal comprising an encoding of a recorded spoken input;
means for obtaining at least one measurement of said digital audio signal, the measurement comprising a first measured portion of said recorded spoken input and a second measured portion of said recorded spoken input;
means for comparing said first measured portion of said recorded spoken input to a first database of digital audio signal characteristics;
means for identifying at least one characteristic of said digital audio signal based on said comparison;
means for transcribing said first measured portion of said spoken input using said at least one characteristic of the digital audio signal to create an initial transcription;
means for backfilling said first database of digital audio signal characteristics with at least one characteristic from a second database of digital audio signal characteristics;
means for identifying a second characteristic of said digital signal by comparing said second measured portion of said digital signal to said backfilled first database of digital audio signal characteristics; and
means for transcribing said second measured portion of said recorded spoken input using said second characteristic of said digital signal.
Dependent claims: 10, 11, 12, 13, 14, 15, 16, 17, 18, 20
Specification