Voice enabled knowledge system
First Claim
1. A system for converting speech to text comprising:
- a speech recognition engine for understanding the spoken words of a user, further comprising;
a representation unit to represent the spoken words;
a model classification unit to classify the spoken words;
a training database to match the spoken words with preset words, and a search unit to search for the spoken word in said training database, based on the results of said model classification.
0 Assignments
0 Petitions
Accused Products
Abstract
This invention discloses a voice enabled knowledge system, comprising a speech recognition engine and text to speech engine. The speech recognition engine further comprises a representation unit to represent the spoken words, a model classification unit to classify the spoken words, a training database to match the spoken words with preset words and a search unit to search for the spoken word in said training database, based on the results of said model classification. The text to speech engine for conversion of an input text to speech, comprises a text pre-processing unit for analyzing the input text in a sentence form, a prosody unit for word recognition using said acoustic model, a concatenation unit for converting the diphone equivalents into words and thereafter into a sentence and an audio output device for speech output.
-
Citations
15 Claims
-
1. A system for converting speech to text comprising:
a speech recognition engine for understanding the spoken words of a user, further comprising;
a representation unit to represent the spoken words;
a model classification unit to classify the spoken words;
a training database to match the spoken words with preset words, and a search unit to search for the spoken word in said training database, based on the results of said model classification.
-
2. A system for converting text to speech comprising:
a text to speech engine for understanding the spoken words of a user, further comprising;
a text pre-processing unit for analyzing the input text in a sentence form;
a prosody unit for word recognition using said acoustic model;
a concatenation unit for converting the diphone equivalents into words and thereafter into a sentence; and
an audio output device for speech output.
-
3. A voice enabled knowledge system, comprising:
-
a speech recognition engine for understanding the spoken words of a user, further comprising;
a representation unit to represent the spoken words;
a model classification unit to classify the spoken words;
a training database to match the spoken words with preset words, a search unit to search for the spoken word in said training database, based on the results of said model classification; and
a text to speech engine for conversion of an input text to speech, further comprising;
a text pre-processing unit for analyzing the input text in a sentence form;
a prosody unit for word recognition using said acoustic model;
a concatenation unit for converting the diphone equivalents into words and thereafter into a sentence; and
an audio output device for speech output. - View Dependent Claims (4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
Specification