Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
First Claim
1. An apparatus comprising:
- transceiver circuitry configured to receive an audio stream, the audio stream including an audio waveform;
a memory configured to store the received audio stream;
audio production circuitry configured to produce sound using the audio waveform;
processing circuitry configured to;
analyze the received audio stream and identify a modification segment of the audio waveform, the modification segment being a segment of the audio waveform where production of the audio waveform may be modified to mitigate a delay in receiving the audio stream by temporally extending the modification segment without substantially affecting clarity of the produced sound, anddrive production of sound using the audio waveform based at least in part on the modification segment that was identified;
wherein the audio stream includes metadata associated with the audio waveform that indicates a position of a specific type of sound included in the audio waveform, and the processing circuitry is configured to analyze the associated metadata to identify the modification segment having the position within the specific type of sound; and
wherein the specific type of sound is phonemes having natural pauses, phonemes having voiceless glottal plosives, phonemes related to vowels, phonemes related to fricatives, quasi-stationary audio waveform segments of phonemes, middle audio waveform segments of phonemes, lip positions having natural pauses, or lip positions having voiceless glottal plosives.
1 Assignment
0 Petitions
Accused Products
Abstract
A communication component modifies production of an audio waveform at determined modification segments to thereby mitigate the effects of a delay in processing and/or receiving a subsequent audio waveform. The audio waveform and/or data associated with the audio waveform are analyzed to identify the modification segments based on characteristics of the audio waveform and/or data associated therewith. The modification segments show where the production of the audio waveform may be modified without substantially affecting the clarity of the sound or audio. In one embodiment, the invention modifies the sound production at the identified modification segments to extend production time and thereby mitigate the effects of delay in receiving and/or processing a subsequent audio waveform for production.
-
Citations
24 Claims
-
1. An apparatus comprising:
-
transceiver circuitry configured to receive an audio stream, the audio stream including an audio waveform; a memory configured to store the received audio stream; audio production circuitry configured to produce sound using the audio waveform; processing circuitry configured to; analyze the received audio stream and identify a modification segment of the audio waveform, the modification segment being a segment of the audio waveform where production of the audio waveform may be modified to mitigate a delay in receiving the audio stream by temporally extending the modification segment without substantially affecting clarity of the produced sound, and drive production of sound using the audio waveform based at least in part on the modification segment that was identified; wherein the audio stream includes metadata associated with the audio waveform that indicates a position of a specific type of sound included in the audio waveform, and the processing circuitry is configured to analyze the associated metadata to identify the modification segment having the position within the specific type of sound; and wherein the specific type of sound is phonemes having natural pauses, phonemes having voiceless glottal plosives, phonemes related to vowels, phonemes related to fricatives, quasi-stationary audio waveform segments of phonemes, middle audio waveform segments of phonemes, lip positions having natural pauses, or lip positions having voiceless glottal plosives. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a transmitting device for transmitting an audio stream including an audio waveform; a receiving device for receiving the audio stream including audio production circuitry configured to produce sound using the audio waveform of the audio stream; processing circuitry of the transmitting device configured to analyze the audio stream and identify a modification segment of the audio waveform, the modification segment being a segment of the audio waveform where production of the audio waveform may be modified to mitigate a delay when the receiving device receives the audio stream by temporally extending the modification segment without substantially affecting clarify of the produced sound; and processing circuitry of the receiving device configured for driving the production of sound using the audio waveform based at least in part on the modification segment that was identified; wherein the audio stream includes metadata associated with the audio waveform that indicates a position of a specific type of sound included in the audio waveform; wherein the processing circuitry of the transmitting device is configured to analyze the associated metadata and identify modification segment having the position within the specific type of sound; and wherein the specific type of sound is phonemes having natural pauses, phonemes having voiceless glottal plosives, phonemes related to vowels, phonemes related to fricatives, quasi-stationary audio waveform segments of phonemes, middle audio waveform segments of phonemes, lip positions having natural pauses, or lip positions having voiceless glottal plosives. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A method of producing sound from an audio waveform, the audio waveform being included in a received audio stream, the method comprising:
-
analyzing the audio stream to identify a modification segment of the audio waveform, the modification segment being a segment of the audio waveform where production of the audio waveform may be modified to mitigate a delay in receiving the received the audio stream by temporally extending the modification segment without substantially affecting clarity of the produced sound; producing sound using the audio waveform based at least in part on the modification segment that was identified; wherein the audio stream includes metadata associated with the audio waveform that indicates a position of a specific type of sound included in the audio waveform; analyzing the associated metadata; and identifying the modification segment having the position within the specific type of sound, the specific type of sound being phonemes having natural pauses, phonemes having voiceless glottal plosives, phonemes related to vowels, phonemes related to fricatives, quasi-stationary audio waveform segments of phonemes, middle audio waveform segments of phonemes, lip positions having natural pauses, or lip positions having voiceless glottal plosives. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification