Apparatus and method for providing voice recognition for multiple speakers

US 20040186712A1
Filed: 03/18/2003
Published: 09/23/2004
Est. Priority Date: 03/18/2003
Status: Active Grant

Abstract

Utilizing individual voice recognition units for each speaker in a conference to perform automatic transcription of that speaker's contribution to the conference. The output of each of the voice recognition units is then merged on a time basis to produce a textual transcription of the entire telecommunication conference call.

19 Claims

1. A method for converting all audio information from a plurality of participants to combined textual information, comprising the steps of:
- performing independent voice recognition operations on audio information from each of the plurality of participants thereby creating a stream of textual information for each of the plurality of participants;
  
  inserting time stamps into each stream of textual information; and
  
  combining the streams of textual information and time stamps into combined textual information on the basis of the time stamps in each of the plurality of streams of textual information whereby the combined textual information is textual information of the all audio information.
- Dependent claims (2, 3, 4, 5, 6, 7, 8):
  - 2. The method of claim 1 wherein the step of performing is executed by a separate voice recognition unit for each of the plurality of participants.
  - 3. The method of claim 2 comprises the step of training each voice recognition unit for its participant.
  - 4. The method of claim 1 wherein the step of performing is executed by one processor.
  - 5. The method of claim 4 comprises the step of training the processor for each participant.
  - 6. The method of claim 1 wherein the step of inserting comprises the step of placing a time stamp with each word of each stream of textual information.
  - 7. The method of claim 6 wherein the step of combining comprises the step of sorting the words from each of the streams into the combined information on the basis of the time stamps.
  - 8. The method of claim 1 further comprises storing the audio information with time stamps.
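
The method of claims 1, 6, and 7 can be illustrated with a minimal Python sketch. It assumes each recognizer has already produced its words, and it models a time stamp as seconds from the start of the conference. The names `TimedWord`, `stamp_stream`, `combine_streams`, and `render` are hypothetical and do not come from the patent. A heap-based k-way merge is used here as one possible way to order words by time stamp; the claims do not prescribe a particular sorting technique.

```python
import heapq
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass(frozen=True)
class TimedWord:
    timestamp: float  # seconds since conference start (assumed unit)
    speaker: str
    word: str


def stamp_stream(speaker: str, words: Iterable[Tuple[float, str]]) -> List[TimedWord]:
    """Place a time stamp with each recognized word of one participant (claim 6)."""
    return [TimedWord(t, speaker, w) for t, w in words]


def combine_streams(streams: Iterable[List[TimedWord]]) -> List[TimedWord]:
    """Merge per-participant streams into one stream ordered by time stamp (claim 7).

    Each input stream is assumed to already be in time order, which holds
    when words are stamped as they are recognized.
    """
    return list(heapq.merge(*streams, key=lambda tw: tw.timestamp))


def render(combined: List[TimedWord]) -> str:
    """Group consecutive words from the same speaker into transcript lines."""
    lines: List[str] = []
    current = None
    for tw in combined:
        if tw.speaker != current:
            lines.append(f"{tw.speaker}: {tw.word}")
            current = tw.speaker
        else:
            lines[-1] += f" {tw.word}"
    return "\n".join(lines)


# Illustrative data only; the speakers and words are made up.
alice = stamp_stream("Alice", [(0.0, "hello"), (0.4, "everyone"), (2.1, "thanks")])
bob = stamp_stream("Bob", [(1.0, "hi"), (1.3, "Alice")])
transcript = render(combine_streams([alice, bob]))
```

Stamping every word, as opposed to every sentence, lets the merge interleave speakers who talk over one another, at the cost of a larger stream.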

9. A processor-readable medium comprising processor-executable instructions configured for:
- performing independent voice recognition operations on audio information from each of the plurality of participants thereby creating a stream of textual information for each of the plurality of participants;
  
  inserting time stamps into each stream of textual information; and
  
  combining the streams of textual information and time stamps into combined textual information on the basis of the time stamps in each of the plurality of streams of textual information whereby the combined textual information is textual information of the audio.
- Dependent claims (10, 11, 12, 13, 14, 15, 16):
  - 10. The processor-readable medium of claim 9 wherein the performing is executed by a separate voice recognition unit for each of the plurality of participants.
  - 11. The processor-readable medium of claim 10 comprises the training each voice recognition unit for its participant.
  - 12. The processor-readable medium of claim 9 wherein the performing is executed by one processor.
  - 13. The processor-readable medium of claim 12 comprises the training the processor for each participant.
  - 14. The processor-readable medium of claim 9 wherein the inserting comprises the placing a time stamp with each word of each stream of textual information.
  - 15. The processor-readable medium of claim 14 wherein the combining comprises the sorting the words from each of the streams into the combined information on the basis of the time stamps.
  - 16. The processor-readable medium of claim 9 further comprises storing the audio information with time stamps.

17. An apparatus for converting all audio information from a plurality of participants to combined information, comprising:
- a plurality of voice recognition units each assigned to one of the plurality of participants whereby audio information from each of the assigned participant is processed to produce a textual stream with time stamps; and
  
  a controller response to the textual streams with time stamps for combining the streams into combined textual information.
- Dependent claims (18, 19):
  - 18. The apparatus of claim 17 wherein each voice recognition units is implemented by an individual processor.
  - 19. The apparatus of claim 17 wherein the plurality of voice recognition units are implemented by one processor.
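
The apparatus of claims 17 to 19 can be sketched as one recognition unit per participant plus a controller that combines their time-stamped streams. This is only an illustration under stated assumptions: `VoiceRecognitionUnit`, `Controller`, and `fake_recognizer` are hypothetical names, the recognizer is a stand-in that treats its input as already-recognized text, and worker threads stand in for the processors named in claims 18 and 19.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Tuple

# An audio segment is modeled as (start_time_seconds, payload). In this sketch
# the payload is already text, because the recognizer is a stand-in.
Segment = Tuple[float, str]
StampedText = Tuple[float, str, str]  # (time stamp, participant, text)


class VoiceRecognitionUnit:
    """One unit assigned to one participant (hypothetical class)."""

    def __init__(self, participant: str, recognize: Callable[[str], str]):
        self.participant = participant
        self.recognize = recognize  # placeholder for a real speech-to-text engine

    def process(self, segments: List[Segment]) -> List[StampedText]:
        # Produce this participant's textual stream with time stamps.
        return [(start, self.participant, self.recognize(audio)) for start, audio in segments]


class Controller:
    """Combines the time-stamped textual streams from all units."""

    def __init__(self, units: List[VoiceRecognitionUnit]):
        self.units = {u.participant: u for u in units}

    def transcribe(self, audio: Dict[str, List[Segment]], workers: int) -> List[StampedText]:
        # workers=len(units) loosely mirrors claim 18; workers=1 mirrors claim 19.
        with ThreadPoolExecutor(max_workers=workers) as pool:
            futures = [pool.submit(self.units[p].process, segs) for p, segs in audio.items()]
            streams = [f.result() for f in futures]
        combined = [item for stream in streams for item in stream]
        return sorted(combined, key=lambda item: item[0])


def fake_recognizer(audio: str) -> str:
    # Stand-in: the "audio" is already text; a real unit would call an ASR engine.
    return audio.strip()


units = [VoiceRecognitionUnit("Alice", fake_recognizer), VoiceRecognitionUnit("Bob", fake_recognizer)]
audio_in = {
    "Alice": [(0.0, "shall we start"), (5.2, "agreed")],
    "Bob": [(2.5, "yes, first item is the budget")],
}
combined = Controller(units).transcribe(audio_in, workers=len(units))
```

Because the controller orders entries by time stamp only after every unit has finished, the combined result does not depend on how many workers ran the units or on which unit finished first.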

Current Assignee: Arlington Technologies, LLC (Dominion Harbor Enterprises, LLC)
Original Assignee: Avaya Incorporated
Inventors: Gentle, Christopher Reon; Goringe, Christopher Michael; Orbach, Julian James; Harrison, Rodney; Coles, Scott David
Granted Patent: US 7,844,454 B2
US Class Current: 704/235
CPC Class Codes: G10L 15/26 Speech to text systems G10L...
