SPEECH COMMUNICATION SYSTEM AND METHOD, AND ROBOT APPARATUS

US 20120232891A1
Filed: 05/16/2012
Published: 09/13/2012
Est. Priority Date: 07/03/2003
Status: Active Grant

First Claim

Patent Images

1. A speech communication system enabling a conversation with a conversation partner, said system comprising:

a generation unit configured to generate a plurality of auditory communications according to a predetermined rule;

a speech recognition unit, in an apparatus, configured to recognize a speech content of the conversation partner;

an estimation control unit configured to estimate intentions of the conversation partner from the speech content recognized by the speech recognition unit;

a conversation control unit configured to dynamically select one of the plurality of auditory communications based on the estimation by the estimation control unit;

an audio output unit configured to output the one of the plurality of auditory communications selected by the conversation control unit;

an image recognition unit, in the apparatus, configured to recognize a face of the conversation partner;

a touch sensing unit, in the apparatus, configured to recognize a touch input by the conversation partner;

a tracking control unit configured to determine whether or not to continue the conversation based on a recognition result from the image recognition unit or the touch sensing unit; and

a network interface configured to communicate with an external network.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

This invention realizes a speech communication system and method, and a robot apparatus capable of significantly improving entertainment property. A speech communication system with a function to make conversation with a conversation partner is provided with a speech recognition means for recognizing speech of the conversation partner, a conversation control means for controlling conversation with the conversation partner based on the recognition result of the speech recognition means, an image recognition means for recognizing the face of the conversation partner, and a tracking control means for tracing the existence of the conversation partner based on one or both of the recognition result of the image recognition means and the recognition result of the speech recognition means. The conversation control means controls conversation so as to continue depending on tracking of the tracking control means.

28 Citations

View as Search Results

20 Claims

1. A speech communication system enabling a conversation with a conversation partner, said system comprising:
- a generation unit configured to generate a plurality of auditory communications according to a predetermined rule;
  
  a speech recognition unit, in an apparatus, configured to recognize a speech content of the conversation partner;
  
  an estimation control unit configured to estimate intentions of the conversation partner from the speech content recognized by the speech recognition unit;
  
  a conversation control unit configured to dynamically select one of the plurality of auditory communications based on the estimation by the estimation control unit;
  
  an audio output unit configured to output the one of the plurality of auditory communications selected by the conversation control unit;
  
  an image recognition unit, in the apparatus, configured to recognize a face of the conversation partner;
  
  a touch sensing unit, in the apparatus, configured to recognize a touch input by the conversation partner;
  
  a tracking control unit configured to determine whether or not to continue the conversation based on a recognition result from the image recognition unit or the touch sensing unit; and
  
  a network interface configured to communicate with an external network.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The speech communication system according to claim 1,wherein at least one of the estimation control unit and the conversation control unit are performed remotely from the apparatus and transmitted to the apparatus via the external network.
  - 3. The speech communication system according to claim 1,wherein the estimation control unit is further configured to estimate intentions of the conversation partner from the speech content recognized by the speech recognition unit and one or more previous interactions with the conversation partner.
  - 4. The speech communication system according to claim 1, wherein the conversation control unit is further configured to dynamically select one of the plurality of auditory communications based on the estimation by the estimation control unit and a selection algorithm.
  - 5. The speech communication system according to claim 4, wherein the selection algorithm is up dateable.

6. A speech communication apparatus enabling a conversation with a conversation partner, comprising:
- a speech recognition unit configured to recognize a speech content of the conversation partner;
  
  an audio output unit configured to output auditory communications;
  
  an image recognition unit configured to recognize a face of the conversation partner;
  
  a touch sensing unit configured to recognize a touch input by the conversation partner;
  
  a tracking control unit configured to determine whether or not to continue the conversation based on a recognition result from the image recognition unit or the touch sensing unit; and
  
  a network interface configured to communicate with an external network.
- View Dependent Claims (7, 8, 9, 10, 11)
- - 7. The speech communication apparatus according to claim 6,wherein the audio output unit is further configured to output one of a plurality of auditory communications selected by a conversation control unit,wherein the conversation control unit is configured to dynamically select one of a plurality of auditory communications, generated by a generation unit, based on an estimation by an estimation control unit,wherein the estimation control unit is configured to estimate intentions of the conversation partner from the speech content recognized by the speech recognition unit, andwherein the generation unit is configured to generate the plurality of auditory communications according to a predetermined rule.
  - 8. The speech communication apparatus according to claim 7,wherein at least one of the estimation control unit and the conversation control unit are performed remotely from the apparatus and transmitted to the apparatus via the external network.
  - 9. The speech communication apparatus according to claim 7,wherein the estimation control unit is further configured to estimate intentions of the conversation partner from the speech content recognized by the speech recognition unit and one or more previous interactions with the conversation partner.
  - 10. The speech communication apparatus according to claim 7, wherein the conversation control unit is further configured to dynamically select one of the plurality of auditory communications based on the estimation by the estimation control unit and a selection algorithm.
  - 11. The speech communication apparatus according to claim 10, wherein the selection algorithm is updateable.

12. A speech communication method enabling a conversation with a conversation partner, said method comprising:
- generating a plurality of auditory communications according to a predetermined rule;
  
  recognizing, using a speech recognition unit in an apparatus, a speech content of the conversation partner;
  
  estimating intentions of the conversation partner from the speech content recognized by the recognizing of the speech recognition unit;
  
  dynamically selecting one of the plurality of auditory communications based on the estimation by the estimating;
  
  outputting the one of the plurality of auditory communications selected by the dynamically selecting;
  
  recognizing, using an image recognition unit in the apparatus, a face of the conversation partner;
  
  recognizing, using a touch sensing unit in the apparatus, a touch input by the conversation partner; and
  
  determining whether or not to continue the conversation based on a recognition result from recognizing by the image recognition unit or the touch sensing unit.
- View Dependent Claims (13, 14, 15, 16)
- - 13. The speech communication method according to claim 12,wherein at least one of the estimating and the dynamically selecting are performed remotely from the apparatus and transmitted to the apparatus via an external network.
  - 14. The speech communication method according to claim 12,wherein the estimating further comprises estimating intentions of the conversation partner from the speech content recognized by the speech recognition unit and one or more previous interactions with the conversation partner.
  - 15. The speech communication method according to claim 12, wherein the dynamically selecting further comprises dynamically selecting one of the plurality of auditory communications based on the estimation by the estimation control unit and a selection algorithm.
  - 16. The speech communication system according to claim 15, wherein the selection algorithm is updateable.

17. A non-transitory computer readable medium having stored thereon a program that when executed by a computing device causes the computing device to implement a speech communication method enabling a conversation with a conversation partner, said method comprising:
- generating a plurality of auditory communications according to a predetermined rule;
  
  recognizing, using a speech recognition unit in an apparatus, a speech content of the conversation partner;
  
  estimating intentions of the conversation partner from the speech content recognized by the recognizing by the speech recognition unit;
  
  dynamically selecting one of the plurality of auditory communications based on the estimation by the estimating;
  
  outputting the one of the plurality of auditory communications selected by the dynamically selecting;
  
  recognizing, using an image recognition unit in the apparatus, a face of the conversation partner;
  
  recognizing, using a touch sensing unit in the apparatus, a touch input by the conversation partner; and
  
  determining whether or not to continue the conversation based on a recognition result from recognizing by the image recognition unit or the touch sensing unit.
- View Dependent Claims (18, 19, 20)
- - 18. The non-transitory computer readable medium according to claim 17,wherein at least one of the estimating and the dynamically selecting are performed remotely from the apparatus and transmitted to the apparatus via an external network.
  - 19. The non-transitory computer readable medium according to claim 17,wherein the estimating further comprises estimating intentions of the conversation partner from the speech content recognized by the speech recognition unit and one or more previous interactions with the conversation partner.
  - 20. The non-transitory computer readable medium according to claim 17, wherein the dynamically selecting further comprises dynamically selecting one of the plurality of auditory communications based on the estimation by the estimation control unit and an updatable selection algorithm.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Sony Corporation (Sony Group Corp.)
Original Assignee
Sony Corporation (Sony Group Corp.)
Inventors
Aoyama, Kazumi, Shimomura, Hideki

Granted Patent

US 8,321,221 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/231
CPC Class Codes

G10L 15/22 Procedures used during a sp...

SPEECH COMMUNICATION SYSTEM AND METHOD, AND ROBOT APPARATUS

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

28 Citations

20 Claims

Specification

Solutions

Use Cases

Quick Links

SPEECH COMMUNICATION SYSTEM AND METHOD, AND ROBOT APPARATUS

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

28 Citations

20 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links