System and method for verifying origin of input through spoken language analysis
Abstract
An audible-based electronic challenge system controls access to a computing resource by using a test to identify the origin of a voice. The test analyzes a spoken utterance to determine whether it was articulated by an unauthorized human or a text-to-speech (TTS) system.
149 Citations
19 Claims
1. A method of controlling access to a computing system comprising:
selecting first test text data to be articulated as a first speech utterance by a first entity providing input to the computing system based on measurable differences in articulation between a human speaker and a machine for the first test text data exceeding a target threshold;
storing a voice print for the first entity at the computing system based on said first speech utterance being converted into recognized speech data;
wherein said first entity can include either a human or a computer using a synthesized voice;
receiving a second speech utterance by a second entity;
processing said second recognized speech data with said computing system to determine whether said second speech utterance also originated from said first entity; and
controlling whether said second entity is allowed to access an account and/or data based on comparing said voice print to said second recognized speech data.
Dependent claims: 2

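The comparison and access-control steps of claim 1 can be illustrated with a minimal sketch. It assumes, purely for illustration, that a voice print is a fixed-length list of numeric features and that a match is judged by cosine similarity against a cutoff. The names `cosine_similarity`, `allow_access`, and `match_threshold` are hypothetical and do not come from the patent, which does not specify a feature representation or a matching metric.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an all-zero vector cannot match anything
    return dot / (norm_a * norm_b)


def allow_access(voice_print, second_features, match_threshold=0.9):
    """Grant access only if the second utterance resembles the stored print.

    The 0.9 cutoff is an arbitrary illustrative value, not one from the patent.
    """
    return cosine_similarity(voice_print, second_features) >= match_threshold


# Hypothetical feature vectors standing in for recognized speech data.
stored_print = [0.8, 0.1, 0.3]
same_speaker = [0.79, 0.12, 0.31]
other_speaker = [0.1, 0.9, 0.2]

print(allow_access(stored_print, same_speaker))
print(allow_access(stored_print, other_speaker))
```

A real system would derive the features from audio. The sketch only shows where the stored voice print and the second utterance meet in the access decision.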
3. A method of identifying a source of data input to a computing system using prosodic elements of speech comprising:
a) presenting a challenge item to an entity, which challenge item is associated with a reference set of words and associated reference prosodic scores;
b) receiving speech utterance from an entity related to said challenge item including an input set of words;
c) processing said speech utterance with said computing system to compute input prosodic scores of said input set of words;
d) comparing said input prosodic scores and said reference prosodic scores; and
e) generating a determination of whether said speech utterance originated from a machine or a human based on step (d);
wherein said challenge item is supplemented with visual cues, said visual cues being adapted to induce said reference prosodic scores.
Dependent claims: 4, 5

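Steps (d) and (e) of claim 3 can be sketched as a per-word comparison of prosodic scores. The sketch assumes each word carries a single score in the range 0 to 1 and that an utterance counts as human when its mean absolute deviation from the reference stays under a tolerance. The function name `classify_source`, the score scale, and the `max_mean_deviation` value are hypothetical; the claim does not say how scores are computed or compared.

```python
def classify_source(reference_scores, input_scores, max_mean_deviation=0.15):
    """Return "human" or "machine" by comparing per-word prosodic scores.

    Both arguments map each word to one prosodic score (assumed 0..1).
    """
    if set(reference_scores) != set(input_scores):
        raise ValueError("input words do not match the reference set of words")
    deviations = [
        abs(reference_scores[word] - input_scores[word])
        for word in reference_scores
    ]
    mean_deviation = sum(deviations) / len(deviations)
    return "human" if mean_deviation <= max_mean_deviation else "machine"


# Hypothetical reference scores for a challenge item with emphasized words.
reference = {"really": 0.9, "want": 0.4, "that": 0.7}
expressive_input = {"really": 0.85, "want": 0.45, "that": 0.65}
flat_input = {"really": 0.5, "want": 0.5, "that": 0.5}

print(classify_source(reference, expressive_input))
print(classify_source(reference, flat_input))
```

The flat input models a synthesized voice that gives every word the same emphasis, which is the kind of difference the visual cues in the claim are meant to expose.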
6. A method of identifying a source of data input to a computing system using prosodic elements of speech comprising:
a) presenting a challenge item to an entity, which challenge item is associated with a reference set of words and associated prosodic characteristics;
b) receiving speech utterance from an entity related to said challenge item;
wherein said reference set of words represents a selected set of one or more contiguous words which when vocalized have a measurable difference in prosodic characteristics between a reference human voice and a reference computer synthesized voice that exceeds a target threshold;
c) processing said speech utterance with said computing system to compute first prosodic characteristics of said entity;
d) generating a determination of whether said speech utterance originated from a machine or a human based on step (c).
Dependent claims: 7, 8, 9, 10

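The selection criterion in claim 6 (keep only word sequences whose human-versus-synthesized prosodic difference exceeds a target threshold) can be sketched as a simple filter. The sketch assumes each candidate phrase already has one measured prosodic value for a reference human voice and one for a reference synthesized voice. The function `select_challenge_items` and all numbers are hypothetical.

```python
def select_challenge_items(candidates, target_threshold):
    """Return phrases whose human/machine prosodic gap exceeds the threshold.

    `candidates` maps a phrase to a (human_value, machine_value) pair.
    Results are ordered from the largest gap to the smallest.
    """
    scored = [
        (phrase, abs(human - machine))
        for phrase, (human, machine) in candidates.items()
    ]
    selected = [item for item in scored if item[1] > target_threshold]
    selected.sort(key=lambda item: item[1], reverse=True)
    return [phrase for phrase, _ in selected]


# Hypothetical measurements for three candidate phrases.
candidates = {
    "oh really": (0.82, 0.31),
    "the cat sat": (0.50, 0.47),
    "no way": (0.90, 0.55),
}

print(select_challenge_items(candidates, target_threshold=0.3))
```

Phrases with a small gap, such as "the cat sat" in this made-up data, are dropped because they would not help separate a human from a synthesized voice.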
11. A method of implementing a CAPTCHA (Completely Automatic Public Turing Test To Tell Humans And Computers Apart) to identify a source of data input to a computing system comprising:
a) using a set of human test subjects to identify whether a reference challenge item was vocalized by a human or a computer;
b) training the computing system with samples of human voices and computer synthesized voices articulating a set of the reference challenge items;
c) receiving a speech utterance from an entity related to one of said set of reference challenge items;
d) determining with the trained computer system whether said speech utterance was vocalized by a machine or a human.
Dependent claims: 12

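Steps (b) through (d) of claim 11 follow a train-then-classify pattern, shown here with a nearest-centroid classifier as a stand-in. The patent does not name a model type; the nearest-centroid choice, the two-element feature vectors, and the names `train` and `classify` are assumptions made for illustration. The labels are assumed to come from the human test subjects of step (a).

```python
def centroid(samples):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    count = len(samples)
    return [sum(column) / count for column in zip(*samples)]


def train(human_samples, synthetic_samples):
    """Build a model holding one centroid per class."""
    return {
        "human": centroid(human_samples),
        "machine": centroid(synthetic_samples),
    }


def classify(model, features):
    """Return the label whose centroid is closest to `features`."""
    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(model, key=lambda label: squared_distance(model[label], features))


# Hypothetical acoustic features for utterances of the same challenge item.
human_samples = [[0.8, 0.6], [0.9, 0.7]]
synthetic_samples = [[0.2, 0.1], [0.3, 0.2]]
model = train(human_samples, synthetic_samples)

print(classify(model, [0.85, 0.65]))
print(classify(model, [0.25, 0.1]))
```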
13. A method of implementing a CAPTCHA (Completely Automatic Public Turing Test To Tell Humans And Computers Apart) to identify a source of data input to a computing system comprising:
a) training the computing system with samples of human voices articulating a set of reference challenge items;
b) receiving a speech utterance from an entity related to one of said set of reference challenge items;
c) determining with the trained computer system whether said speech utterance was vocalized by a machine or a human;
wherein said computing system uses one or more speech models that are optimized for identifying humans using said set of reference challenge items; and
wherein said set of reference challenge items represent a selected set of one or more contiguous words which when articulated have a difference in acoustical characteristics between a reference human voice and a reference computer synthesized voice that exceeds a target threshold as measured by a reference group of human listeners, and at least some of said acoustical characteristics are used to train said one or more speech models.
Dependent claims: 14

15. A challenge system for identifying a source of data input to a computing system comprising:
one or more software routines implemented in a computer readable medium and adapted to cause the challenge system to:
select first test text data to be articulated as a first speech utterance by a first entity providing input to the computing system, based on measurable differences in articulation between a human speaker and a machine for the first test text data exceeding a target threshold;
store a voice print for the first entity at the computing system based on said first speech utterance being converted into recognized speech data;
wherein said first entity can include either a human or a computer using a synthesized voice;
receive a second speech utterance by a second entity;
process said second recognized speech data with said computing system to determine whether said second speech utterance also originated from said first entity;
control whether said second entity is allowed to access an account and/or data based on comparing said voice print to said second recognized speech data.

16. A challenge system for identifying a source of data input to a computing system using prosodic elements of speech comprising:
one or more software routines implemented in a computer readable medium and adapted to cause the challenge system to:
a) present a challenge item to an entity, which challenge item is associated with a reference set of words and associated reference prosodic scores;
b) receive speech utterance from an entity related to said challenge item including an input set of words;
c) process said speech utterance with said computing system to compute input prosodic scores of said input set of words;
d) compare said input prosodic scores and said reference prosodic scores; and
e) generate a determination of whether said speech utterance originated from a machine or a human based on step (d);
wherein said challenge item is supplemented with visual cues, said visual cues being adapted to induce said reference prosodic scores.

17. A challenge system for identifying a source of data input to a computing system using prosodic elements of speech comprising:
one or more software routines implemented in a computer readable medium and adapted to cause the challenge system to:
a) present a challenge item to an entity, which challenge item is associated with a reference set of words and associated prosodic characteristics;
b) receive speech utterance from an entity related to said challenge item;
wherein said reference set of words represents a selected set of one or more contiguous words which when vocalized have a measurable difference in prosodic characteristics between a reference human voice and a reference computer synthesized voice that exceeds a target threshold;
c) process said speech utterance with said computing system to compute first prosodic characteristics of said entity;
d) generate a determination of whether said speech utterance originated from a machine or a human based on step (c).

18. A system for implementing a CAPTCHA (Completely Automatic Public Turing Test To Tell Humans And Computers Apart) to identify a source of data input to a computing system comprising:
one or more software routines implemented in a computer readable medium and adapted to cause the challenge system to:
a) use a set of human test subjects to identify whether a reference challenge item was vocalized by a human or a computer;
b) train the computing system with samples of human voices and computer synthesized voices articulating a set of the reference challenge items;
c) receive a speech utterance from an entity related to one of said set of reference challenge items;
d) determine with the trained computer system whether said speech utterance was vocalized by a machine or a human.

19. A system for implementing a CAPTCHA (Completely Automatic Public Turing Test To Tell Humans And Computers Apart) to identify a source of data input to a computing system comprising:
one or more software routines implemented in a computer readable medium and adapted to cause the challenge system to:
a) train the computing system with samples of human voices articulating a set of reference challenge items;
b) receive a speech utterance from an entity related to one of said set of reference challenge items;
c) determine with the trained computer system whether said speech utterance was vocalized by a machine or a human;
wherein said computing system uses one or more speech models that are optimized for identifying humans using said set of reference challenge items; and
wherein said set of reference challenge items represent a selected set of one or more contiguous words which when articulated have a difference in acoustical characteristics between a reference human voice and a reference computer synthesized voice that exceeds a target threshold as measured by a reference group of human listeners, and at least some of said acoustical characteristics are used to train said one or more speech models.

Specification