System and method for generating challenge items for CAPTCHAs
First Claim
Patent Images
1. A method implemented by a computing system for selecting challenge items to discriminate between humans and machines in determining access to data and/or resources of a target computing system, comprising:
- (a) providing data identifying a first set of diphones to be assessed by the computing system, wherein each of said first set of diphones represents a sound associated with an articulation of a pair of phonemes in a natural language;
(b) generating a plurality of articulation scores using the computing system based on measuring acoustical characteristics of a reference machine text to speech (TTS) system articulation of each of said first set of diphones; and
(c) selecting challenge text including words and phrases from the natural language using the computing system based on said plurality of articulation scores;
wherein said challenge text has machine-indicative acoustical characteristics detectable by a speech processing computing system when articulated by the reference machine TTS system or another machine TTS system such that said challenge text is useable by an utterance-based challenge system for discriminating between humans and machines.
1 Assignment
0 Petitions
Accused Products
Abstract
Challenge items for an audible based electronic challenge system are generated using a variety of techniques to identify optimal candidates. The challenge items are intended for use in a computing system that discriminates between humans and text to speech (TTS) system.
93 Citations
18 Claims
-
1. A method implemented by a computing system for selecting challenge items to discriminate between humans and machines in determining access to data and/or resources of a target computing system, comprising:
-
(a) providing data identifying a first set of diphones to be assessed by the computing system, wherein each of said first set of diphones represents a sound associated with an articulation of a pair of phonemes in a natural language; (b) generating a plurality of articulation scores using the computing system based on measuring acoustical characteristics of a reference machine text to speech (TTS) system articulation of each of said first set of diphones; and (c) selecting challenge text including words and phrases from the natural language using the computing system based on said plurality of articulation scores; wherein said challenge text has machine-indicative acoustical characteristics detectable by a speech processing computing system when articulated by the reference machine TTS system or another machine TTS system such that said challenge text is useable by an utterance-based challenge system for discriminating between humans and machines. - View Dependent Claims (2, 3, 4, 5, 6, 8)
-
-
7. A method of selecting challenge data using a computing system to be used for accessing data and/or resources of a target computing system comprising:
-
a) selecting a candidate challenge item which includes one or both of text words or visual images; b) measuring first computer-related acoustical characteristics of a computer synthesized utterance consisting of audio of challenge content associated with said candidate challenge item using the computing system; c) measuring second human-related acoustical characteristics of a human utterance consisting of audio of said challenge content; d) generating a challenge item score based on measuring a difference in said first computer-related and second human-related acoustical characteristics; and e) designating said candidate challenge item as a reference challenge item when said challenge item score exceeds a target threshold, indicating that said challenge content has machine-indicative acoustical characteristics detectable by a speech processing system. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A method implemented by a computing system for accessing data and/or resources of a target computing system comprising:
-
a) defining a plurality of demographic groups, said demographic groups being based on one or more of age, sex, or domicile; b) providing a plurality of Completely Automatic Public Turing Test To Tell Humans and Computers Apart (CAPTCHA) challenge items consisting of a combination of images and solicited utterances with the computing system; c) for each of said challenge items, using the computing system to compare a first reference acoustic response of a machine entity and a second reference acoustic response provided by a representative of each of said plurality of demographic groups; and d) for each demographic group, selecting an optimal set of CAPTCHA challenge items determined by the computing system to yield the greatest acoustic response difference between said second reference acoustic response of said representative and said first reference acoustic response of said machine entity. - View Dependent Claims (15)
-
-
16. A system for identifying challenge data to be used for accessing data and/or resources of a target computing system comprising:
-
a first computing system; and one or more software routines embodied in a non-transitory computer readable medium that, when executed, cause the first computing system to; (a) provide data identifying a first set of diphones to be assessed the first computing system, wherein each of said first set of diphones represents a sound associated with an articulation of a pair of phonemes in a natural language; (b) generate a plurality of articulation scores based on measuring acoustical characteristics of a reference machine text to speech (TTS) system articulation of each of said first set of diphones; and (c) select challenge text including words and phrases from the natural language using the computing system based on said plurality of articulation scores; wherein said challenge text has machine-indicative acoustical characteristics detectable by a speech processing computing system when articulated by the machine TTS system or another machine TTS system such that said challenge text is useable by an utterance-based challenge system for discriminating between humans and machines.
-
-
17. A system for identifying challenge data to be used for accessing data and/or resources of a target computing system comprising:
-
a first computing system; and one or more software routines embodied in a computer readable medium that, when executed, cause the first computing system to; a) select a candidate challenge item which includes one or both of text words or visual images; b) measure first computer-related acoustical characteristics of a computer synthesized utterance consisting of audio of challenge content associated with said candidate challenge item; c) measure second human-related acoustical characteristics of a human utterance consisting of audio of said challenge content; d) generate a challenge item score based on measuring a difference in said first computer-related and second human-related acoustical characteristics; and e) designate said candidate challenge item as a reference challenge item when said challenge item score exceeds a target threshold, indicating that said challenge content has machine-indicative acoustical characteristics detectable by a speech processing system.
-
-
18. A system for identifying challenge data to be used for accessing data and/or resources of a target computing system comprising:
-
a first computing system; and one or more software routines embodied in a computer readable medium that, when executed, cause the computing system to; a) define a plurality of demographic groups, said demographic groups being based on one or more of age, sex, or domicile; b) provide a plurality of Completely Automatic Public Turing Test To Tell Humans and Computers Apart (CAPTCHA) challenge items consisting of a combination of images and solicited utterances; c) for each of said challenge items, compare a first reference response of a machine entity and a second reference response provided by a representative of each of said demographic groups; and d) for each demographic group, select an optimal set of CAPTCHA challenge items determined by the computing system to yield the greatest response difference between said second reference acoustic response of said representative and said first reference acoustic response of said machine entity.
-
Specification