System and method of recording utterances using unmanaged crowds for natural language processing
First Claim
1. A computer-implemented method of recording utterances from unmanaged crowds for natural language processing, the method being implemented in a user device having one or more physical processors programmed with computer program instructions that, when executed by the one or more physical processors, cause the user device to perform the method, the method comprising:
- obtaining, by the user device, a token;
transmitting, by the user device, the token to a remote device via a network;
receiving, at the user device, from the remote device, one or more utterances to be uttered by a user and one or more campaign configuration parameters based on the token, wherein the one or more utterances and the one or more campaign configuration parameters are associated with a campaign that is associated with a natural language processing data collection effort;
configuring, by the user device, the computer program instructions based on the one or more campaign configuration parameters;
presenting to the user, by the user device, the one or more utterances to be uttered by the user; and
recording, by the user device, audio of the user uttering the one or more utterances.
7 Assignments
0 Petitions
Accused Products
Abstract
A system and method of recording utterances for building Named Entity Recognition (“NER”) models, which are used to build dialog systems in which a computer listens and responds to human voice dialog. Utterances to be uttered may be provided to users through their mobile devices, which may record the user uttering (e.g., verbalizing, speaking, etc.) the utterances and upload the recording to a computer for processing. The use of the user'"'"'s mobile device, which is programmed with an utterance collection application (e.g., configured as a mobile app), facilitates the use of crowd-sourcing human intelligence tasking for widespread collection of utterances from a population of users. As such, obtaining large datasets for building NER models may be facilitated by the system and method disclosed herein.
-
Citations
32 Claims
-
1. A computer-implemented method of recording utterances from unmanaged crowds for natural language processing, the method being implemented in a user device having one or more physical processors programmed with computer program instructions that, when executed by the one or more physical processors, cause the user device to perform the method, the method comprising:
-
obtaining, by the user device, a token; transmitting, by the user device, the token to a remote device via a network; receiving, at the user device, from the remote device, one or more utterances to be uttered by a user and one or more campaign configuration parameters based on the token, wherein the one or more utterances and the one or more campaign configuration parameters are associated with a campaign that is associated with a natural language processing data collection effort; configuring, by the user device, the computer program instructions based on the one or more campaign configuration parameters; presenting to the user, by the user device, the one or more utterances to be uttered by the user; and recording, by the user device, audio of the user uttering the one or more utterances. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A system for recording utterances from unmanaged crowds for natural language processing, the system comprising:
-
a user device having one or more physical processors programmed with computer program instructions that, when executed by the one or more physical processors, cause the user device to; obtain a token; transmit the token to a remote device via a network; receive, from the remote device, one or more utterances to be uttered by a user and one or more campaign configuration parameters based on the token, wherein the one or more utterances and the one or more campaign configuration parameters are associated with a campaign that is associated with a natural language processing data collection effort; configure the computer program instructions based on the one or more campaign configuration parameters; present to the user the one or more utterances to be uttered by the user; and record audio of the user uttering the one or more utterances. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
Specification