Automated removal of personally identifiable information
First Claim
1. A method of processing natural language communications comprising:
- receiving, via a natural language system, user-input that includes restricted information spoken by a user;
identifying the restricted information in the user-input, the identifying comprising;
determining that one or more words of the user-input match one or more words of a restricted string;
determining, at least partly in response to determining that the one or more words of the user-input match the one or more words of the restricted string, a context of the one or more words of the user-input; and
determining that the one or more words of the user-input comprise the restricted information based at least in part on the context and based at least in part on the one or more words of the user-input matching the one or more words of the restricted string;
generating obfuscated user-input by removing the restricted information from the user-input;
storing the obfuscated user-input in a datastore; and
training the natural language system with training data that includes the obfuscated user-input.
2 Assignments
0 Petitions
Accused Products
Abstract
A natural language system may receive user-input. The user-input may include personal or restrictable information. The natural language system may provide a dual processing system. The natural language system may store a true copy of the user-input, which may include the personal or restrictable information. The natural language system may also generate an obfuscated copy of the user-input that does not contain personal or restricted information. The true copy of the user-input may be stored in a secure storage system and may be retrieved by authorized personnel, which may include the user who provided the user-input. The obfuscated copy of the user-input may be stored in a storage system and may be employed in ongoing training of the natural language system.
36 Citations
20 Claims
-
1. A method of processing natural language communications comprising:
-
receiving, via a natural language system, user-input that includes restricted information spoken by a user; identifying the restricted information in the user-input, the identifying comprising; determining that one or more words of the user-input match one or more words of a restricted string; determining, at least partly in response to determining that the one or more words of the user-input match the one or more words of the restricted string, a context of the one or more words of the user-input; and determining that the one or more words of the user-input comprise the restricted information based at least in part on the context and based at least in part on the one or more words of the user-input matching the one or more words of the restricted string; generating obfuscated user-input by removing the restricted information from the user-input; storing the obfuscated user-input in a datastore; and training the natural language system with training data that includes the obfuscated user-input. - View Dependent Claims (2, 3)
-
-
4. A method of processing natural language communications comprising:
-
identifying, via a natural language system, restricted information in a user-input, the identifying of the restricted information comprising; identifying a structure of a restrictable-phrase candidate included in the user-input; comparing the structure of the restrictable-phrase candidate to a restricted structure; and determining that the restrictable-phrase candidate includes the restricted information based at least in part on the comparing of the structure of the restrictable-phrase candidate to the restricted structure; generating obfuscated user-input in which the restricted information is not discernible; providing at least a portion of the user-input to a first datastore; and providing the obfuscated user-input to a second datastore. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. One or more non-transitory computer-readable storage media having computer-executable instructions thereon which, when executed by a computing device, implement a method comprising:
-
identifying restricted information in a user-input; determining, from the user-input, a context of the restricted information; determining, based at least in part on the context of the restricted information, non-restricted information to substitute for the restricted information; generating an obfuscated user-input by replacing the restricted information in the user-input with the non-restricted information; providing at least a portion of the user-input to a first datastore; and providing the obfuscated user-input to a second datastore. - View Dependent Claims (18)
-
-
19. A natural language system comprising:
-
one or more processors; and one or more computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising; identifying restricted information in a user-input; determining, from the user-input, a context of the restricted information; determining, based at least in part on the context of the restricted information, non-restricted information to substitute for the restricted information; generating an obfuscated user-input by replacing the restricted information in the user-input with the non-restricted information; providing at least a portion of the user-input to a first datastore; and providing the obfuscated user-input to a second datastore. - View Dependent Claims (20)
-
Specification