Audio signal de-identification
First Claim
1. A computer-implemented method, tangibly embodied as a computer program recorded in a computer-readable medium, the method comprising:
(A) identifying a first portion of an original audio signal, the first portion representing sensitive content, comprising:
(A)(1) generating a report, the report comprising:
(a) content representing information in the original audio signal, and (b) a timestamp indicating a temporal position of the first portion of the original audio signal;
(A)(2) identifying a first personally identifying concept in the report;
(A)(3) identifying a first timestamp in the report corresponding to the first personally identifying concept; and
(A)(4) identifying a portion of the original audio signal corresponding to the first personally identifying concept by using the first timestamp; and
(B) producing a modified audio signal in which the first portion is protected against unauthorized disclosure.
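Steps (A)(1) through (B) can be illustrated with a minimal sketch. It is not the patented implementation. The `ReportEntry` type, its `end_s` field, the `is_identifying` flag, and the choice to silence samples as the form of protection are all assumptions made for illustration; the claim only requires a timestamp and some form of protection against unauthorized disclosure.

```python
from dataclasses import dataclass


@dataclass
class ReportEntry:
    """One piece of report content plus its position in the audio (hypothetical type)."""
    text: str             # content representing information in the audio
    start_s: float        # timestamp: where the matching speech starts, in seconds
    end_s: float          # assumed end of the matching speech, in seconds
    is_identifying: bool  # assumed to be set by some upstream concept tagger


def find_sensitive_spans(report):
    """Steps (A)(2)-(A)(4): select identifying concepts and return their time spans."""
    return [(e.start_s, e.end_s) for e in report if e.is_identifying]


def protect(samples, sample_rate, spans):
    """Step (B): return a copy of the signal with every sample in each span zeroed."""
    out = list(samples)
    for start_s, end_s in spans:
        lo = max(0, int(round(start_s * sample_rate)))
        hi = min(len(out), int(round(end_s * sample_rate)))
        for i in range(lo, hi):
            out[i] = 0
    return out


# Step (A)(1): a toy report for a 2-second signal at 4 samples per second.
report = [
    ReportEntry("patient", 0.0, 0.5, False),
    ReportEntry("John Smith", 0.5, 1.0, True),
    ReportEntry("presents with cough", 1.0, 2.0, False),
]
samples = [1] * 8
modified = protect(samples, 4, find_sensitive_spans(report))
print(modified)  # [1, 1, 0, 0, 1, 1, 1, 1]
```

The original signal is left untouched and a modified copy is returned, which mirrors the claim's distinction between the original audio signal and the modified audio signal.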
Abstract
Techniques are disclosed for automatically de-identifying spoken audio signals. In particular, techniques are disclosed for automatically removing personally identifying information from spoken audio signals and replacing such information with non-personally identifying information. De-identification of a spoken audio signal may be performed by automatically generating a report based on the spoken audio signal. The report may include concept content (e.g., text) corresponding to one or more concepts represented by the spoken audio signal. The report may also include timestamps indicating temporal positions of speech in the spoken audio signal that corresponds to the concept content. Concept content that represents personally identifying information is identified. Audio corresponding to the personally identifying concept content is removed from the spoken audio signal. The removed audio may be replaced with non-personally identifying audio.
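The replacement step mentioned at the end of the abstract can be sketched as follows. The abstract does not say what the non-personally identifying audio is, so the fixed 440 Hz tone is an assumption, as are the function names and the sample-list representation of the signal.

```python
import math


def replacement_tone(n_samples, sample_rate, freq_hz=440.0, amplitude=0.3):
    """Generate a sine tone to stand in for removed speech (assumed replacement audio)."""
    return [
        amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
        for i in range(n_samples)
    ]


def replace_span(samples, sample_rate, start_s, end_s):
    """Return a copy of the signal with the span [start_s, end_s) replaced by a tone."""
    lo = max(0, int(round(start_s * sample_rate)))
    hi = min(len(samples), int(round(end_s * sample_rate)))
    hi = max(lo, hi)  # guard against an empty or inverted span
    tone = replacement_tone(hi - lo, sample_rate)
    return samples[:lo] + tone + samples[hi:]


# Two seconds of constant-valued "speech" at 8 kHz; replace 0.5 s to 1.0 s.
audio = [0.5] * 16000
out = replace_span(audio, 8000, 0.5, 1.0)
```

Because the replacement has the same number of samples as the removed span, the modified signal keeps the original duration, so any other timestamps in the report would still line up with the audio.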
4 Claims
1. A computer-implemented method, tangibly embodied as a computer program recorded in a computer-readable medium, the method comprising:
(A) identifying a first portion of an original audio signal, the first portion representing sensitive content, comprising:
(A)(1) generating a report, the report comprising:
(a) content representing information in the original audio signal, and (b) a timestamp indicating a temporal position of the first portion of the original audio signal;
(A)(2) identifying a first personally identifying concept in the report;
(A)(3) identifying a first timestamp in the report corresponding to the first personally identifying concept; and
(A)(4) identifying a portion of the original audio signal corresponding to the first personally identifying concept by using the first timestamp; and
(B) producing a modified audio signal in which the first portion is protected against unauthorized disclosure.
Dependent claims: 2, 3
4. A system comprising:
sensitive content identification means comprising:
means for identifying a first portion of an original audio signal, the first portion representing sensitive content;
means for generating a report, the report comprising:
(a) content representing information in the original audio signal, and (b) at least one timestamp indicating at least one temporal position of at least one portion of the original audio signal corresponding to the content;
means for identifying a first personally identifying concept in the report;
means for identifying a first timestamp in the report corresponding to the first personally identifying concept; and
means for identifying a portion of the original audio signal corresponding to the first personally identifying concept by using the first timestamp; and
modified audio signal production means for producing a modified audio signal in which the identified first portion is protected against unauthorized disclosure.
Specification