Dynamic protection of personal information in audio recordings
First Claim
1. A method to protect personal identifiable information (PII) or sensitive personal information (SPI) in audio recordings, comprising:
- obtaining a first audio file associated with a first channel of a voice call;
obtaining a second audio file associated with a second channel of the voice call;
partitioning, by one or more processors, the first audio file into one or more segments comprising spoken digits, each segment comprising a start time and an end time, by;
detecting a spoken digit in the first audio file at a timepoint;
determining whether a subsequent spoken digit occurred within a certain amount of time of a previous spoken digit; and
setting the end time of the segment when the certain amount of time between subsequent digits is exceeded with no digits being detected;
identifying in which of the one or more segments of the first audio file the number of spoken digits exceeds a certain number;
partitioning, by the one or more processors, the second audio file into segments corresponding in time to the identified segments of the first audio file;
determining whether to tag the voice call as containing PII or SPI in response to determining whether one or more trigger words are spoken in at least one of the corresponding segments of the second audio file;
partitioning, by one or more processors, the second audio file into one or more segments comprising spoken digits, each segment comprising a start time and an end time, by;
detecting a spoken digit in the second audio file at a timepoint;
determining whether a subsequent spoken digit occurred within a certain amount of time of a previous spoken digit; and
setting the end time of the segment when the certain amount of time between subsequent digits is exceeded with no digits being detected;
identifying one or more segments of the second audio file in which the number of spoken digits exceeds a certain number;
partitioning the first audio file into one or more segments corresponding in time to the one or more identified segments in the second audio file;
determining whether to tag the voice call as containing PII or SPI in response to determining whether one or more trigger words are spoken in at least one of the one or more corresponding segments of the first audio file.
1 Assignment
0 Petitions
Accused Products
Abstract
In a general aspect, audio recordings are managed to protect personal identifiable information (PII) and or sensitive personal information (SPI). In an aspect, a first audio file associated with a voice call and a second audio file associated with the voice call are obtained. The first audio file is partitioned into one or more segments. Which of the one or more segments of the first audio file the number of spoken digits exceeds a certain number are identified. The second audio file is partitioned into segments corresponding in time to the identified segments of the first audio file. The voice call is tagged as containing PII or SPI in response to determining that trigger words are spoken in at least one of the segments of the second audio file.
-
Citations
23 Claims
-
1. A method to protect personal identifiable information (PII) or sensitive personal information (SPI) in audio recordings, comprising:
-
obtaining a first audio file associated with a first channel of a voice call; obtaining a second audio file associated with a second channel of the voice call; partitioning, by one or more processors, the first audio file into one or more segments comprising spoken digits, each segment comprising a start time and an end time, by; detecting a spoken digit in the first audio file at a timepoint; determining whether a subsequent spoken digit occurred within a certain amount of time of a previous spoken digit; and setting the end time of the segment when the certain amount of time between subsequent digits is exceeded with no digits being detected; identifying in which of the one or more segments of the first audio file the number of spoken digits exceeds a certain number; partitioning, by the one or more processors, the second audio file into segments corresponding in time to the identified segments of the first audio file; determining whether to tag the voice call as containing PII or SPI in response to determining whether one or more trigger words are spoken in at least one of the corresponding segments of the second audio file; partitioning, by one or more processors, the second audio file into one or more segments comprising spoken digits, each segment comprising a start time and an end time, by; detecting a spoken digit in the second audio file at a timepoint; determining whether a subsequent spoken digit occurred within a certain amount of time of a previous spoken digit; and setting the end time of the segment when the certain amount of time between subsequent digits is exceeded with no digits being detected; identifying one or more segments of the second audio file in which the number of spoken digits exceeds a certain number; partitioning the first audio file into one or more segments corresponding in time to the one or more identified segments in the second audio file; determining whether to tag the voice call as containing PII or SPI in response to determining whether one or more trigger words are spoken in at least one of the one or more corresponding segments of the first audio file. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A node device, for protecting personal identifiable information (PII) or sensitive personal information (SPI) in audio recordings, comprising one or more processors and memory storing instructions that when executed by the one or more processors cause the node device to:
-
obtain a first audio file associated with a first channel of a voice call; obtain a second audio file associated with a second channel of the voice call; partition the first audio file into one or more segments comprising spoken digits, each segment comprising a start time and an end time, by; detecting a spoken digit in the first audio file at a timepoint;
determining whether a subsequent spoken digit occurred within a certain amount of time of a previous spoken digit;and setting the end time of the segment when the certain amount of time between subsequent digits is exceeded with no digits being detected; identify in which of the one or more segments of the first audio file the number of spoken digits exceeds a certain number; partition the second audio file into segments corresponding in time to the identified segments of the first audio file;
tag the voice call as containing PII or SPI in response to determining that one or more trigger words are spoken in at least one of the corresponding segments of the second audio file;partition the second audio file into one or more segments comprising spoken digits, each segment comprising a start time and an end time, by; detecting a spoken digit in the second audio file at a timepoint;
determining whether a subsequent spoken digit occurred within a certain amount of time of a previous spoken digit; and
setting the end time of the segment when the certain amount of time between subsequent digits is exceeded with no digits being detected;identify one or more segments of the second audio file in which the number of spoken digits exceeds a certain number;
partition the first audio file into one or more segments corresponding in time to the one or more identified segments in the second audio file;tag the voice call as containing PII or SPI in response to determining that one or more trigger words are spoken in at least one of the one or more corresponding segments of the first audio file. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable medium for protecting personal identifiable information (PII) or sensitive personal information (SPI) in audio recordings,
comprising instructions that when executed by one or more processors of a node device are operable to cause the node device to: -
obtain a first audio file associated with a first channel of a voice call;
obtain a second audio file associated with a second channel of the voice call;partition the first audio file into one or more segments comprising spoken digits, each segment comprising a start time and an end time, by; detecting a spoken digit in the first audio file at a timepoint; determining whether a subsequent spoken digit occurred within a certain amount of time of a previous spoken digit; and setting the end time of the segment when the certain amount of time between subsequent digits is exceeded with no digits being detected; identify in which of the one or more segments of the first audio file the number of spoken digits exceeds a certain number; partition the second audio file into segments corresponding in time to the identified segments of the first audio file; and tag the voice call as containing PII or SPI in response to determining that one or more trigger words are spoken in at least one of the corresponding segments of the second audio file; partition the second audio file into one or more segments comprising spoken digits, each segment comprising a start time and an end time, by; detecting a spoken digit in the second audio file at a timepoint;
determining whether a subsequent spoken digit occurred within a certain amount of time of a previous spoken digit;and setting the end time of the segment when the certain amount of time between subsequent digits is exceeded with no digits being detected; identify one or more segments of the second audio file in which the number of spoken digits exceeds a certain number; partition the first audio file into one or more segments corresponding in time to the one or more identified segments in the second audio file;
tag the voice call as containing PII or SPI in response to determining that one or more trigger words are spoken in at least one of the one or more corresponding segments of the first audio file. - View Dependent Claims (18, 19, 20, 21, 22, 23)
-
Specification