Accoustic context recognition using local binary pattern method and apparatus
First Claim
Patent Images
1. A method comprising:
- receiving an audio signal at a microphone that is built in to an electronic device;
constructing an audio signal spectrogram, having adjacent pixels, indicative of the audio received at the microphone;
dividing the audio signal spectrogram into a plurality of blocks;
constructing a plurality of local binary patterns (LBP) based on a comparison of the adjacent pixels of the spectrogram;
creating, for each of the plurality of blocks of the spectrogram, a respective LBP histogram that is based on a number of times different LBPs occur in a corresponding block of the plurality of blocks;
identifying clusters of LBP histograms from the respective LBP histograms;
generating, for each of the clusters, a code word representing a corresponding cluster;
creating a codebook histogram based upon correspondence between the LBP histograms and the code words representing the clusters; and
classifying, using a machine learning model, the codebook histogram to identify environmental context that indicates a location of the electronic device at the time the audio signal was received at the microphone;
wherein the respective LBP histogram, for each of the plurality of blocks of the spectrogram, is indicative of an acoustic context of the audio input over a period of time;
wherein the audio signal spectrogram is a linear spectrum representation of the audio input over a frequency range and wherein each of the plurality of blocks for which an LBP histogram is created represents a particular sub-time over the period of time and a particular sub-frequency range that is within the frequency range of the audio signal.
2 Assignments
0 Petitions
Accused Products
Abstract
Various exemplary aspects are directed to acoustic context recognition apparatuses and methods involving isolating and identifying context(s) of an acoustic environment. In one exemplary embodiment, source audio is converted into audio spectrograms, each spectrogram indicative of a period of time. The series of spectrograms are analyzed to identify audio patterns, over a period of time, which are indicative of an environmental context of the source audio. In many embodiments of the present disclosure, acoustic context recognition also includes comparing the identified audio patterns to known environmental contexts.
5 Citations
17 Claims
-
1. A method comprising:
-
receiving an audio signal at a microphone that is built in to an electronic device; constructing an audio signal spectrogram, having adjacent pixels, indicative of the audio received at the microphone; dividing the audio signal spectrogram into a plurality of blocks; constructing a plurality of local binary patterns (LBP) based on a comparison of the adjacent pixels of the spectrogram; creating, for each of the plurality of blocks of the spectrogram, a respective LBP histogram that is based on a number of times different LBPs occur in a corresponding block of the plurality of blocks; identifying clusters of LBP histograms from the respective LBP histograms; generating, for each of the clusters, a code word representing a corresponding cluster; creating a codebook histogram based upon correspondence between the LBP histograms and the code words representing the clusters; and classifying, using a machine learning model, the codebook histogram to identify environmental context that indicates a location of the electronic device at the time the audio signal was received at the microphone; wherein the respective LBP histogram, for each of the plurality of blocks of the spectrogram, is indicative of an acoustic context of the audio input over a period of time; wherein the audio signal spectrogram is a linear spectrum representation of the audio input over a frequency range and wherein each of the plurality of blocks for which an LBP histogram is created represents a particular sub-time over the period of time and a particular sub-frequency range that is within the frequency range of the audio signal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method comprising the steps of:
-
receiving an audio signal spectrogram indicative of an audio input; dividing the audio signal spectrogram into a plurality of blocks; constructing a plurality of local binary patterns (LBP) based on a comparison of adjacent pixels of the spectrogram; creating, for each of the plurality of blocks of the spectrogram, a respective LBP histogram that is based on a number of times different LBPs occur in a corresponding block of the plurality of blocks; identifying clusters of LBP histograms from the respective LBP histograms for each of the plurality of blocks of the spectrogram; generating, based upon the clusters, a codebook containing a plurality of code words; mapping the LBP histograms to the code words of the codebook to create a codebook histogram; and training a machine learning algorithm using the LBP histograms as features therefore to identify environmental context that indicates a location of an electronic device at the time an audio signal is received at a microphone of the electronic device; wherein the respective LBP histogram, for each of the plurality of blocks of the spectrogram, is indicative of an acoustic context of the audio input over a period of time; wherein the audio signal spectrogram is a linear spectrum representation of the audio input over a frequency range and wherein each of the plurality of blocks for which an LBP histogram is created represents a particular sub-time over the period of time and a particular sub-frequency range that is within the frequency range of the audio signal. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
Specification