×

Speech signal similarity

  • US 8,670,983 B2
  • Filed: 08/30/2011
  • Issued: 03/11/2014
  • Est. Priority Date: 09/02/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method for determining a similarity between a first audio source and a second audio source, the method comprising:

  • for the first audio source, performing the steps of;

    determining, using an analysis module of a computer, a first plurality of segments of the first audio source;

    determining, using the analysis module, a first frequency of occurrence for each of a plurality of phoneme sequences in the first audio source;

    determining, using the analysis module, a first weighted frequency for each of the plurality of phoneme sequences based on the first frequency of occurrence for the phoneme sequence;

    wherein determining the first weighted frequency includes emphasizing phoneme sequences that occur in few segments of the first plurality of segments relative to phoneme sequences that occur in many segments of the first plurality of segments;

    for the second audio source, performing the steps of;

    determining, using the analysis module, a second plurality of segments of the second audio source;

    determining, using the analysis module, a second frequency of occurrence for each of a plurality of phoneme sequences in the second audio source;

    determining, using the analysis module, a second weighted frequency for each of the plurality of phoneme sequences based on the second frequency of occurrence for the phoneme sequence;

    wherein determining the second weighted frequency includes emphasizing phoneme sequences that occur in few segments of the second plurality of segments relative to phoneme sequences that occur in many segments of the second plurality of segments;

    comparing, using a comparison module of a computer, the first weighted frequency for each phoneme sequence with the second weighted frequency for the corresponding phoneme sequence; and

    generating, using the comparison module, a similarity score representative of a similarity between the first audio source and the second audio source based on the results of the comparing.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×