
Estimating fractional chirp rate with multiple frequency representations

  • US 9,922,668 B2
  • Filed: 12/15/2015
  • Issued: 03/20/2018
  • Est. Priority Date: 02/06/2015
  • Status: Expired due to Fees
First Claim

1. A computer-implemented method for automatic speaker recognition, the method comprising:

    obtaining a first portion of a speech signal;

    computing a first frequency representation from the first portion of the speech signal using a first fractional chirp rate;

    computing a first score using an auto-correlation of the first frequency representation;

    computing a second frequency representation from the first portion of the speech signal using a second fractional chirp rate;

    computing a second score using an auto-correlation of the second frequency representation;

    comparing the first score and the second score;

    determining a first estimated fractional chirp rate of the first portion of the speech signal corresponding to a highest score of the first score and the second score;

    determining a first estimated pitch of the first portion of the speech signal using the first estimated fractional chirp rate;

    obtaining a second portion of the speech signal, the second portion of the speech signal being at least partially non-overlapping with the first portion of the speech signal;

    computing a third frequency representation from the second portion of the speech signal using a third fractional chirp rate;

    computing a third score using an auto-correlation of the third frequency representation;

    computing a fourth frequency representation from the second portion of the speech signal using a fourth fractional chirp rate;

    computing a fourth score using an auto-correlation of the fourth frequency representation;

    comparing the third score and the fourth score;

    determining a second estimated fractional chirp rate of the second portion of the speech signal corresponding to a highest score of the third score and the fourth score;

    determining a second estimated pitch of the second portion of the speech signal using the second estimated fractional chirp rate;

    computing a sequence of pitch estimates, the sequence of pitch estimates comprising the first estimated pitch and the second estimated pitch; and

    applying the sequence of pitch estimates to recognize a speaker as a source of the speech signal.
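
The per-portion steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation, and several choices are assumptions made only for illustration: fractional chirp rate is taken as chirp rate divided by frequency, each frequency representation is built from Hann-windowed linear chirps, the score is the peak of the autocorrelation (across frequency) of the unit-sum power spectrum, and the pitch is the lag of that peak. The function names (`chirp_spectrum`, `autocorr_score`, `estimate_frame`, `harmonic_frame`), the 80 to 400 Hz lag range, and the synthetic test frames are all hypothetical. The final speaker-recognition step is not sketched, since the claim does not say how the pitch sequence is applied.

```python
import numpy as np

def chirp_spectrum(x, fs, chi, freqs):
    """Power of the inner products of a frame with windowed linear chirps.

    Assumption: analysis frequency f gets chirp rate chi * f, so the basis
    phase is 2*pi*f*(t + chi*t**2/2), with t centered on the frame.
    """
    n = len(x)
    t = (np.arange(n) - (n - 1) / 2) / fs
    warped = t + 0.5 * chi * t**2
    basis = np.exp(-2j * np.pi * np.outer(freqs, warped))
    return np.abs(basis @ (np.hanning(n) * x)) ** 2

def autocorr_score(power, freqs, f_lo=80.0, f_hi=400.0):
    """Return (score, lag_hz): the autocorrelation peak across frequency,
    searched over lags in an assumed pitch range."""
    p = power / power.sum()  # unit total power, so sharper harmonics score higher
    ac = np.correlate(p, p, mode="full")[len(p) - 1:]  # lags 0 .. N-1
    df = freqs[1] - freqs[0]
    lo, hi = int(round(f_lo / df)), int(round(f_hi / df))
    k = lo + int(np.argmax(ac[lo:hi + 1]))
    return float(ac[k]), k * df

def estimate_frame(x, fs, candidate_chis, freqs):
    """Score every candidate fractional chirp rate, keep the highest-scoring
    one, and read the pitch off the winning representation."""
    best = None
    for chi in candidate_chis:
        score, lag_hz = autocorr_score(chirp_spectrum(x, fs, chi, freqs), freqs)
        if best is None or score > best[0]:
            best = (score, chi, lag_hz)
    return best[1], best[2]

def harmonic_frame(f0, chi, fs=8000, dur=0.1, n_harm=10):
    """Synthetic test frame: harmonics of f0 sharing one fractional chirp rate."""
    n = int(fs * dur)
    t = (np.arange(n) - (n - 1) / 2) / fs
    warped = t + 0.5 * chi * t**2
    return sum(np.cos(2 * np.pi * k * f0 * warped) for k in range(1, n_harm + 1))

fs = 8000
freqs = np.arange(0, 2001, 5, dtype=float)   # analysis frequencies in Hz
chis = np.linspace(-4.0, 4.0, 9)             # candidate fractional chirp rates in 1/s
frames = [harmonic_frame(150.0, 2.0), harmonic_frame(200.0, -1.0)]
pitch_track = [estimate_frame(x, fs, chis, freqs)[1] for x in frames]
```

The claim recites only two candidate rates per portion; the sketch scores nine, which is the same compare-and-pick-highest step over a larger candidate set. Normalizing the power spectrum to unit sum before autocorrelating is one way to make a matched chirp rate, which concentrates each harmonic into a narrow peak, score higher than a mismatched one that smears the harmonics.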
