×

Speech enhancement with low-order non-negative matrix factorization

  • US 10,276,179 B2
  • Filed: 06/16/2017
  • Issued: 04/30/2019
  • Est. Priority Date: 03/06/2017
  • Status: Active Grant
First Claim
Patent Images

1. A method performed by a computing device for enhancing speech, the method comprising:

  • accessing multiple dictionaries of dictionary atoms, the dictionaries being generated from clean speech samples by performing a non-negative matrix factorization (“

    NMF”

    ) of frequency-domain (“

    FD”

    ) clean speech sample representations of the clean speech samples, each NMF having a unique initialization, wherein each of the multiple dictionaries comprises a reduced number of dictionary atoms to conserve processing power;

    receiving noisy speech;

    generating a FD noisy speech representation of the noisy speech;

    for each of the multiple dictionaries, generating a FD clean speech representation corresponding to the FD noisy speech representation by performing a NMF of the FD noisy speech representation based on the dictionary atoms of the dictionaries;

    generating an enhanced FD clean speech representation of the noisy speech by combining the FD clean speech representations generated using each dictionary with the reduced number of dictionary atoms, the combining includes averaging the FD clean speech representations; and

    converting the enhanced FD clean speech representation into clean speech that represents an enhancement of the noisy speech.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×