×

Method and apparatus for voice searching for stored content using uniterm discovery

  • US 8,015,005 B2
  • Filed: 02/15/2008
  • Issued: 09/06/2011
  • Est. Priority Date: 02/15/2008
  • Status: Active Grant
First Claim
Patent Images

1. In an electronic device, a method comprising:

  • storing, by the electronic device, content, wherein said content includes one or more of text, images, audio, videos, and multimedia content;

    tagging, by the electronic device, the content with an audio tag;

    receiving, by the electronic device, a voice query to retrieve content stored on the device;

    completing, by the electronic device, a voice-to-voice search utilizing uniterms of the audio tag and a phoneme latent lattice model generated from the voice query to identify audio tags tagged to stored content, which audio tags provide one or more uniterms that score within the phoneme lattice model; and

    outputting, by the electronic device, retrieved content associated with the identified audio tags having uniterms that score within the phoneme lattice model, wherein the retrieved content is outputted in an order corresponding to an order in which the uniterms are structured within the voice query;

    wherein said completing further comprises;

    generating, by the electronic device, one or more first phoneme lattices from audio tags;

    determining, by the electronic device, one or more best paths from the one or more first phoneme lattices;

    extracting, by the electronic device, one or more uniterms from the one or more first phoneme lattices;

    storing, by the electronic device, the one or more uniterms and the one or more best paths in a uniterm index database; and

    re-associating, by the electronic device, the one or more uniterms with corresponding stored content with the associated audio tag from which the uniterm was generated; and

    wherein extracting one or more uniterms comprises;

    generating, by the electronic device, a next latent statistical lattice model from the one or more phoneme lattices generated from the audio tags;

    extracting, by the electronic device, phoneme strings with a length that is at least equal to a pre-set minimum length from the phoneme lattices as the one or more best paths;

    scoring, by the electronic device, the one or more best paths against the next latent statistical lattice model; and

    identifying, by the electronic device, a preset number of best strings as the uniterms selected to represent the phoneme lattice.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×