×

System and method for providing speech recognition using personal vocabulary in a network environment

  • US 9,201,965 B1
  • Filed: 09/30/2009
  • Issued: 12/01/2015
  • Est. Priority Date: 09/30/2009
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • receiving data propagating in a network environment;

    ignoring Joint Photographic Experts Group (JPEG) documents in the data;

    identifying an audio and video media file in the data, wherein the audio and video media file is associated with a plurality of individuals;

    generating a text file based on the audio and video media file;

    comparing the text file to a plurality of blacklisted words;

    dropping the text file if a blacklisted word is found in the text file;

    identifying, using a processor, selected words within the text file based on a whitelist to create a first word list, wherein the first word list includes fewer words than the text file;

    comparing the selected words in the first word list to a personal vocabulary database associated with an individual from the plurality of individuals, wherein the personal vocabulary database associated with the individual includes one or more words that the individual added to the personal vocabulary database, and wherein words in the personal vocabulary database associated with the individual may be marked as private; and

    removing from the first word list, one or more of the selected words to create a second word list based on the selected words not being found in the personal vocabulary database associated with the individual, wherein the second word list includes fewer words then the first word list, wherein at least one of the selected words that is removed is associated with a false positive from two words that phonetically sound similar.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×