Method and apparatus for speech recognition using latent semantic adaptation

US 7,124,081 B1
Filed: 09/28/2001
Issued: 10/17/2006
Est. Priority Date: 09/28/2001
Status: Expired due to Fees

First Claim

Patent Images

1. A method for generating a speech recognition database comprising:

generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;

receiving a new document that represents a change in the language; and

adapting the LSA space to reflect the change in the language, wherein the adapting includes changing a position of the one or more document vectors in the LSA space by the change in the language.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method and apparatus for speech recognition using latent semantic adaptation is described herein. According to one aspect of the present invention, a method for recognizing speech comprises using latent semantic analysis (LSA) to generate an LSA space for a collection of documents and to continually adapt the LSA space with new documents as they become available. Adaptation of the LSA space is optimally two-sided, taking into account the new words in the new documents. Alternatively, adaptation is one-sided, taking into account the new documents but discarding any new words appearing in those documents.

Citations

55 Claims

1. A method for generating a speech recognition database comprising:
- generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  receiving a new document that represents a change in the language; and
  
  adapting the LSA space to reflect the change in the language, wherein the adapting includes changing a position of the one or more document vectors in the LSA space by the change in the language.
- View Dependent Claims (2, 3, 10, 11)
- - 2. The method of claim 1, wherein adapting the LSA space to reflect the change in the language comprises transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space.
  - 3. The method of claim 2, wherein transforming the LSA space comprises:
    - obtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
      
      computing a new document vector that characterizes a semantic position of the new document within the LSA space;
      
      deriving a document vector transformation matrix; and
      
      applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space,where the shift in the position reflects the change in the language.
  - 10. The method of claim 1, wherein the change in the language is a change in the language'"'"'s domain.
  - 11. The method of claim 1, wherein the change in the language is a change in the language'"'"'s style.

4. A method for generating a speech recognition database comprising:
- generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  receiving a new document that represents a change in the language; and
  
  adapting the LSA space to reflect the change in the language, wherein the change in the language includes changing a position of the one or more document vectors, wherein adapting the LSA space to reflect the change in the language comprises transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space, wherein the transforming the LSA space comprisesobtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
  
  computing a new document vector that characterizes a semantic position of the new document within the LSA space;
  
  deriving a document vector transformation matrix; and
  
  applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language;
  
  obtaining a training word vector that characterizes a semantic position of the training word within the LSA space;
  
  computing a new word vector that characterizes a semantic position of the new word within the LSA space;
  
  deriving a word vector transformation matrix; and
  
  applying the word vector transformation matrix to the training word vector and the new word vector to shift a position of each word vector in the LSA space, where the shift in the position reflects the change in the language.
- View Dependent Claims (5, 6, 7, 8, 9)
- - 5. The method of claim 4, wherein:
    - the training document vector is VS, where VS is computed from a right singular matrix V and a diagonal matrix S, each of which was obtained from a previous singular value decomposition (SVD) of a training word-document matrix constructed during the generation of the LSA space, the training word-document matrix representing the extent to which each of the words appears in each of the documents of the training corpus;
      
      the new document vector ZS, where ZS is computed from the diagonal matrix S and an extension matrix Z, wherein Z is an extension of the right singular matrix V obtained by folding in a new word-document matrix, the new word-document matrix representing the extent to which a new word appears in the new document; and
      
      the document vector transformation matrix is J, wherein J is obtained from a Choleski decomposition of a matrix derived from an extension matrix Y, wherein Y is an extension of a left singular matrix U obtained by folding in the new word-document matrix and wherein U was obtained from the previous SVD of the training word-document matrix constructed during the generation of the LSA space.
  - 6. The method of claim 5, wherein:
    - the training word vector is US, wherein US is computed from the left singular matrix U and the diagonal matrix S;
      
      the new word vector is YS, wherein YS is computed from the diagonal matrix S and the extension matrix Y; and
      
      the word vector transformation matrix is K, wherein K is obtained from a Choleski decomposition of a matrix derived from the extension matrix Z.
  - 7. The method of claim 6, wherein transforming the LSA space comprises applying the document vector transformation matrix and the word vector transformation matrix simultaneously.
  - 8. The method of claim 6, wherein when the new document matrix contains more new documents than new words, then transforming the LSA space comprises:
    - applying the word vector transformation matrix K, first; and
      
      applying the document vector transformation matrix J second, wherein the extension matrix Y is not obtained by folding in the new word-document matrix, but is rather derived from the extension matrix Z.
  - 9. The method of claim 6, wherein when the new document matrix contains more new words than new documents, then transforming the LSA space comprises:
    - applying the document vector transformation matrix J first; and
      
      applying the word vector transformation matrix K second, wherein the extension matrix Z is not obtained by folding in the new word-document matrix, but is rather derived from the extension matrix Y.

12. A computer-readable medium having executable instructions to cause a computer to perform a method for generating a speech recognition database comprising:
- generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  receiving a new document that represents a change in the language; and
  
  adapting the LSA space to reflect the change in the language, wherein the adapting includes changing a position of the one or more document vectors in the LSA space by the change in the language.
- View Dependent Claims (13, 14, 21, 22)
- - 13. The computer-readable medium of claim 12, wherein adapting the LSA space to reflect the change in the language further comprises transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space.
  - 14. The computer-readable medium of claim 13, wherein transforming the LSA space further comprises:
    - obtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
      
      computing a new document vector that characterizes a semantic position of the new document within the LSA space;
      
      deriving a document vector transformation matrix; and
      
      applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language.
  - 21. The computer-readable medium of claim 12, wherein the change in the language is a change in the language'"'"'s domain.
  - 22. The computer-readable medium of claim 12, wherein the change in the language is a change in the language'"'"'s style.

15. A computer-readable medium having executable instructions to cause a computer to perform a method for generating a speech recognition database comprising:
- generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  receiving a new document that represents a change in the language; and
  
  adapting the LSA space to reflect the change in the language, wherein the change in the language includes changing a position of the one or more document vectors, wherein adapting the LSA space to reflect the change in the language further comprises transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space, wherein the transforming the LSA space comprisesobtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
  
  computing a new document vector that characterizes a semantic position of the new document within the LSA space;
  
  deriving a document vector transformation matrix; and
  
  applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language;
  
  obtaining a training word vector that characterizes a semantic position of the training word within the LSA space;
  
  computing a new word vector that characterizes a semantic position of the new word within the LSA space;
  
  deriving a word vector transformation matrix; and
  
  applying the word vector transformation matrix to the training word vector and the new word vector to shift a position of each word vector in the LSA space, where the shift in the position reflects the change in the language.

16. A computer-readable medium having executable instructions to cause a computer to perform a method for generating a speech recognition database comprising:
- generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  receiving a new document that represents a change in the language; and
  
  adapting the LSA space to reflect the change in the language, wherein the change in the language includes changing a position of the one or more document vectors, wherein adapting the LSA space to reflect the change in the language further comprises transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space, wherein transforming the LSA space comprisesobtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
  
  computing a new document vector that characterizes a semantic position of the new document within the LSA space;
  
  deriving a document vector transformation matrix; and
  
  applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language, wherein the training document vector is VS where VS is computed from a right singular matrix V and a diagonal matrix S, each of which was obtained from a previous singular value decomposition (SVD) of a training word-document matrix constructed during the generation of the LSA space, the training word-document matrix representing the extent to which each of the words appears in each of the documents of the training corpus;
  
  the new document vector is ZS where ZS is computed from the diagonal matrix S and an extension matrix Z, wherein Z is an extension of the right singular matrix V obtained by folding in a new word-document matrix, the new word-document matrix representing the extent to which a new word appears in the new document; and
  
  the document vector transformation matrix is J, wherein J is obtained from a Choleski decomposition of a matrix derived from an extension matrix Y, wherein Y is an extension of a left singular matrix U obtained by folding in the new word-document matrix, and wherein U was obtained from the previous SVD of the training word-document matrix constructed during the generation of the LSA space.
- View Dependent Claims (17, 18, 19, 20)
- - 17. The computer-readable medium of claim 16, wherein:
    - the training word vector is US, wherein US is computed from the left singular matrix U and the diagonal matrix S;
      
      the new word vector is YS, wherein YS is computed from the diagonal matrix S and the extension matrix Y; and
      
      the word vector transformation matrix is K, wherein K is obtained from a Choleski decomposition of a matrix derived from the extension matrix Z.
  - 18. The computer-readable medium of claim 17, wherein transforming the LSA space further comprises applying the document vector transformation matrix and the word vector transformation matrix simultaneously.
  - 19. The computer-readable medium of claim 17, wherein, when the new document matrix contains more new documents than new words, transforming the LSA space further comprises:
    - applying the word vector transformation matrix K, first; and
      
      applying the document vector transformation matrix is J second, wherein the extension matrix Y is not obtained by folding in the new word-document matrix, but is rather derived from the extension matrix Z.
  - 20. The computer-readable medium of claim 17, wherein, when the new document matrix contains more new words than new documents, transforming the LSA space comprises:
    - applying the document vector transformation matrix J first; and
      
      applying the word vector transformation matrix K second, wherein the extension matrix Z is not obtained by folding in the new word-document matrix, but is rather derived from the extension matrix Y.

23. An apparatus for generating a speech recognition database, the apparatus comprising:
- a latent semantic analysis (LSA) space generator to generate an LSA space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  a document receiver to receive a new document that represents a change in the language; and
  
  an LSA space adapter to adapt the LSA space to reflect the change in the language, wherein adapting includes changing a position of the one or more document vectors in the LSA space by the change in the language.
- View Dependent Claims (24, 25, 32, 33)
- - 24. The apparatus of claim 23, wherein LSA space adapter transforms the LSA space to take into account the new documents influence on the LSA space without re-computing the LSA space.
  - 25. The apparatus of claim 24, wherein the LSA space adapter transforms the LSA space by:
    - obtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
      
      computing a new document vector that characterizes a semantic position of the new document within the LSA space;
      
      deriving a document vector transformation matrix; and
      
      applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language.
  - 32. The apparatus of claim 23, wherein the change in the language is a change in the language'"'"'s domain.
  - 33. The apparatus of claim 23, wherein the change in the language is a change in the language'"'"'s style.

26. An apparatus for generating a speech recognition database, the apparatus comprising:
- a latent semantic analysis (LSA) space generator to generate an LSA space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  a document receiver to receive a new document that represents a change in the language; and
  
  an LSA space adapter to adapt the LSA space to reflect the change in the language, wherein the change in the language includes changing a position of the one or more document vectors, wherein LSA space adapter transforms the LSA space to take into account the new document'"'"'s influence on the LSA space without recomputing the LSA space, wherein the LSA space adapter transforms the LSA space byobtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
  
  computing a new document vector that characterizes a semantic position of the new document within the LSA space;
  
  deriving a document vector transformation matrix; and
  
  applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language;
  
  obtaining a training word vector that characterizes a semantic position of the training word within the LSA space;
  
  computing a new word vector that characterizes a semantic position of the new word within the LSA space;
  
  deriving a word vector transformation matrix; and
  
  applying the word vector transformation matrix to the training word vector and the new word vector to shift a position of each word vector in the LSA space, where the shift in the position reflects the change in the language.
- View Dependent Claims (27, 28, 29, 30, 31)
- - 27. The apparatus of claim 26, wherein:
    - the training document vector is VS, where VS is computed from a right singular matrix V and a diagonal matrix S, each of which was obtained from a previous singular value decomposition (SVD) of a training word-document matrix constructed during the generation of the LSA space, the training word-document matrix representing the extent to which each of the words appears in each of the documents of the training corpus;
      
      the new document vector ZS, where ZS is computed from the diagonal matrix S and an extension matrix Z, wherein Z is an extension of the right singular matrix V obtained by folding in a new word-document matrix, the new word-document matrix representing the extent to which a new word appears in the new document; and
      
      the document vector transformation matrix is J, wherein J is obtained from a Choleski decomposition of a matrix derived from an extension matrix Y, wherein Y is an extension of a left singular matrix U obtained by folding in the new word-document matrix, and wherein U was obtained from the previous SVD of the training word-document matrix constructed during the generation of the LSA space.
  - 28. The apparatus of claim 26, wherein:
    - the training word vector is US, where US is computed from a left singular matrix U and the diagonal matrix S;
      
      the new word vector is YS, where YS is computed from the diagonal matrix S and the extension matrix Y; and
      
      the word vector transformation matrix is K, wherein K is obtained from a Choleski decomposition of a matrix derived from the extension matrix Z.
  - 29. The apparatus of claim 26, wherein the LSA space adapter transforms the LSA space by applying the document vector transformation matrix and the word vector transformation matrix simultaneously.
  - 30. The apparatus of claim 26, wherein when the new document matrix contains more new documents than new words, then the LSA space adapter transforms space by:
    - applying the word vector transformation matrix K, first; and
      
      applying the document vector transformation matrix is J second, wherein the extension matrix Y is not obtained by folding in the new word-document matrix, but is rather derived from the extension matrix Z.
  - 31. The apparatus of claim 26, wherein when the new document matrix contains more new words than new documents, then the LSA space adapter transforms the LSA space by:
    - applying the document vector transformation matrix J first; and
      
      applying the word vector transformation matrix K second, wherein the extension matrix Z is not obtained by folding in the new word-document matrix, but is rather derived from the extension matrix Y.

34. An apparatus for recognizing speech, the apparatus comprising:
- means for recognizing an audio input as a new document; and
  
  means for processing the new document using latent semantic adaptation, wherein the means for processing includemeans for generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  means for receiving the new document that represents a change in the language; and
  
  means for adapting the LSA space to reflect the change in the language, wherein the means for adapting includes means for changing a position of the one or more document vectors in the LSA space by the change in the language; and
  
  means, coupled to the means for processing, for semantically inferring from a vector representation of the new document which of a plurality of known words and known documents correlate to the new document.
- View Dependent Claims (35, 36, 43, 44)
- - 35. The apparatus of claim 34, wherein the means for adapting the LSA space to reflect the change in the language comprises a means for transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space.
  - 36. The apparatus of claim 35, wherein the means for transforming the LSA space comprises:
    - means for obtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
      
      means for computing a new document vector that characterizes a semantic position of the new document within the LSA space;
      
      means for deriving a document vector transformation matrix; and
      
      means for applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language.
  - 43. The apparatus of claim 34, wherein the change in the language is a change in the language'"'"'s domain.
  - 44. The apparatus of claim 34 wherein the change in the language is a change in the language'"'"'s style.

37. An apparatus for recognizing speech, the apparatus comprising:
- means for recognizing an audio input as a new document; and
  
  means for processing the new document using latent semantic adaptation, wherein the means for processing includemeans for generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  means for receiving the new document that represents a change in the language; and
  
  means for adapting the LSA space to reflect the change in the language, wherein the change in the language includes changing a position of the one or more document vectors, wherein the means for adapting the LSA space to reflect the change in the language comprises a means for transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space, wherein the means for transforming the LSA space comprisesmeans for obtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
  
  means for computing a new document vector that characterizes a semantic position of the new document within the LSA space;
  
  means for deriving a document vector transformation matrix; and
  
  means for applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language;
  
  means, coupled to the means for processing, for semantically inferring from a vector representation of the new document which of a plurality of known words and known documents correlate to the new document,means for obtaining a training word vector that characterizes a semantic position of the training word within the LSA space;
  
  means for computing a new word vector that characterizes a semantic position of the new word within the LSA space;
  
  means for deriving a word vector transformation matrix; and
  
  means for applying the word vector transformation matrix to the training word vector and the new word vector to shift a position of each word vector in the LSA space, where the shift in the position reflects the change in the language.
- View Dependent Claims (38, 39)
- - 38. The apparatus of claim 37, wherein:
    - the training document vector is VS, where VS is computed from a right singular matrix V and a diagonal matrix S, each of which was obtained from a previous singular value decomposition (SVD) of a training word-document matrix constructed during the generation of the LSA space, the training word-document matrix representing the extent to which each of the words appears in each of the documents of the training corpus;
      
      the new document vector ZS, where ZS is computed from the diagonal matrix S and an extension matrix Z, wherein Z is an extension of the right singular matrix V obtained by folding in a new word-document matrix, the new word-document matrix representing the extent to which a new word appears in the new document; and
      
      the document vector transformation matrix is J, wherein J is obtained from a Choleski decomposition of a matrix derived from an extension matrix Y, wherein Y is an extension of a left singular matrix U obtained by folding in the new word-document matrix, and wherein U was obtained from the previous SVD of the training word-document matrix constructed during the generation of the LSA space.
  - 39. The apparatus of claim 38, wherein:
    - the training word vector is US, wherein US is computed from the left singular matrix U and the diagonal matrix S;
      
      the new word vector is YS, where YS is computed from the the diagonal matrix S and the extension matrix Y; and
      
      the word vector transformation matrix is K, wherein K is obtained from a Choleski decomposition of a matrix derived from the extension matrix Z.

40. An apparatus for recognizing speech, the apparatus comprising:
- means for recognizing an audio input as a new document; and
  
  means for processing the new document using latent semantic adaptation, wherein the means for processing includemeans for generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  means for receiving the new document that represents a change in the language; and
  
  means for adapting the LSA space to reflect the change in the language, wherein the change in the language includes changing a position of the one or more document vectors, wherein the means for adapting the LSA space to reflect the change in the language comprises a means for transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space, wherein the means for transforming the LSA space comprisesmeans for obtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
  
  means for computing a new document vector that characterizes a semantic position of the new document within the LSA space;
  
  means for deriving a document vector transformation matrix; and
  
  means for applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language; and
  
  means, coupled to the means for processing, for semantically inferring from a vector representation of the new document which of a plurality of known words and known documents correlate to the new document,wherein the means for transforming the LSA space further comprisesmeans for applying the document vector transformation matrix and the word vector transformation matrix simultaneously.

41. An apparatus for recognizing speech, the apparatus comprising:
- means for recognizing an audio input as a new document; and
  
  means for processing the new document using latent semantic adaptation, wherein the means for processing includemeans for generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  means for receiving the new document that represents a change in the language; and
  
  means for adapting the LSA space to reflect the change in the language, wherein the change in the language includes changing a position of the one or more document vectors, wherein the means for adapting the LSA space to reflect the change in the language comprises a means for transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space, wherein the means for transforming the LSA space comprises;
  
  means for obtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
  
  means for computing a new document vector that characterizes a semantic position of the new document within the LSA space;
  
  means for deriving a document vector transformation matrix; and
  
  means for applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language; and
  
  means, coupled to the means for processing, for semantically inferring from a vector representation of the new document which of a plurality of known words and known documents correlate to the new document,wherein when the new document matrix contains more new documents than new words, then the means for transforming the LSA space further comprisesmeans for applying the word vector transformation matrix K, first; and
  
  means for applying the document vector transformation matrix J second, wherein the means for obtaining the extension matrix Y is not by folding in the new word-document matrix, but is rather by deriving extension matrix Y from the extension matrix Z.

42. An apparatus for recognizing speech, the apparatus comprising:
- means for recognizing an audio input as a new document; and
  
  means for processing the new document using latent semantic adaptation, wherein the means for processing includemeans for generating a latent semantic analysis (LSA) space from a training corpus of documents representative of a language wherein the LSA space includes one or more document vectors;
  
  means for receiving the new document that represents a change in the language; and
  
  means for adapting the LSA space to reflect the change in the language, wherein the change in the language includes changing a position of the one or more document vectors, wherein the means for adapting the LSA space to reflect the change in the language comprises a means for transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space, wherein the means for transforming the LSA space comprises;
  
  means for obtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
  
  means for computing a new document vector that characterizes a semantic position of the new document within the LSA space;
  
  means for deriving a document vector transformation matrix; and
  
  means for applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language; and
  
  means, coupled to the means for processing, for semantically inferring from a vector representation of the new document which of a plurality of known words and known documents correlate to the new document, wherein when the new document matrix contains more new words than new documents, then the means for transforming the LSA space further comprisesmeans for applying the document vector transformation matrix J first; and
  
  means for applying the word vector transformation matrix K second, wherein the means for obtaining the extension matrix Z is not by folding in the new word-document matrix, but is rather by deriving the extension matrix Z from the extension matrix Y.

45. A system for processing speech, the system comprising:
- a speech recognition database comprising a latent semantic analysis (LSA) space generated from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  an input receiver to receive a new document that represents a change in the language; and
  
  a processing system to adapt the LSA space to reflect the change in the language, wherein the adapting includes changing a position of the one or more document vectors in the LSA space by the change in the language.
- View Dependent Claims (46, 47, 54, 55)
- - 46. The system of claim 45, wherein the processing system adapts the LSA space by transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space.
  - 47. The system of claim 46, wherein the processing system transforms the LSA space by:
    - obtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
      
      computing a new document vector that characterizes a semantic position of the new document within the LSA space;
      
      deriving a document vector transformation matrix; and
      
      applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language.
  - 54. The system of claim 45, wherein the change in the language is a change in the language'"'"'s domain.
  - 55. The system of claim 45, wherein the change in the language is a change in the language'"'"'s style.

48. A system for processing speech, the system comprising:
- a speech recognition database comprising a latent semantic analysis (LSA) space generated from a training corpus of documents representative of a language, wherein the LSA space includes one or more document vectors;
  
  an input receiver to receive a new document that represents a change in the language; and
  
  a processing system to adapt the LSA space to reflect the change in the language, wherein the change in the language includes changing a position of the one or more document vectors, wherein the processing system adapts the LSA space by transforming the LSA space to take into account the new document'"'"'s influence on the LSA space without re-computing the LSA space, wherein the processing system transforms the LSA space byobtaining a training document vector that characterizes a semantic position of the training document within the LSA space;
  
  computing a new document vector that characterizes a semantic position of the new document within the LSA space;
  
  deriving a document vector transformation matrix; and
  
  applying the document vector transformation matrix to the training document vector and the new document vector to shift a position of each document vector in the LSA space, where the shift in the position reflects the change in the language;
  
  obtaining a training word vector that characterizes a semantic position of the training word within the LSA space;
  
  computing a new word vector that characterizes a semantic position of the new word within the LSA space;
  
  deriving a word vector transformation matrix; and
  
  applying the word vector transformation matrix to the training word vector and the new word vector to shift a position of each word vector in the LSA space, where the shift in the position reflects the change in the language.
- View Dependent Claims (49, 50, 51, 52, 53)
- - 49. The system of claim 48, wherein:
    - the training document vector is VS, where VS is computed from a right singular matrix V and a diagonal matrix S, each of which was obtained from a previous singular value decomposition (SVD) of a training word-document matrix constructed during the generation of the LSA space, the training word-document matrix representing the extent to which each of the words appears in each of the documents of the training corpus;
      
      the new document vector ZS, where ZS is computed from the diagonal matrix S and an extension matrix Z, wherein Z is an extension of the right singular matrix V obtained by folding in a new word-document matrix, the new word-document matrix representing the extent to which a new word appears in the new document; and
      
      the document vector transformation matrix is J, wherein J is obtained from a Choleski decomposition of a matrix derived from an extension matrix Y, wherein Y is an extension of a left singular matrix U obtained by folding in the new word-document matrix, and wherein U was obtained from the previous SVD of the training word-document matrix constructed during the generation of the LSA space.
  - 50. The system of claim 49, wherein:
    - the training word vector is US, where US is computed from a left singular matrix U and the diagonal matrix S;
      
      the new word vector is YS, wherein YS is computed from the diagonal matrix S and the extension matrix Y; and
      
      the word vector transformation matrix is K, wherein K is obtained from a Choleski decomposition of a matrix derived from the extension matrix Z.
  - 51. The system of claim 49, wherein the processing system transforms the LSA space by applying the document vector transformation matrix and the word vector transformation matrix simultaneously.
  - 52. The system of claim 49, wherein when the new document matrix contains more new documents than new words, then the processing system transforms space by:
    - applying the word vector transformation matrix K, first; and
      
      applying the document vector transformation matrix is J second, wherein the extension matrix Y is not obtained by folding in the new word-document matrix, but is rather derived from the extension matrix Z.
  - 53. The system of claim 49, wherein when the new document matrix contains more new words than new documents, then the processing system transforms the LSA space by:
    - applying the document vector transformation matrix J first; and
      
      applying the word vector transformation matrix K second, wherein the extension matrix Z is not obtained by folding in the new word-document matrix, but is rather derived from the extension matrix Y.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Apple Inc.
Original Assignee
Apple Computer Incorporated (Apple Inc.)
Inventors
Bellegarda, Jerome R.
Primary Examiner(s)
Hudspeth, David
Assistant Examiner(s)
Sked, Matthew J.

Application Number

US09/967,072
Time in Patent Office

1,845 Days
Field of Search

704/240, 704/236, 704/231, 704/244, 707/5
US Class Current

704/255
CPC Class Codes

G10L 15/1815 Semantic context, e.g. disa...

G10L 15/183 using context dependencies,...

Method and apparatus for speech recognition using latent semantic adaptation

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

55 Claims

Specification

Solutions

Use Cases

Quick Links

Method and apparatus for speech recognition using latent semantic adaptation

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

55 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links