System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance

  • US 10,614,799 B2
  • Filed: 11/24/2015
  • Issued: 04/07/2020
  • Est. Priority Date: 11/26/2014
  • Status: Active Grant
First Claim

1. A method of determining an intent prediction for a natural language utterance, prior to a system detection of an end of the natural language utterance, based on a portion of the natural language utterance and statistical information that correlates requests that are linked together in that one type of request statistically follows another type of request, the method being implemented on a computer system that includes one or more physical processors executing computer program instructions which, when executed by the one or more physical processors, perform the method, the method comprising:

    receiving, by the computer system, a first natural language utterance of a user;

    determining, by the computer system, a first type of user request based on the first natural language utterance;

    identifying, by the computer system, at least a second type of user request based on the first type of user request and the statistical information that indicates that the first type of request is made via one or more first spoken words followed by the second type of request, wherein the statistical information indicates that users other than the user have made the second type of request after the first type of request, and wherein identifying the second type of user request comprises determining that the users other than the user made the second type of request after having made the first type of request;

    receiving, by the computer system, a first portion of a second natural language utterance of the user;

    performing, by the computer system, speech recognition on the first portion of the second natural language utterance to recognize one or more words of the first portion of the second natural language utterance;

    receiving, by the computer system, visual inputs provided by the user, wherein the visual inputs are streamed to the computer system and processed in parallel;

    determining, by the computer system, prior to a detection of an end of the second natural language utterance, a first intent associated with the second natural language utterance based on the first portion of the second natural language utterance and the identified type of second user request;

    determining, by the computer system, an intent associated with the visual inputs provided by the user; and

    generating, by the computer system, at least one response for presentation to the user, utilizing a pre-fetched result related to at least one of the first intent and the intent associated with the visual inputs.
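
For illustration only, the steps of claim 1 that rely on the statistical information can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the patented implementation: the session data, the keyword table, and the function names (build_transitions, likely_next_types, predict_intent) are all hypothetical, a keyword overlap stands in for real speech recognition and intent classification, and the visual-input and response-generation steps are omitted.

```python
from collections import Counter, defaultdict

# Hypothetical request-type sequences observed from OTHER users (made-up data).
OTHER_USER_SESSIONS = [
    ["find_restaurant", "get_directions"],
    ["find_restaurant", "get_directions"],
    ["find_restaurant", "make_reservation"],
    ["check_weather", "find_restaurant"],
]

# Hypothetical keyword cues per request type, standing in for a real classifier.
KEYWORDS = {
    "get_directions": {"how", "directions", "route"},
    "make_reservation": {"book", "reserve", "table"},
}


def build_transitions(sessions):
    """Count how often one request type directly follows another."""
    counts = defaultdict(Counter)
    for session in sessions:
        for prev, nxt in zip(session, session[1:]):
            counts[prev][nxt] += 1
    return counts


def likely_next_types(transitions, first_type):
    """Return follow-up request types with their relative frequencies."""
    followers = transitions.get(first_type, Counter())
    total = sum(followers.values())
    if total == 0:
        return {}
    return {req_type: n / total for req_type, n in followers.items()}


def predict_intent(partial_words, priors):
    """Pick an intent from a partial utterance, before its end is detected.

    Each candidate is scored as (1 + keyword overlap) * follow-up frequency,
    so the statistical prior alone can still produce a prediction.
    """
    words = set(partial_words)
    best, best_score = None, 0.0
    for intent, prior in priors.items():
        overlap = len(KEYWORDS.get(intent, set()) & words)
        score = (1 + overlap) * prior
        if score > best_score:
            best, best_score = intent, score
    return best


if __name__ == "__main__":
    transitions = build_transitions(OTHER_USER_SESSIONS)
    # First utterance was classified as "find_restaurant" (assumed).
    priors = likely_next_types(transitions, "find_restaurant")
    # Only the first words of the second utterance have been recognized so far.
    print(predict_intent(["how", "do"], priors))
```

In this sketch the predicted intent would be the key used to pre-fetch a result while the user is still speaking; how the pre-fetch is performed and how the visual-input intent is combined with it are left open here.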

  • 7 Assignments