System and method to acquire paraphrases

  • US 9,672,204 B2
  • Filed: 05/28/2010
  • Issued: 06/06/2017
  • Est. Priority Date: 05/28/2010
  • Status: Active Grant
First Claim

1. A method for acquiring paraphrases for use in natural language processing applications, the method being conducted using an automated survey platform that is configured to permit access to a group of crowd-workers for performing portions of a task while other portions are performed by the automated survey platform, and using a processor configured to perform the method, the method comprising:

    receiving raw text as input to the processor, the raw text being a result of obtaining opinions of a group of consumers or customers of a product or service;

    sentence breaking the raw text into individual sentences by the processor by use of automatic natural language processing techniques;

    providing the individual sentences and a first survey by the processor through the automated survey platform to a plurality of annotating sources, wherein each annotating source reviews the individual sentences and determines an assessment of the individual sentences based on the first survey, and wherein the automated survey platform is accessed by the plurality of annotating sources for the first survey, the plurality of annotating sources for the first survey being crowd-workers connected via their respective networked computers over the internet with the automated survey platform;

    receiving results of the first survey from the each of the annotating sources for the first survey by the processor;

    filtering the results of the first survey by the processor to group the individual sentences that were the subject of the first survey into groups that have received like assessments from the annotating sources;

    providing the filtered survey results and a second survey by the processor to a plurality of annotating sources for the second survey, the plurality of annotating sources for the second survey being crowd-workers connected via their respective networked computers over the internet with the automated survey platform, wherein each annotating source for the second survey conducts the second survey based on the filtered results to determine portions of the individual sentences included in the filtered results that indicate the assessment, and wherein the plurality of annotating sources for the first survey and the plurality of annotating sources for the second survey each comprises a sampling of people to complete the first survey and the second survey, respectively, and wherein the sampling of people comprises respondents to the automated survey platform;

    receiving results of the second survey from the plurality of annotating sources for the second survey by the processor; and

    automatically generating by the processor paraphrases based on the results of the second survey, wherein the paraphrases are pairs of expression that have a same meaning.
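
The processor-side steps of claim 1 (sentence breaking, grouping by like assessment, and pairing the marked portions) can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the regex sentence splitter, the majority-vote grouping rule, and the all-pairs pairing rule are assumptions made for illustration, and the crowd-worker survey responses are passed in as plain dictionaries rather than collected from a real survey platform.

```python
import re
from collections import Counter, defaultdict
from itertools import combinations


def break_sentences(raw_text):
    """Split raw opinion text into sentences.

    Assumption: a naive punctuation-based regex stands in for the
    'automatic natural language processing techniques' of the claim.
    """
    parts = re.split(r"(?<=[.!?])\s+", raw_text.strip())
    return [p for p in parts if p]


def group_by_assessment(first_survey):
    """Group sentences that received like assessments.

    first_survey maps each sentence to the list of labels given by
    the annotating sources. Assumption: the most common label wins.
    """
    groups = defaultdict(list)
    for sentence, labels in first_survey.items():
        label, _count = Counter(labels).most_common(1)[0]
        groups[label].append(sentence)
    return dict(groups)


def generate_paraphrases(second_survey):
    """Pair up the portions that annotators marked for one assessment.

    second_survey maps an assessment label to the text portions that
    second-survey annotators said indicate that assessment.
    Assumption: every two distinct portions under the same label are
    treated as a candidate paraphrase pair.
    """
    pairs = []
    for _label, portions in second_survey.items():
        unique = sorted({p.lower() for p in portions})
        pairs.extend(combinations(unique, 2))
    return pairs
```

A caller would run `break_sentences` on the raw opinions, send the sentences out with the first survey, pass the responses to `group_by_assessment`, send each group out with the second survey, and finally feed the marked portions to `generate_paraphrases`. In practice the candidate pairs would likely need further validation before being treated as true paraphrases.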
