System and methods for automatically detecting deceptive content

  • US 10,642,975 B2
  • Filed: 10/18/2012
  • Issued: 05/05/2020
  • Est. Priority Date: 10/19/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method for classifying textual opinion information as truthful or deceptive, comprising the steps of:

  • communicating with a processor a first source of opinion information via a communication interface connecting a user system to a network to collect opinion information, wherein the opinion information consists of at least one set of known deceptive opinion information and at least one set of known truthful opinion information forming an initial dataset, the communication interface being a wired communication interface or a wireless communication interface;

    storing by the processor the opinion information in a main memory of the user system;

    analyzing separately by the processor each of the set of known deceptive opinion information and the set of known truthful opinion information of the opinion information to determine features associated with each set in the initial dataset, wherein the machine-analysis comprises a genre identification approach that reviews each part of speech of the opinion information;

    automatically generating a model based on the analyzing step in which a first set of features comprising nouns, adjectives, prepositions, determiners, coordinating conjunctions are associated with the set of known deceptive opinion information and a second set of features comprising verbs, adverbs, pronouns and pre-determiners are associated with the set of known truthful opinion information;

    receiving by the processor an online review of a product or a service;

    applying by the processor the model to the online review, wherein the processor identifies text of the online review as one or more nouns, adjectives, prepositions, determiners, coordinating conjunctions, verbs, adverbs, pronouns and pre-determiners;

    calculating by the processor a first number of features of the first set in the text and a second number of features of the second set in the text; and

    categorizing by the processor the online review as deceptive when the first number is greater than the second number and categorizing the online review as truthful when the second number is greater than the first number.

View all claims
    ×
    ×

    Thank you for your feedback

    ×
    ×