Machined book detection

  • US 9,372,850 B1
  • Filed: 12/19/2012
  • Issued: 06/21/2016
  • Est. Priority Date: 12/19/2012
  • Status: Active Grant
First Claim

1. A computer-implemented method of identifying machine generated text, comprising:

    analyzing a plurality of pre-determined machine generated works;

    storing, in a database, pre-determined centers of mass corresponding to the pre-determined machine generated works;

    storing, in the database, pre-determined shape descriptors corresponding to the pre-determined machine generated works;

    analyzing a plurality of pre-determined non-machine generated works;

    storing, in the database, pre-determined centers of mass corresponding to the pre-determined non-machine generated works;

    storing, in the database, pre-determined shape descriptors corresponding to the pre-determined non-machine generated works;

    receiving a textual work submitted for publishing;

    generating an N-gram of N words of the textual work;

    plotting the N-gram in an N-dimensional space;

    generating a 2-dimensional plot based, at least in part, on the plot of the N-gram in N-dimensional space;

    calculating a center of mass of the 2-dimensional plot;

    calculating a shape descriptor of the 2-dimensional plot, wherein the shape descriptor includes one or more points defining a shape of the 2-dimensional plot;

    comparing the center of mass to the pre-determined centers of mass and the shape descriptor to the pre-determined shape descriptors;

    calculating, based at least in part on the comparison, a confidence score indicative of a correlation between the textual work and at least one of the pre-determined machine generated works;

    determining, based at least in part on the confidence score, that the textual work is machine generated; and

    rejecting the textual work for publishing based on the determination.
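
The steps of claim 1 can be illustrated with a minimal sketch. The claim does not say how words become coordinates, how the N-dimensional plot is reduced to two dimensions, what form the shape descriptor takes, or how the confidence score is computed, so every such choice below is an assumption made only for illustration: `word_coord` (a checksum scaled into [0, 1)), the half-and-half averaging in `project_2d`, the bounding-box corners in `shape_descriptor`, the nearest-reference ratio in `confidence_machine`, and the 0.5 threshold in `review` are all hypothetical names and choices, not the patented implementation.

```python
import math
import zlib


def word_coord(word):
    # Hypothetical word-to-number mapping: a stable checksum scaled into [0, 1).
    return (zlib.crc32(word.lower().encode("utf-8")) % 1000) / 1000.0


def ngram_points(text, n=4):
    # Each sliding window of n words becomes one point in n-dimensional space.
    words = text.split()
    return [tuple(word_coord(w) for w in words[i:i + n])
            for i in range(len(words) - n + 1)]


def project_2d(points):
    # Assumed projection (requires n >= 2): average the first half and the
    # second half of each point's coordinates to get an (x, y) pair.
    out = []
    for p in points:
        half = len(p) // 2
        out.append((sum(p[:half]) / half, sum(p[half:]) / (len(p) - half)))
    return out


def center_of_mass(pts):
    # Mean x and mean y of the 2-dimensional plot.
    xs, ys = zip(*pts)
    return (sum(xs) / len(pts), sum(ys) / len(pts))


def shape_descriptor(pts):
    # Assumed descriptor: two opposite corners of the axis-aligned bounding box.
    xs, ys = zip(*pts)
    return [(min(xs), min(ys)), (max(xs), max(ys))]


def signature(text, n=4):
    # The text must contain at least n words, otherwise there are no points.
    pts = project_2d(ngram_points(text, n))
    return center_of_mass(pts), shape_descriptor(pts)


def distance(sig_a, sig_b):
    # Distance between centers plus distances between matching descriptor points.
    (ca, sa), (cb, sb) = sig_a, sig_b
    return math.dist(ca, cb) + sum(math.dist(p, q) for p, q in zip(sa, sb))


def confidence_machine(sig, machine_sigs, human_sigs):
    # Assumed score in [0, 1]: closer to a machine reference means a higher score.
    d_m = min(distance(sig, s) for s in machine_sigs)
    d_h = min(distance(sig, s) for s in human_sigs)
    if d_m + d_h == 0:
        return 0.5
    return d_h / (d_m + d_h)


def review(text, machine_sigs, human_sigs, threshold=0.5, n=4):
    # Reject the submission when the score exceeds the assumed threshold.
    score = confidence_machine(signature(text, n), machine_sigs, human_sigs)
    return ("reject" if score > threshold else "accept"), score
```

In this sketch the "database" of the claim is just two Python lists of signatures, one built from known machine generated works and one from known non-machine generated works; a submitted text is passed to `review` together with both lists.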
