×

METHOD AND APPARATUS FOR DETECTING SENSITIVE CONTENT IN A DOCUMENT

  • US 20100076957A1
  • Filed: 09/10/2008
  • Published: 03/25/2010
  • Est. Priority Date: 09/10/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer-executed method for detecting sensitive content in a document, the method comprising:

  • receiving a document;

    identifying a set of terms in the document that are candidate sensitive terms;

    generating a combination of terms, based on the identified terms, that is associated with a semantic meaning;

    performing searches through a corpus based on the combination of terms and determining hit counts returned for each term in the combination and for the combination;

    determining whether the combination of terms is sensitive based on the hit count for the combination and the hit counts for the individual terms in the combination; and

    generating a result that indicates portions of the document which contain sensitive combinations.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×