Document tagging and retrieval using entity specifiers

  • US 9,971,782 B2
  • Filed: 10/13/2015
  • Issued: 05/15/2018
  • Est. Priority Date: 10/16/2013
  • Status: Active Grant
First Claim

1. A computer-performed method of organizing a collection of electronic documents, the method comprising:

  • in a computer system, storing entries in multiple dictionaries separate from and not associated with any particular one of the electronic documents, wherein the multiple dictionaries are data structures stored within the computer system, wherein individual ones of the multiple dictionaries correspond to one of a plurality of different subjects, wherein the entries contain a descriptive term and wherein entries corresponding to an entity contain an entity type code indicating that the entry is an entity entry with respect to a subject of the one of the multiple dictionaries in which the entry is stored and a category of that entity, wherein entity entries are identified as belonging to one or more special categories of terms that have special meaning with respect to their corresponding subjects, and wherein at least some of the descriptive terms are present in two or more of the multiple dictionaries;

    responsive to requests within the computer system, accessing the collection of electronic documents by matching tags stored in a tag database separate from the documents and the dictionaries and containing terms collected from the electronic documents with descriptive terms in the multiple dictionaries to determine one or more subjects of the electronic documents from subjects of one or more of the multiple dictionaries that contain the descriptive terms matching the terms contained in the electronic documents, whereby a speed of matching the documents to the one or more subjects is increased by accessing the tags from the tag database;

    responsive to the matching detecting a match between a descriptive term in one of the tags in the tag database and in one of the multiple dictionaries, determining whether or not the entry containing the descriptive term in the corresponding dictionary has an entity type code;

    responsive to determining that the entry in the corresponding dictionary has an entity type code, providing an indication of the entity type code in conjunction with an indication of the one or more subjects of the electronic documents along with the one or more subjects determined by the determining in response to the requests;

    storing a representation of the one or more subjects determined by the determining along with the indication of the entity type code in a memory of the computer system to provide the response to the requests; and

    generating the multiple dictionaries by collecting the descriptive terms for each dictionary, determining whether a descriptive term is an entity or a non-entity, and storing the descriptive terms as entries in the dictionaries along with the indications whether or not the entries are entity entries or non-entity entries.
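
As an informal illustration only, and not the patented implementation or a construction of the claim's legal scope, the data flow recited above might be sketched as follows. All names here (`Entry`, `build_dictionary`, `DICTIONARIES`, `TAG_DB`, `lookup`) and the sample terms, subjects, and type codes are hypothetical. The sketch assumes one in-memory dictionary per subject, a tag database keyed by document identifier, and an entity type code that is simply absent for non-entity entries.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Entry:
    term: str
    # Assumption: None marks a non-entity entry; a string gives the entity category.
    entity_type_code: Optional[str] = None


def build_dictionary(terms, entity_codes):
    """Store each descriptive term as an entry, flagged as entity or non-entity."""
    return {t: Entry(t, entity_codes.get(t)) for t in terms}


# Hypothetical subject dictionaries; the term "java" is present in two of them.
DICTIONARIES = {
    "programming": build_dictionary(["java", "compiler"], {"java": "LANGUAGE"}),
    "geography": build_dictionary(["java", "island"], {"java": "PLACE"}),
}

# Hypothetical tag database, kept separate from the documents and the dictionaries.
TAG_DB = {
    "doc1": ["java", "compiler"],
    "doc2": ["island"],
}


def lookup(doc_id):
    """Match a document's stored tags against every subject dictionary.

    Returns {subject: [(term, entity_type_code or None), ...]}.
    """
    result = {}
    for tag in TAG_DB.get(doc_id, []):
        for subject, entries in DICTIONARIES.items():
            entry = entries.get(tag)
            if entry is not None:
                result.setdefault(subject, []).append(
                    (entry.term, entry.entity_type_code)
                )
    return result


if __name__ == "__main__":
    print(lookup("doc1"))
    print(lookup("doc2"))
```

In this sketch a tag such as "java" matches entries in two subject dictionaries and carries a different entity type code in each, which is the situation the claim addresses by reporting the type code together with the matched subjects. Reading tags from `TAG_DB` rather than rescanning document text loosely mirrors the separate tag database the claim recites.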
