System and method for generating personal vocabulary from network data

  • US 8,990,083 B1
  • Filed: 09/30/2009
  • Issued: 03/24/2015
  • Est. Priority Date: 09/30/2009
  • Status: Active Grant
First Claim

1. A method, comprising:

    receiving data propagating in a network environment at a streaming database feeder;

    ignoring Joint Photographic Experts Group (JPEG) documents in the data;

    updating tags for each user in the network environment using a user-sub stream created for the user by the streaming database feeder, wherein each user-sub stream includes at least a portion of the data propagating in the network environment, wherein the tags are words and phrases that are associated with each user, wherein the data includes documents and, for at least a portion of the documents in the data, each original document is copied to create an anonymous document and a document that contains selected words within the data based on a whitelist, wherein the whitelist includes a plurality of designated words to be tagged, wherein documents that include data in a blacklist are dropped, and wherein the anonymous documents contain a concept field and some of the data in the anonymous documents is selected for the whitelist, and wherein the document that contains selected words does not include the concept field;

    assigning a weight to the selected words based on at least one characteristic associated with the data;

    associating the selected words to an individual, wherein the weight for a selected word is higher if the individual propagates the data; and

    generating a resultant composite of the selected words that are tagged.
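
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions, not the patented implementation: the `Document` fields, the example `WHITELIST` and `BLACKLIST` contents, and the `SENDER_WEIGHT` and `RECIPIENT_WEIGHT` values are all hypothetical, and the anonymous-document copy with its concept field is not modeled. The sketch skips JPEG documents, drops documents containing blacklisted data, selects whitelisted words, weights them more heavily for the individual who propagated the data, and returns a per-user composite of tagged words.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Document:
    # All field names are hypothetical; the claim does not define a schema.
    sender: str          # individual who propagated the data
    recipients: tuple    # other users whose sub-streams include the data
    content_type: str    # MIME-style type used to spot JPEG documents
    text: str


# Illustrative lists and weights (assumptions, not taken from the patent).
WHITELIST = {"router", "video", "search"}
BLACKLIST = {"confidential"}
SENDER_WEIGHT = 2.0      # higher weight when the individual propagates the data
RECIPIENT_WEIGHT = 1.0


def build_vocabulary(stream):
    """Return {user: {word: accumulated weight}} for the documents in stream."""
    tags = defaultdict(lambda: defaultdict(float))
    for doc in stream:
        # Ignore JPEG documents in the data.
        if doc.content_type == "image/jpeg":
            continue
        words = [w.strip(".,;:!?").lower() for w in doc.text.split()]
        # Drop documents that include data in the blacklist.
        if any(w in BLACKLIST for w in words):
            continue
        # Keep only the designated whitelist words.
        selected = [w for w in words if w in WHITELIST]
        # Update tags per user, as if each user had a dedicated sub-stream.
        for user in (doc.sender, *doc.recipients):
            weight = SENDER_WEIGHT if user == doc.sender else RECIPIENT_WEIGHT
            for word in selected:
                tags[user][word] += weight
    # The resultant composite of tagged words for each user.
    return {user: dict(words) for user, words in tags.items()}
```

Calling `build_vocabulary` on a list of `Document` objects yields one weighted word table per user, in which a word sent by a user counts for more than the same word merely received.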
