Dynamic lexicon
Abstract
In a system for content management, a dynamic lexicon allows dictionary and lexical data at NLP (natural-language processing) engines at remote sites to stay current with table data at a central location without suffering the time loss involved in computing new tables at the remote sites, or computing new tables at the central site and distributing them. As new terms are added to the dictionary, each term is assigned a unique token identifier. A first step involves downloading extensions to the table data in real time whenever a new word or expression is encountered. A second step involves periodically updating the table data in real time with recomputed data transmitted in compact data files from the central location. Content items in the local archive are re-indexed based on the updated table data. Maintaining tokens across generations of tables allows documents in different languages to be associated without requiring translation.
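The two update steps in the abstract can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `LocalLexicon` class, its method names, the use of small float vectors as lexical values, and the toy `index_item` function that sums the values of known terms. The source does not specify data layouts or an indexing formula.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class LocalLexicon:
    """Client-side copy of the dictionary and its lexical table (hypothetical layout)."""
    tokens: Dict[str, int] = field(default_factory=dict)          # term -> token id
    values: Dict[int, List[float]] = field(default_factory=dict)  # token id -> lexical value
    generation: int = 0

    def apply_extension(self, term: str, token: int, implied_value: List[float]) -> None:
        """First step: add a newly encountered term with the implied value sent by the server."""
        self.tokens[term] = token
        self.values[token] = list(implied_value)

    def apply_replacement(self, recomputed: Dict[int, List[float]], generation: int) -> None:
        """Second step: overwrite values with recomputed ones; token ids stay unchanged."""
        for token, value in recomputed.items():
            self.values[token] = list(value)
        self.generation = generation


def index_item(text: str, lexicon: LocalLexicon) -> List[float]:
    """Toy indexing (an assumption): sum the lexical values of the known terms in the item."""
    total: List[float] = []
    for word in text.lower().split():
        token = lexicon.tokens.get(word)
        if token is None:
            continue  # unknown term; a real client would request an extension here
        value = lexicon.values[token]
        total = list(value) if not total else [a + b for a, b in zip(total, value)]
    return total


def reindex(archive: Dict[str, str], lexicon: LocalLexicon) -> Dict[str, List[float]]:
    """Re-index every item in the local archive against the current table data."""
    return {item_id: index_item(text, lexicon) for item_id, text in archive.items()}


lexicon = LocalLexicon()
lexicon.apply_extension("router", token=1, implied_value=[1.0, 0.0])
lexicon.apply_extension("firmware", token=2, implied_value=[0.0, 1.0])
archive = {"doc-a": "Router firmware", "doc-b": "firmware notes"}
before = reindex(archive, lexicon)

# Periodic replacement: only the values change, so the archive can be re-indexed in place.
lexicon.apply_replacement({1: [0.5, 0.5]}, generation=1)
after = reindex(archive, lexicon)
```

Because the token identifiers survive the replacement, the client only needs the compact set of recomputed values rather than a rebuilt dictionary.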
31 Claims
1. A method of transmitting dictionary updates in a system for real-time analysis of content comprising steps of:
providing a local copy of a dictionary and associated lexical tables;
downloading extensions to said dictionary and said tables as needed to account for new terms from a central location, wherein said extensions assign implied lexical values to said new terms;
periodically downloading from said central location newly-computed lexical values for each term in said dictionary; and
re-indexing documents in a local archive in real time based on said newly-computed lexical values.
Dependent claims: 2 to 13.
14. A process for updating a lexicon in real time by extension, comprising steps of:
assigning a unique token to each term in a dictionary;
creating extensions to the dictionary wherein each term added to the dictionary is assigned an implied lexical value; and
transmitting an extension to lexical tables incorporating said implied values to a client when an analysis at said client machine first encounters a new term.
15. A process for updating a lexicon in real time by replacement comprising steps of:
assigning a unique token to each term in a dictionary;
periodically re-computing lexical values for said dictionary;
periodically downloading recomputed lexical values as vectors to a client, wherein each vector is associated with a token; and
re-indexing items in said client's archive in real time using said re-computed lexical values.
16. A method for maintaining currency of an index of content items comprising steps of:
establishing an update schedule that minimizes unbalance between old and new lexical tables;
downloading re-computed lexical values to a client;
initiating re-indexing of an archive at said client wherein items are re-signed in real time; and
continuing re-indexing until the entire archive has been re-signed.
17. A content management system comprising:
a server;
at least one client; and
means for dynamically transmitting dictionary updates from said server to said at least one client for real-time analysis of content.
18. The system of claim 34, wherein said means for dynamically transmitting dictionary updates from said server to said at least one client for real-time analysis of content comprises:
means for downloading extensions to a dictionary and said lexical tables at said client from said server whenever a new term is encountered, wherein said extensions assign implied lexical values to said new terms;
means for periodically downloading from said server newly-computed lexical values for each term in said dictionary; and
means for re-indexing documents in a client archive in real time based on said newly-computed lexical values.
Dependent claims: 19 to 30.
31. A method of associating documents across languages without translation in a content management system that includes a lexicon comprising steps of:
assigning a unique token to each term;
maintaining the same tokens from generation to generation of lexical tables;
assigning the same tokens to equivalent words, expressions or word combinations in another language so that tables for the two languages correspond; and
associating documents across languages without translating based on said corresponding tables.
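The token-sharing idea in claim 31 can be sketched in a few lines of Python. The two term tables, the sample documents, and the use of Jaccard overlap as the association measure are all assumptions made for illustration; the claim itself says only that equivalent terms share a token and that documents are associated through the corresponding tables.

```python
from typing import Dict, Set

# Hypothetical per-language tables: equivalent terms carry the same token id.
ENGLISH: Dict[str, int] = {"router": 1, "firmware": 2, "new": 3}
SPANISH: Dict[str, int] = {"enrutador": 1, "firmware": 2, "nuevo": 3}


def token_set(text: str, table: Dict[str, int]) -> Set[int]:
    """Map a document to the set of tokens for the terms its language table knows."""
    return {table[word] for word in text.lower().split() if word in table}


def association(a: Set[int], b: Set[int]) -> float:
    """Jaccard overlap of two token sets (a stand-in measure, not taken from the source)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


english_doc = token_set("New router firmware", ENGLISH)
spanish_doc = token_set("firmware del enrutador", SPANISH)
score = association(english_doc, spanish_doc)
```

No text is translated at any point: the two documents are compared purely through the token identifiers that both tables share.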
Specification