Knowledge System Method and Apparatus
Abstract
A method and apparatus for automating the acquisition, reconstruction, and generation of knowledgebases of associated ideas, and for using such knowledgebases in many applications, including machine translation of human languages, concept-based search and retrieval of unstructured text or other data, voice recognition, data compression, and artificial intelligence systems.
12 Claims
1. A method for acquiring a knowledge base of associated ideas comprising the steps of:
- providing a pair of documents representing the same content in two different languages, wherein the first of said pair of documents is expressed in a first language, and the second of said pair of documents is expressed in a second language;
- receiving a query to be analyzed, wherein said query is expressed in said first language, and wherein said query consists of a word or word string;
- analyzing said first of said pair of documents to identify all occurrences of said query in said first of said pair of documents;
- selecting a plurality of ranges of words in said second of said pair of documents, wherein said selected ranges correspond to each of said identified occurrences of said query in said first of said pair of documents based on user defined criteria;
- calculating the frequency of words and word strings contained in said selected ranges based on occurrences of all unique words and word strings;
- returning a list of occurrences of all unique words and word strings if said unique words and word strings occur in more than one of the selected ranges using said calculated frequency; and
- ranking said returned list of occurrences based on said calculation of the frequency of words and word strings.
Dependent claim: 2
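The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Several choices are assumptions made only for illustration: whitespace tokenization, the hypothetical `select_range` helper (which maps a query position proportionally into the second document and takes a window of `radius` words on each side as a stand-in for the "user defined criteria"), the `max_len` cap on word-string length, and treating "frequency" as the number of selected ranges in which a word string appears.

```python
from collections import Counter


def find_occurrences(tokens, query_tokens):
    """Return every start index at which the query occurs in the token list."""
    n = len(query_tokens)
    if n == 0:
        return []
    return [i for i in range(len(tokens) - n + 1)
            if tokens[i:i + n] == query_tokens]


def select_range(source_pos, source_len, target_len, radius):
    """Hypothetical range criterion: project the source position onto the
    target document proportionally, then take `radius` words either side."""
    center = (source_pos * (target_len - 1)) // max(source_len - 1, 1)
    return max(0, center - radius), min(target_len, center + radius + 1)


def word_strings(tokens, max_len):
    """Unique words and word strings (1..max_len words) within one range."""
    return {" ".join(tokens[i:i + n])
            for n in range(1, max_len + 1)
            for i in range(len(tokens) - n + 1)}


def associate(source_text, target_text, query, radius=2, max_len=2):
    """Return (word string, range count) pairs ranked by descending count."""
    src = source_text.lower().split()
    tgt = target_text.lower().split()
    if not tgt:
        return []
    counts = Counter()
    for pos in find_occurrences(src, query.lower().split()):
        lo, hi = select_range(pos, len(src), len(tgt), radius)
        # Each unique string is counted once per range that contains it.
        counts.update(word_strings(tgt[lo:hi], max_len))
    # Keep only strings seen in more than one range, then rank them.
    kept = [(s, c) for s, c in counts.items() if c > 1]
    return sorted(kept, key=lambda item: (-item[1], item[0]))
```

Calling `associate("the red house is big . i like the red house .", "la casa roja es grande . me gusta la casa roja .", "red house")` keeps strings such as `casa roja` that fall inside both selected ranges and drops strings such as `es` that appear in only one. A wider `radius` trades precision for recall, which is presumably why the claim leaves the range criteria to the user.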
3. A method for acquiring a knowledge base of associated ideas comprising the steps of:
a. providing a plurality of document pairs where each pair represents the same content in a first language and a second language;
b. receiving a query to be analyzed, wherein said query is expressed in said first language, and wherein said query consists of a word or word string;
c. analyzing all first language documents in said plurality of document pairs to identify all occurrences of said query;
d. selecting a plurality of ranges of words in all second language documents in said plurality of document pairs that corresponds to each of said identified occurrences of said query in said first language document, based on user defined criteria;
e. calculating the frequency of words and word strings contained in said selected ranges based on occurrences of all unique words and word strings;
f. returning a list of occurrences of all unique words and word strings if said unique words and word strings occur in more than one of the selected ranges using said calculated frequency; and
g. ranking said returned list of occurrences based on said calculation of the frequency of words and word strings.
Dependent claim: 4
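Claim 3 extends the same steps to a plurality of document pairs, so the counts are pooled across every pair before filtering and ranking. The Python sketch below illustrates that pooling under a different, equally hypothetical range criterion: each document is assumed to be a list of sentences aligned by index with its counterpart, and the range selected for an occurrence is the whole aligned sentence. The function names, the sentence alignment, the whitespace tokenization, and the `max_len` cap are all assumptions, not details taken from the patent.

```python
from collections import Counter


def word_strings(tokens, max_len=3):
    """Unique words and word strings (1..max_len words) in a token list."""
    return {" ".join(tokens[i:i + n])
            for n in range(1, max_len + 1)
            for i in range(len(tokens) - n + 1)}


def count_occurrences(tokens, query_tokens):
    """Number of positions at which the query occurs in the token list."""
    n = len(query_tokens)
    return sum(1 for i in range(len(tokens) - n + 1)
               if tokens[i:i + n] == query_tokens)


def associate_across_pairs(pairs, query, max_len=3):
    """pairs: iterable of (first_language_sentences, second_language_sentences),
    assumed to be aligned sentence by sentence."""
    q = query.lower().split()
    if not q:
        return []
    counts = Counter()
    for src_sentences, tgt_sentences in pairs:
        for src, tgt in zip(src_sentences, tgt_sentences):
            hits = count_occurrences(src.lower().split(), q)
            strings = word_strings(tgt.lower().split(), max_len)
            # One selected range per identified occurrence of the query.
            for _ in range(hits):
                counts.update(strings)
    # Keep strings found in more than one range; rank by descending count.
    kept = [(s, c) for s, c in counts.items() if c > 1]
    return sorted(kept, key=lambda item: (-item[1], item[0]))
```

With two small pairs, one containing "I like the red house" / "Me gusta la casa roja" and the other "The red house is big" / "La casa roja es grande", a query of `red house` retains `la casa roja` and its sub-strings, since they are the only strings common to both selected ranges. Adding more pairs would be expected to push incidental words such as `la` below the true counterpart only if further filtering is applied, which this sketch does not attempt.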
5. A computer device including a processor, a memory coupled to the processor, and a program stored in the memory, wherein the computer is configured to execute the program and perform the steps of:
- providing a pair of documents representing the same idea in two different languages, wherein the first of said pair of documents is expressed in a first language, and the second of said pair of documents is expressed in a second language;
- receiving a query to be analyzed, wherein said query is expressed in said first language, and wherein said query consists of a word or word string;
- analyzing said first of said pair of documents to identify all occurrences of said query in said first of said pair of documents;
- selecting a plurality of ranges of words in said second of said pair of documents, wherein said selected ranges correspond to each of said identified occurrences of said query in said first of said pair of documents based on user defined criteria;
- calculating the frequency of words and word strings contained in said selected ranges based on occurrences of all unique words and word strings;
- returning a list of occurrences of all unique words and word strings if said unique words and word strings occur in more than one of the selected ranges using said calculated frequency; and
- ranking said returned list of occurrences based on said calculation of the frequency of words and word strings.
Dependent claim: 6
7. A computer device including a processor, a memory coupled to the processor, and a program stored in the memory, wherein the computer is configured to execute the program and perform the steps of:
a. providing a plurality of document pairs where each pair represents the same content in a first language and a second language;
b. receiving a query to be analyzed, wherein said query is expressed in said first language, and wherein said query consists of a word or word string;
c. analyzing all first language documents in said plurality of document pairs to identify all occurrences of said query;
d. selecting a plurality of ranges of words in all second language documents in said plurality of document pairs that corresponds to each of said identified occurrences of said query in said first language document, based on user defined criteria;
e. calculating the frequency of words and word strings contained in said selected ranges based on occurrences of all unique words and word strings;
f. returning a list of occurrences of all unique words and word strings if said unique words and word strings occur in more than one of the selected ranges using said calculated frequency; and
g. ranking said returned list of occurrences based on said calculation of the frequency of words and word strings.
Dependent claim: 8
9. A computer readable storage medium having stored thereon a program executable by a computer processor for performing the steps of:
- providing a pair of documents representing the same idea in two different languages, wherein the first of said pair of documents is expressed in a first language, and the second of said pair of documents is expressed in a second language;
- receiving a query to be analyzed, wherein said query is expressed in said first language, and wherein said query consists of a word or word string;
- analyzing said first of said pair of documents to identify all occurrences of said query in said first of said pair of documents;
- selecting a plurality of ranges of words in said second of said pair of documents, wherein said selected ranges correspond to each of said identified occurrences of said query in said first of said pair of documents based on user defined criteria;
- calculating the frequency of words and word strings contained in said selected ranges based on occurrences of all unique words and word strings;
- returning a list of occurrences of all unique words and word strings if said unique words and word strings occur in more than one of the selected ranges using said calculated frequency; and
- ranking said returned list of occurrences based on said calculation of the frequency of words and word strings.
Dependent claim: 10
11. A computer readable storage medium having stored thereon a program executable by a computer processor for performing the steps of:
a. providing a plurality of document pairs where each pair represents the same content in a first language and a second language;
b. receiving a query to be analyzed, wherein said query is expressed in said first language, and wherein said query consists of a word or word string;
c. analyzing all first language documents in said plurality of document pairs to identify all occurrences of said query;
d. selecting a plurality of ranges of words in all second language documents in said plurality of document pairs that corresponds to each of said identified occurrences of said query in said first language document, based on user defined criteria;
e. calculating the frequency of words and word strings contained in said selected ranges based on occurrences of all unique words and word strings;
f. returning a list of occurrences of all unique words and word strings if said unique words and word strings occur in more than one of the selected ranges using said calculated frequency; and
g. ranking said returned list of occurrences based on said calculation of the frequency of words and word strings.
Dependent claim: 12
Specification