System and method for use of semantic understanding in storage, searching and providing of data or other content information
First Claim
1. A computer-based method of expressing an input text to have a semantic meaning, comprising the steps of:
- receiving an input text expressed as a plurality of sentences, each of which sentences includes a plurality of words therein;
parsing the input text using a link grammar and one of a plurality of lexicons, including a different lexicon for each of different input languages, to determine a syntax within the input text, including linkages between the words;
using a set of semantic rules to transform the syntax to a semantic rendering or meaning,wherein each rule maps a particular linkage type or words in the input sentence to a semantic relationship component, or to a concept node participating in a semantic relationship component, respectively,wherein a plurality of rules can be combined algebraically, andwherein the set of semantic rules can be executed in combination or in succession to transform the input text to a plurality of semantic relationships expressed as a conceptual graph and corresponding to the input text; and
creating a linearized tuple based rendering of the conceptual graph, byreceiving the plurality of semantic relationships,ordering a plurality of tuples according to a sort criteria, andstoring the plurality of tuples as a tuple conceptual graph (TCG) together with a unique name or other TCG identifier.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method for using semantic understanding in storing and searching data and other information. A linearized tuple-based version of a conceptual graph can be created from a user input. A plurality of conceptual graphs, or portions thereof, can be compared to determine matches. An associative database can be created and/or searched using a hierarchy of conceptual graphs in tuple format, so that the data storage and searching of such database is optimized. The associative database can be used to integrate data from multiple different sources; form part of an Internet or other search engine; or used in other implementations. Also disclosed herein is a system and method for use of semantic understanding in searching and providing of content is described herein. In accordance with an embodiment, the system comprises a Syntactic Parser (SP) or statistical word tokenizer for data retrieval and parsing; a Syntax To Semantics (STS) transformational algebra-based semantic rule set, and an Associative Database (ADB) of linearized tuple conceptual graphs (TCG), utilizing a conceptual graph formalism. Data can be represented within the ADB, enabling both fast data retrieval in the form of semantic objects and a broad ranging taxonomy of content.
27 Citations
12 Claims
-
1. A computer-based method of expressing an input text to have a semantic meaning, comprising the steps of:
-
receiving an input text expressed as a plurality of sentences, each of which sentences includes a plurality of words therein; parsing the input text using a link grammar and one of a plurality of lexicons, including a different lexicon for each of different input languages, to determine a syntax within the input text, including linkages between the words; using a set of semantic rules to transform the syntax to a semantic rendering or meaning, wherein each rule maps a particular linkage type or words in the input sentence to a semantic relationship component, or to a concept node participating in a semantic relationship component, respectively, wherein a plurality of rules can be combined algebraically, and wherein the set of semantic rules can be executed in combination or in succession to transform the input text to a plurality of semantic relationships expressed as a conceptual graph and corresponding to the input text; and creating a linearized tuple based rendering of the conceptual graph, by receiving the plurality of semantic relationships, ordering a plurality of tuples according to a sort criteria, and storing the plurality of tuples as a tuple conceptual graph (TCG) together with a unique name or other TCG identifier.
-
-
2. A computer-based method of comparing a plurality of tuple conceptual graph (TCG), comprising the steps of:
-
receiving a first tuple conceptual graph (TCG), together with a first name or other TCG identifier and a first set of linearized tuples, and a second TCG, together with a second name or other TCG identifier and a second set of linearized tuples, wherein each of the first and second tuple conceptual graph (TCGs) are created by storing a plurality of semantic relationships as tuples; ordering the first and second set of linearized tuples within each of the first and second TCGs according to a sort criteria, and folding tuple relationships into a minimal canonical representation by successively examining and merging sorted tuple relationships and resolving arguments upon ties; comparing the first name or other TCG identifier and the second name or other TCG identifier to determine a match, and if so then identifying the first TCG as an equal match of the second TCG; if the first name or other TCG identifier and the second name or other TCG identifier do not match, then successively comparing the tuples in the first TCG with the tuples in the second TCG to determine a full or partial match results between the tuples; and reporting the results of the full or partial match.
-
-
3. A computer-based method of storing and retrieving text within a database, the text having a semantic rendering or meaning in a TCG, comprising the steps of:
-
providing a database content including other texts, a tuple conceptual graph (TCG) hierarchy which defines a plurality of tuple conceptual graphs (TCGs) corresponding to the other texts, ordered within branches of the TCG hierarchy so that those TCGs that are more specific are stored logically below TCGs that are less specific, a relation hierarchy, and a node hierarchy; receiving a TCG rendering of an input text expressed as a plurality of sentences or as a query, each of which includes a plurality words therein; accessing the database and previously stored plurality of TCGs corresponding to the other texts; matching, including successively comparing the TCG and the tuples within it corresponding to the input text, with the database content from top down, first against the TCG hierarchy, then the relation hierarchy, and then the node hierarchy, to determine an appropriate location for subsets of information within the input text within the database, including determining full or partial matches between the TCG corresponding to the input text and the TCGs in the database corresponding to the other texts, wherein a partial match indicates a close semantic relationship between partially-matching TCGs; and one or both of storing the TCG for the input text within the database, and/or providing full or partially matching TCG information from the database semantically matching the input text.
-
-
4. A computer-based method of discovering, storing and retrieving text within a database according to a semantic hierarchy, comprising the steps of:
-
providing a database content and a tuple conceptual graph (TCG) hierarchy, relation hierarchy, and node hierarchy, wherein a TCG is created by receiving a plurality of semantic relationships, ordering a plurality of tuples according to a sort criteria, and storing the plurality of tuples as a TCG, and wherein a plurality of tuple conceptual graphs (TCGs) are ordered within branches of the TCG hierarchy so that those TCGs that are more specific are stored logically below TCGs that are less specific; receiving an input text which includes a plurality of phrases and words therein; using a set of semantic rules to transform the syntax to a semantic rendering or meaning, including creating a plurality of tuples corresponding to the input text; comparing the tuples in the input text with the database content according to the TCG hierarchy, relation hierarchy, and node hierarchy, to determine an appropriate location for subsets of information within the input text within the database; updating the database content with the subsets of information within the input text; and performing TCG joins where appropriate, based either on partial tuple overlap or over any concept node argument to tuple information which comes from different sources, to combine content within the database into new TCG reflecting new semantic information which is not fully or directly present in any individual textual source or previously stored in the database.
-
-
5. A system for use of semantic understanding in searching and providing of content, comprising:
-
a computer including a microprocessor; a syntactic parser or word tokenizer for data retrieval and parsing; a syntax to semantics transformational algebra-based semantic rule set; an associative database of linearized tuple conceptual graphs (TCG) which defines a plurality of tuple conceptual graphs (TCGs) corresponding to text data, ordered within branches of the TCG hierarchy so that those TCGs that are more specific are stored logically below TCGs that are less specific; an interface for allowing a user to input one or more additional data, or requests for new data, to be added to or matched against the associative database; wherein the system semantically interprets an original data, including an original Web page or other content, and uses a link grammar, rules, and algebra-based transformations to automatically populate the associative database with semantic links; and wherein when the one or more additional data or requests for new data is received, the system uses the information therein to one or more of modify the database or prepare a response to the request which includes semantically related web pages or content, or advertising, and which is then provided as, or in addition to, a response to the request. - View Dependent Claims (6, 7, 8)
-
-
9. A method for use of semantic understanding in searching and providing of content, comprising:
-
providing access to a syntactic parser or word tokenizer for data retrieval and parsing; providing access to a syntax to semantics transformational algebra-based semantic rule set; providing access to an associative database of linearized tuple conceptual graphs (TCG) which defines a plurality of tuple conceptual graphs (TCGs) corresponding to text data, ordered within branches of the TCG hierarchy so that those TCGs that are more specific are stored logically below TCGs that are less specific; and semantically interpreting an original data, including an original Web page or other content, and using a link grammar, rules, and algebra-based transformations to populate the associative database with semantic links, and wherein when one or more additional data or a request for new data is received, using the information therein to one or more of modify the database or prepare a response to the request which includes semantically related web pages or content, or advertising, and which is then provided as a response to the request. - View Dependent Claims (10, 11, 12)
-
Specification