Method and system for analyzing and translating various languages with use of semantic hierarchy
First Claim
1. A method for analyzing natural language texts using a lexical-semantic hierarchy, the method comprising:
- creating, by a computing device, the lexical-semantic hierarchy comprising at least one language-independent semantic class, wherein the semantic class includes one or more semantic attributes and one or more models;
creating at least one language-specific lexical class associated with the lexical-semantic hierarchy, wherein the language-specific lexical class includes one or more semantic and syntactical attributes and one or more models, and wherein the semantic and syntactical attributes and models are inheritable from respective parent units to child units;
creating an instance of the lexical-semantic hierarchy in a memory of the computing device;
identifying a new lexical meaning in a natural language;
identifying at least one candidate position of the new lexical meaning in the lexical-semantic hierarchy in the memory of the computing device on the basis of a dictionary entry for the new lexical meaning;
comparing one or more example sentences comprising the new lexical meaning against one or more semantic and syntactical models of the identified candidate position in the lexical-semantic hierarchy;
creating a new unit in the identified candidate position of the lexical-semantic hierarchy;
creating a syntactical and semantic model for the new unit;
storing the new unit and the created syntactical and semantic model in the memory of the computing device;
receiving a sentence in the natural language; and
representing, using the syntactical and semantic model of the new unit, the sentence in a language-independent form.
5 Assignments
0 Petitions
Accused Products
Abstract
A method and computer system for analyzing sentences of various languages and constructing a language-independent semantic structure are provided. On the basis of comprehensive knowledge about languages and semantics, exhaustive linguistic descriptions are created, and lexical, morphological, syntactic, and semantic analyses for one or more sentences of a natural or artificial language are performed. A computer system is also provided to implement, analyze and store various linguistic structures and to perform lexical, morphological, syntactic, and semantic analyses. As result, a generalized data structure, such as a semantic structure, is generated and used to describe the meaning of one or more sentences in language-independent form, applicable to automated abstracting, machine translation, control systems, Internet information retrieval, etc.
96 Citations
18 Claims
-
1. A method for analyzing natural language texts using a lexical-semantic hierarchy, the method comprising:
-
creating, by a computing device, the lexical-semantic hierarchy comprising at least one language-independent semantic class, wherein the semantic class includes one or more semantic attributes and one or more models; creating at least one language-specific lexical class associated with the lexical-semantic hierarchy, wherein the language-specific lexical class includes one or more semantic and syntactical attributes and one or more models, and wherein the semantic and syntactical attributes and models are inheritable from respective parent units to child units; creating an instance of the lexical-semantic hierarchy in a memory of the computing device; identifying a new lexical meaning in a natural language; identifying at least one candidate position of the new lexical meaning in the lexical-semantic hierarchy in the memory of the computing device on the basis of a dictionary entry for the new lexical meaning; comparing one or more example sentences comprising the new lexical meaning against one or more semantic and syntactical models of the identified candidate position in the lexical-semantic hierarchy; creating a new unit in the identified candidate position of the lexical-semantic hierarchy; creating a syntactical and semantic model for the new unit; storing the new unit and the created syntactical and semantic model in the memory of the computing device; receiving a sentence in the natural language; and representing, using the syntactical and semantic model of the new unit, the sentence in a language-independent form. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer system, comprising:
-
a memory for storing data for access by an application program being executed on a computer system, wherein the data relates to a syntactical and semantic model of a natural language, the data comprising a semantic description of entities expressed in the natural language, parametric characteristics of the natural language, and semantic relations with entities of the natural language; and a processor, coupled to the memory; wherein the memory is configured to store a data structure, the data structure comprising; a plurality of language-independent entities represented by a plurality of semantic classes, wherein the semantic classes are arranged into a hierarchy, wherein each of the plurality of semantic classes includes at least one semantic attribute and a semantic model, and wherein the semantic attributes and the semantic models are inherited from parent units to child units; a plurality of descriptions of semantic relations with the entities, wherein the descriptions of semantic relations are represented by a hierarchical sequence of elements; a plurality of syntactic descriptions of the natural language, wherein one or more of the plurality of syntactic descriptions are associated with one or more semantic descriptions and one or more lexical descriptions of the natural language; a plurality of lexical descriptions of the natural language, wherein one or more of the plurality of lexical descriptions of the natural language are associated with one or more of said syntactical descriptions and one or more of said semantic descriptions; and a plurality of morphological descriptions of the natural language, wherein one or more of the plurality of morphological descriptions of the natural language are associated with one or more of said plurality of lexical descriptions of the natural language, wherein the processor is configured to; receive a source sentence in a source language, analyze the source sentence using at least one of the plurality of language-independent entities, and represent the source sentence in a language-independent form, using at least one of the plurality of assigned descriptions of semantic relations, at least one of the plurality of syntactic descriptions of the natural language, at least one of the plurality of lexical descriptions of the natural language, and at least one of the plurality of morphological descriptions of the natural language. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A computer-readable non-transitory storage medium comprising executable instructions that, when executed by a processor, cause the processor to:
-
create a lexical-semantic hierarchy comprising at least one language-independent semantic class, wherein the semantic class includes one or more semantic attributes and one or more models; create at least one language-specific lexical class associated with the lexical-semantic hierarchy, wherein the language-specific lexical class includes one or more semantic and syntactical attributes and one or more models, and wherein the semantic and syntactical attributes and models are inheritable from respective parent units to child units; create an instance of the lexical-semantic hierarchy in the memory; identify a new lexical meaning in a natural language; identify at least one candidate position of the new lexical meaning in the lexical-semantic hierarchy in the memory on the basis of a dictionary entry for the new lexical meaning; compare one or more example sentences comprising the new lexical meaning against one or more semantic and syntactical models of the identified candidate position in the lexical-semantic hierarchy; create a new unit in position of the lexical-semantic hierarchy; create a syntactical and semantic model for the new unit; store, in said memory, the new unit and the created syntactical and semantic model; receive a sentence in the natural language; and represent, using the syntactical and semantic model of the new unit, sentence in a language-independent form.
-
Specification