Method and system for concept generation and management
First Claim
1. A method for defining and generating a set of concepts and identifying said concepts in text, comprising:
- a) defining said set of concepts wherein;
i) each of said concepts comprises a pattern;
ii) each of said patterns comprising one of the following;
1) a description sufficiently constrained to be matchable to zero or more extents;
each of said extents comprising a set of zero or more items wherein each of said items is an instance of a linguistic entity;
each of said instances of said linguistic entity is identified in a) text, or b) a knowledge resource;
or c) both a) and b); and
said pattern is matchable to zero or more of said extents corresponding to said description;
or 2) an operator and a list of zero or more arguments wherein each of said arguments is a further pattern; and
said pattern comprising said operator and said list of arguments is matchable to extents that are the result of applying said operator to further extents that are matchable by said arguments;
or 3) a reference to a further concept comprising a further pattern; and
said pattern comprising said reference to said further concept is matchable to extents that are matchable by said further pattern; and
iii) any said further pattern is a pattern; and
b) generating said concepts from text or one or more sources of knowledge; and
c) identifying said concepts in text.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention is in two parts. The first part is manual, semi-automatic, and automatic methods and a system for generating concepts. The second part is a method and system for the management of concepts. Such concepts (lower case c) are linguistics-based patterns or set of patterns. Each pattern comprises other patterns, concepts, and linguistic entities of various kinds, and operations on or between those patterns, concepts, and linguistic entities. The present invention improves upon the notion of Concepts as defined within the Concept Specification Language (CSL) of PCT Application No. WO 02/27524 by Fass et al. (2001). CSL Concepts are linguistics-based Patterns or set of Patterns. Each Pattern comprises other Patterns, Concepts, and linguistic entities of various kinds, and Operations on or between those Patterns, Concepts, and linguistic entities. Central to the first part of the invention are notions of a “User concept Description” (UcD), User Concept Description (UCD), “concept wizard,” and “Concept wizard.” UcDs and UCDs are representations of what is used to generate a concept or Concept, including, but not limited to, knowledge sources used as the basis of generation, the data model used to control generation, and instructions (Directives) governing generation. The concept wizards and Concept wizards are tools for navigating users through concept and Concept generation.
-
Citations
112 Claims
-
1. A method for defining and generating a set of concepts and identifying said concepts in text, comprising:
-
a) defining said set of concepts wherein;
i) each of said concepts comprises a pattern;
ii) each of said patterns comprising one of the following;
1) a description sufficiently constrained to be matchable to zero or more extents;
each of said extents comprising a set of zero or more items wherein each of said items is an instance of a linguistic entity;
each of said instances of said linguistic entity is identified in a) text, or b) a knowledge resource;
orc) both a) and b); and
said pattern is matchable to zero or more of said extents corresponding to said description;
or2) an operator and a list of zero or more arguments wherein each of said arguments is a further pattern; and
said pattern comprising said operator and said list of arguments is matchable to extents that are the result of applying said operator to further extents that are matchable by said arguments;
or3) a reference to a further concept comprising a further pattern; and
said pattern comprising said reference to said further concept is matchable to extents that are matchable by said further pattern; and
iii) any said further pattern is a pattern; and
b) generating said concepts from text or one or more sources of knowledge; and
c) identifying said concepts in text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A method for defining and generating a set of concepts and identifying said concepts in text, comprising:
-
a) identifying linguistic entities in the text of documents and other text-forms;
b) annotating said identified linguistic entities in a text markup language to produce linguistically annotated documents and other text-forms;
c) storing said linguistically annotated documents and other text-forms;
d) defining concepts that also makes use of patterns wherein;
i) each of said concepts comprises a pattern;
ii) each of said patterns comprising one of the following;
1) a description sufficiently constrained to be matchable to zero or more extents;
each of said extents comprising a set of zero or more items wherein each of said items is an instance of a linguistic entity, each of said instances of said linguistic entity is identified in a) text, or b) a knowledge resource;
orc) both a) and b); and
said pattern is matchable to zero or more of said extents corresponding to said description;
or2) an operator and a list of zero or more arguments wherein each of said arguments is a further pattern; and
said pattern comprising said operator and said list of arguments is matchable to extents that are the result of applying said operator to further extents that are matchable by said arguments;
or3) a reference to a further concept comprising a further pattern; and
said pattern comprising said reference to said further concept is matchable to extents that are matchable by said further pattern; and
iii) any said further pattern is a pattern; and
e) generating said concepts from text of documents and other text-forms, and other sources of knowledge;
f) managing said concepts, both generated and non-generated;
g) identifying concepts using linguistic information, where said concepts occur in one of;
i) said text of documents and other text-forms in which linguistic entities have been identified in step a);
orii) said linguistically annotated documents and other text-forms of step b);
oriii) stored linguistically annotated documents and other text-forms of step c);
h) annotation of said identified concepts in said text markup language to produce conceptually annotated documents and other text-forms;
i) storage of said conceptually annotated documents and other text-forms. - View Dependent Claims (28, 29, 30, 31)
-
-
32. A method for defining and generating a set of Concepts and identifying said Concepts in text, comprising:
-
a) defining said set of Concepts wherein;
i) each of said Concepts comprises a Pattern;
ii) each of said Patterns comprising one of the following;
1) a Basic Pattern comprising a description sufficiently constrained to be matchable to zero or more extents;
each of said extents comprising a set of zero or more items wherein each of said items is an instance of a linguistic entity;
each of said instances of said linguistic entity is identified in b) text, or b) a knowledge resource;
orc) both a) and b); and
said Basic Pattern is matchable to zero or more of said extents corresponding to said description;
or2) an Operator Pattern comprising an Operator and a list of zero or more Arguments wherein each of said Arguments is a further Pattern; and
said Operator Pattern is matchable to extents that are the result of applying said Operator to further extents that are matchable by said Arguments;
or3) a Concept Call comprising a reference to a further Concept comprising a further Pattern; and
said Concept Call is matchable to extents that are matchable by said further Pattern; and
iii) any said further Pattern is a Pattern; and
b) generating said Concepts from text or one or more sources of knowledge; and
c) identifying said Concepts in text. - View Dependent Claims (33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 73, 74, 75, 76)
-
-
72. The method of claim 72 wherein a User Concept Group (UCG) is used to group and name a set of Concepts, said UCG comprising:
-
a) a named Concept that refers to named groups of Concepts or Patterns, or other groups;
b) said UCGs can be extracted from any set of Concepts.
-
-
77. A method for defining and generating a set of concepts and identifying said concepts in text, comprising:
-
a) identifying linguistic entities in the text of documents and other text-forms;
b) annotating said identified linguistic entities in a text markup language to produce linguistically annotated documents and other text-forms;
c) storing said linguistically annotated documents and other text-forms;
d) defining Concepts that also makes use of Patterns wherein;
i) each of said Concepts comprises a Pattern;
ii) each of said Patterns comprising one of the following;
1) a Basic Pattern comprising a description sufficiently constrained to be matchable to zero or more extents;
each of said extents comprising a set of zero or more items wherein each of said items is an instance of a linguistic entity;
each of said instances of said linguistic entity is identified in c) text, or b) a knowledge resource;
orc) both a) and b); and
said Basic Pattern is matchable to zero or more of said extents corresponding to said description;
or2) an Operator Pattern comprising an Operator and a list of zero or more Arguments wherein each of said Arguments is a further Pattern; and
said Operator Pattern is matchable to extents that are the result of applying said Operator to further extents that are matchable by said Arguments;
or3) a Concept Call comprising a reference to a further Concept comprising a further Pattern; and
said Concept Call is matchable to extents that are matchable by said further Pattern; and
iii) any said further Pattern is a Pattern; and
e) generating said Concepts from text of documents and other text-forms, and other sources of knowledge;
f) managing said Concepts, both generated and non-generated;
g) identifying Concepts using linguistic information, where said Concepts occur in one of;
i) said text of documents and other text-forms in which linguistic entities have been identified in step a);
oriv) said linguistically annotated documents and other text-forms of step b);
orv) stored linguistically annotated documents and other text-forms of step c);
h) annotation of said identified Concepts in said text markup language to produce conceptually annotated documents and other text-forms;
i) storage of said conceptually annotated documents and other text-forms. - View Dependent Claims (78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112)
-
Specification