Concept matching
First Claim
1. A method for developing a system for retrieving text related to a selected concept within a text corpus, the method comprising:
- identifying a set of at least three semantic classes which in combinations thereof express the concept, each of the semantic classes expressing a constituent notion of the concept;
identifying a set of keywords for each of the semantic classes to be used in text searching in a text corpus, each set of keywords including at least one user-selected keyword, at least some of the semantic classes including keywords which are used in relevant expressions in retrieved text when the constituent notion is being conveyed and including keywords having different meanings from other keywords of the same semantic class and which are not synonymous with the other keywords;
establishing, with a processing unit of a computer system, a plurality of syntactic rules to be applied to retrieved text which includes keywords, each of the syntactic rules identifying a first of the set of semantic classes and a second of the set of semantic classes, the rule being satisfied when any keyword from the first of the semantic classes is in a syntactic relationship with any keyword from the second of the semantic classes, the syntactic relationship comprising any one of a plurality of syntactic relationships.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for developing a system for retrieving text related to a selected concept within a text corpus includes identifying a set of semantic classes which express the concept and identifying a set of keywords for each of the semantic classes to be used in text searching in a text corpus. Each set of keywords includes at least one keyword. A plurality of syntactic rules are established which are to be applied to retrieved text which includes keywords. Each of the syntactic rules identifies a first of the semantic classes and a second of the semantic classes. A rule is satisfied when a keyword from the first of the semantic classes is in a syntactic relationship with a keyword from the second of the semantic classes. The syntactic relationship can be any one of a plurality of syntactic relationships.
44 Citations
20 Claims
-
1. A method for developing a system for retrieving text related to a selected concept within a text corpus, the method comprising:
-
identifying a set of at least three semantic classes which in combinations thereof express the concept, each of the semantic classes expressing a constituent notion of the concept; identifying a set of keywords for each of the semantic classes to be used in text searching in a text corpus, each set of keywords including at least one user-selected keyword, at least some of the semantic classes including keywords which are used in relevant expressions in retrieved text when the constituent notion is being conveyed and including keywords having different meanings from other keywords of the same semantic class and which are not synonymous with the other keywords; establishing, with a processing unit of a computer system, a plurality of syntactic rules to be applied to retrieved text which includes keywords, each of the syntactic rules identifying a first of the set of semantic classes and a second of the set of semantic classes, the rule being satisfied when any keyword from the first of the semantic classes is in a syntactic relationship with any keyword from the second of the semantic classes, the syntactic relationship comprising any one of a plurality of syntactic relationships. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method for developing a system for retrieving text related to a selected concept within a text corpus, the method comprising:
-
identifying a set of at least four semantic classes which in combinations thereof express the concept; identifying a set of keywords for each of the semantic classes, where the keywords are to be used by the system in text searching in a text corpus, each set of keywords including at least one keyword, at least some of the semantic classes including keywords which are used in relevant expressions in retrieved text when the constituent notion is being conveyed and including keywords having different meanings from other keywords of the same semantic class and which are not synonymous with the other keywords; establishing, with a processing unit of a computer system, a plurality of syntactic rules which are to be applied by the system to retrieved text which includes keywords from the sets of keywords, each of the syntactic rules identifying a first of the semantic classes and a second of the semantic classes, the rule being satisfied when any keyword from the first of the semantic classes is in a syntactic relationship with any keyword from the second of the semantic classes, the syntactic relationship comprising any one of a plurality of syntactic relationships, the establishing of the plurality of syntactic rules comprising; automatically parsing selected examples of text to identify syntactic relationships between a keyword of a first of the semantic classes and a keyword of a second of the semantic classes; and presenting the identified relationships to a user for manually selecting syntactic rules which select those syntactic relationships which express the desired concept.
-
-
19. A method for retrieving text related to a selected concept within a text corpus comprising:
-
identifying a set of semantic classes which, in combinations thereof, express the concept, each of the semantic classes expressing a constituent notion of the concept; identifying a set of keywords for each of the semantic classes to be used in text searching in a text corpus, each set of keywords including a plurality of keywords, at least some of the semantic classes including keywords which are used in relevant expressions in retrieved text when the constituent notion is being conveyed and including keywords having different meanings from other keywords of the same semantic class and which are not synonymous with the other keywords; establishing, with a processing unit of a computer system, a plurality of syntactic rules to be applied to retrieved text which includes keywords, each of the syntactic rules identifying a first of the semantic classes and a second of the semantic classes, the rule being satisfied when any one of the keywords from the first of the semantic classes and any one of the keywords from the second of the semantic classes are in any one of a plurality of syntactic relationships; and applying the syntactic rules to a text corpus to identify text within the text corpus which satisfies at least one of the syntactic rules.
-
-
20. A computer system for developing a text retrieval system comprising:
-
a memory for storing a set of semantic classes which, in combinations thereof, express a concept, each of the semantic classes expressing a constituent notion of the concept; memory for storing keywords for each of the semantic classes to be used in text searching in a text corpus and for storing syntactic rules to be applied to retrieved text which includes keywords, at least some of the semantic classes including keywords which are used in relevant expressions in retrieved text when the constituent notion is being conveyed and including keywords having different meanings from other keywords of the same semantic class and which are not synonymous with the other keywords; a component which suggests sample sentences to a user which include one or more of the stored keywords; a user input device for enabling a user to select sentences from the suggested sentences; and a component which proposes syntactic rules which are met by the selected sentences, each of the syntactic rules identifying a first of the set of semantic classes and a second of the set of semantic classes, the rule being satisfied when any keyword from the first of the semantic classes is in a syntactic relationship with any keyword from the second of the semantic classes, the syntactic relationship comprising any one of a plurality of syntactic relationships, whereby when the syntactic rules are applied to a text corpus, text within the text corpus which satisfies at least one of the syntactic rules is retrieved, the component presenting the proposed syntactic rules to the user on a display.
-
Specification