Using semantic feature structures for document comparisons
Abstract
Document comparisons can be performed at a semantic level by utilizing a rules base in which groups of rules are applied sequentially. In one implementation, (1) syntactic rules are applied to a document to form a tagged sequence in which individual words are tagged with their syntactic categories, (2) ambiguity rules are applied to the tagged sequence to resolve ambiguities, thereby providing a resolved tagged sequence, (3) grammar rules are applied to the resolved tagged sequence to determine semantic roles of individual tagged words, thereby providing a role-specific resolved tagged sequence, and (4) property rules are applied to match properties (e.g., adjectives) with the words they modify, thereby providing a semantic feature structure. The semantic feature structure is then compared to at least one other structure.
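The four sequential rule groups described in the abstract can be illustrated with a small Python sketch. This is only a toy under stated assumptions, not the patented implementation: the lexicon, the tag names (DET, ADJ, NOUN, VERB), the role names (agent, action, object), and every function name are hypothetical, and each rule group is reduced to one or two hand-written rules.

```python
from dataclasses import dataclass, field

# Hypothetical toy dictionary: word -> candidate syntactic categories.
LEXICON = {
    "the": ["DET"],
    "angry": ["ADJ"],
    "user": ["NOUN"],
    "deleted": ["VERB"],
    "secret": ["ADJ", "NOUN"],
    "files": ["NOUN", "VERB"],
}

@dataclass
class Token:
    word: str
    tags: list                      # candidate syntactic categories
    role: str = ""                  # semantic role set by the grammar rules
    properties: list = field(default_factory=list)

def apply_syntactic_rules(text):
    """Step 1: tag every word with all categories the lexicon allows."""
    return [Token(w, list(LEXICON.get(w, ["NOUN"]))) for w in text.lower().split()]

def apply_ambiguity_rules(tokens):
    """Step 2: resolve multi-tag words from the tags of their neighbours."""
    for i, tok in enumerate(tokens):
        if len(tok.tags) == 1:
            continue
        prev_tag = tokens[i - 1].tags[0] if i > 0 else None
        next_tags = tokens[i + 1].tags if i + 1 < len(tokens) else []
        if "NOUN" in tok.tags and prev_tag in ("DET", "ADJ") and "NOUN" not in next_tags:
            tok.tags = ["NOUN"]     # head noun closing a noun phrase
        elif "ADJ" in tok.tags and "NOUN" in next_tags:
            tok.tags = ["ADJ"]      # modifier placed before a possible noun
        else:
            tok.tags = tok.tags[:1] # fallback: keep the first listed category
    return tokens

def apply_grammar_rules(tokens):
    """Step 3: assign semantic roles relative to the verb."""
    seen_verb = False
    for tok in tokens:
        tag = tok.tags[0]
        if tag == "VERB":
            tok.role = "action"
            seen_verb = True
        elif tag == "NOUN":
            tok.role = "object" if seen_verb else "agent"
    return tokens

def apply_property_rules(tokens):
    """Step 4: attach adjectives to the noun they immediately precede."""
    structure, pending = {}, []
    for tok in tokens:
        tag = tok.tags[0]
        if tag == "ADJ":
            pending.append(tok.word)
            continue
        if tag == "NOUN":
            tok.properties = pending
        pending = []
        if tok.role:
            # Simplification: a later word with the same role overwrites an earlier one.
            structure[tok.role] = {"word": tok.word, "properties": tok.properties}
    return structure

def build_feature_structure(text):
    """Run the four rule groups in sequence."""
    tokens = apply_syntactic_rules(text)
    tokens = apply_ambiguity_rules(tokens)
    tokens = apply_grammar_rules(tokens)
    return apply_property_rules(tokens)
```

Calling `build_feature_structure("The angry user deleted the secret files")` in this sketch yields a dictionary that records "user" as the agent with property "angry", "deleted" as the action, and "files" as the object with property "secret". The point of the ordering is that each stage consumes the resolved output of the previous one, so the adjacency-based property rule only ever sees unambiguous tags.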
29 Claims
1. A method of enabling semantic comparisons of computer readable textual items comprising:
generating a rules base as a mechanism for implementing said comparisons, including;
(a) defining syntactic rules for associating syntactic categories with individual words within sentence structures;
(b) defining grammar rules for determining semantic roles of at least some of said words within said sentence structures; and
(c) defining property rules for associating semantic properties with particular said words, at least some of said property rules being based upon adjacencies of said words in said sentence structures;
enabling applications of said rules base to each of a plurality of said textual items, wherein applying said rules base to a specific said textual item generates an output representative of said syntactic categories and said semantic roles and properties determined to be associated with words within sentence structures of said specific textual item; and
enabling comparison of said output to at least one second output that is representative of syntactic categories and semantic roles and properties determined to be associated with words within sentence structures of another textual item.
14. A method of monitoring network activity comprising:
identifying a document transmitted via a network being monitored;
generating a semantic feature structure from said document, including applying predefined rules of syntax to categorize words of said document on a basis of parts of speech and further including applying predefined rules of grammar to associate said categorized words with semantic features of activities described in said document;
comparing said semantic feature structure to at least one reference semantic feature structure, including determining similarity between said semantic feature structure and each said reference semantic feature structure for which said comparing is performed; and
using determinations of said similarity as a basis for selectively filtering said document.
21. Storage of computer readable programming in which said programming comprises:
a dictionary of words in which said words are associated with parts of speech;
a rules base configured to be cooperative with said dictionary in converting documents to semantic feature structures, said rules base including syntax rules, grammar rules and property rules;
a parts-of-speech tagger module configured to access said rules base in applying said syntax rules to sentence structures of each said document so as to assign parts-of-speech tags to words of said sentence structure;
a grammar-based module operatively associated with said parts-of-speech module and said rules base to apply said grammar rules following assignments of said parts-of-speech tags, said grammar-based module being configured to identify said words of said sentence structures of said document with semantic features of activities described in said sentence structures; and
a property-based module operatively associated with said grammar-based module and said rules base to apply said property rules to following applications of said grammar rules, said property-based module being configured to assign semantic properties to at least some of said words, wherein at least some assignments of semantic properties are based on adjacencies of particular said words in said sentence structures.
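The comparison and filtering steps recited in claims 1 and 14 can be sketched as follows. The claims do not name a similarity measure, so this example assumes Jaccard overlap purely for illustration. The dictionary layout of a semantic feature structure (role mapped to a word and its properties), the function names, and the 0.5 threshold are likewise hypothetical.

```python
def flatten(structure):
    """Turn a feature structure into a set of comparable features."""
    features = set()
    for role, entry in structure.items():
        features.add((role, "word", entry["word"]))
        for prop in entry.get("properties", []):
            features.add((role, "property", prop))
    return features

def similarity(a, b):
    """Jaccard overlap of two feature structures (an assumed measure)."""
    fa, fb = flatten(a), flatten(b)
    if not fa and not fb:
        return 1.0
    return len(fa & fb) / len(fa | fb)

def should_filter(document_structure, references, threshold=0.5):
    """Filter the document if it is close enough to any reference structure."""
    return any(similarity(document_structure, ref) >= threshold for ref in references)

# Hypothetical structures for a monitored document and two references.
document = {
    "agent": {"word": "user", "properties": ["angry"]},
    "action": {"word": "deleted", "properties": []},
    "object": {"word": "files", "properties": ["secret"]},
}
reference = {
    "agent": {"word": "user", "properties": []},
    "action": {"word": "deleted", "properties": []},
    "object": {"word": "files", "properties": ["confidential"]},
}
unrelated = {
    "agent": {"word": "team", "properties": []},
    "action": {"word": "scheduled", "properties": []},
    "object": {"word": "meeting", "properties": []},
}

print(similarity(document, reference))       # 3 shared features out of 6 distinct
print(should_filter(document, [reference]))
print(should_filter(document, [unrelated]))
```

Because features are keyed by semantic role, two documents that use the same words in different roles (for example, swapping agent and object) share fewer features than a bag-of-words comparison would report, which is the motivation for comparing structures rather than raw text.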
Specification