Methods and apparatus for thematic parsing of discourse
First Claim
1. A method for parsing thematic context of input discourse, said method comprising the steps of:
- specifying a plurality of thematic constructions that define content of discourse;
testing each of said thematic constructions with said input discourse to identify applicability of each of said thematic constructions to said input discourse; and
generating a thematic context output for said input discourse by generating, for each word in said input discourse, a thematic tag to indicate applicability of each of said thematic constructions to said input discourse.
2 Assignments
0 Petitions
Accused Products
Abstract
A theme parsing system that determines the thematic context of input discourse is disclosed. Each word used in a language carries thematic information that conveys the importance of the meaning and content of the discourse. The theme parsing system discriminates words and phrases of the input discourse, identifying the type of importance or meaning, the impact on different parts of the discourse, and the overall contribution to the content of the discourse. The thematic conext of the discourse is determined in accordance with predetermined theme assessment criteria that is a function of the strategic importance of the discriminated words. The predetermined thematic assessment criteria defines which of the discriminated words are to be selected for each thematic analysis unit. The discourse is then output in a predetermined thematic format as a different view to a user, giving the topics of the discourse in a topic extractor, generating summarized versions of the discourse in a kernel generator, and identifying the key content of the discourse in a content extractor.
-
Citations
45 Claims
-
1. A method for parsing thematic context of input discourse, said method comprising the steps of:
-
specifying a plurality of thematic constructions that define content of discourse; testing each of said thematic constructions with said input discourse to identify applicability of each of said thematic constructions to said input discourse; and generating a thematic context output for said input discourse by generating, for each word in said input discourse, a thematic tag to indicate applicability of each of said thematic constructions to said input discourse. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for determining thematic context of input discourse, said method comprising the steps of:
-
generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions to thematic units of said input discourse; and searching said thematic context output for pre-defined combinations of said thematic tags; and extracting, in combination, said thematic units corresponding to said pre-defined combination of said thematic tags to extract a theme for said input stream.
-
-
9. A method for generating kernel sentences from input discourse, said method comprising the steps of:
-
generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions including a plurality of thematic contextual elements for said input discourse; and accessing said thematic tags in said thematic context output; and removing words from each sentence of said input discourse if a thematic tag for a corresponding word indicates applicability to one of said thematic contextual elements, wherein words that exist define a kernel sentence.
-
-
10. A method for determining a topic for each sentence in an input discourse, said method comprising the steps of:
-
generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions that define content for said input discourse; accessing said thematic tags in said thematic context output; and extracting a main topic for said sentence based on said thematic tags including extracting context words that support said main topic. - View Dependent Claims (11, 12)
-
-
13. A method for extracting content in an input discourse, said method comprising the steps of:
-
generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions that define content for said input discourse; and extracting, for each sentence in said input discourse, words that identify at least one of a plurality of major thematic points based on said thematic tags, wherein said words extracted identify content for each sentence in said input discourse. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer readable medium for storing instructions, which when executed by a computer, causes the computer to perform the steps of:
-
storing a plurality of thematic constructions that define content of discourse; testing each of said thematic constructions with said input discourse to identify applicability of each of said thematic constructions to said input discourse; and generating a thematic context output for said input discourse by generating, for each word in said input discourse, a thematic tag to indicate applicability of each of said thematic constructions to said input discourse. - View Dependent Claims (22, 23, 24, 25, 26, 27)
-
-
28. A computer readable medium for storing instructions, which when executed by a computer, causes the computer to perform the steps of:
-
generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions to thematic units of said input discourse; and searching said thematic context output for pre-defined combinations of said thematic tags; and extracting, in combination, said thematic units corresponding to said pre-defined combination of said thematic tags to extract a theme for said input stream.
-
-
29. A computer readable medium for storing instructions, which when executed by a computer, causes the computer to perform the steps of:
-
generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions including a plurality of thematic contextual elements for said input discourse; and accessing said thematic tags in said thematic context output; and removing words from each sentence of said input discourse if a thematic tag for a corresponding word indicates applicability to one of said thematic contextual elements, wherein words that exist define a kernel sentence.
-
-
30. A computer readable medium for storing instructions, which when executed by a computer, causes the computer to perform the steps of:
-
generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions that define content for said input discourse; and accessing said thematic tags in said thematic context output; and
extracting a main topic for said sentence based on said thematic tags including extracting context words that support said main topic. - View Dependent Claims (31, 32)
-
-
33. A computer readable medium for storing instructions, which when executed by a computer, causes the computer to perform the steps of:
-
generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions that define content for said input discourse; and extracting, for each sentence in said input discourse, words that identify at least one of a plurality of major thematic points based on said thematic tags, wherein said words extracted identify content for each sentence in said input discourse. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40)
-
-
41. An apparatus for determining themes in an input discourse, said apparatus comprising:
-
means for specifying a plurality of thematic constructions that define content of discourse; means for testing each of said thematic constructions with said input discourse to identify applicability of each of said thematic constructions to said input discourse; and means for generating a thematic context output for said input discourse by generating, for each word in said input discourse, a thematic tag to indicate applicability of each of said thematic constructions to said input discourse.
-
-
42. An apparatus for determining thematic context of an input discourse, said apparatus comprising:
-
means for generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions to thematic units of said input discourse; and means for searching said thematic context output for pre-defined combinations of said thematic tags; and means for extracting, in combination, said thematic units corresponding to said predefined combination of said thematic tags to extract a theme for said input stream.
-
-
43. An apparatus for generating kernel sentences from input discourse, said apparatus comprising:
-
means for generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions including a plurality of thematic contextual elements for said input discourse; means for accessing said thematic tags in said thematic context output; and means for removing words from each sentence of said input discourse if a thematic tag for a corresponding word indicates applicability to one of said thematic contextual elements, wherein words that exist define a kernel sentence.
-
-
44. An apparatus for determining a topic for each sentence in an input discourse, said apparatus comprising:
-
means for generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions that define content for said input discourse; and means for accessing said thematic tags in said thematic context output; and means for extracting a main topic for said sentence based on said thematic tags including extracting context words that support said main topic.
-
-
45. An apparatus for extracting content in an input discourse, said apparatus comprising:
-
means for generating a thematic context output for said input discourse comprising a plurality of thematic tags, wherein said thematic tags indicate applicability of a plurality of thematic constructions that define content for said input discourse; and means for extracting, for each sentence in said input discourse, words that identify at least one of a plurality of major thematic points based on said thematic tags, wherein said words extracted identify content for each sentence in said input discourse.
-
Specification