Text and question generating apparatus and method
Abstract
The object is to extract words or the like that are closely related to the contents of a text from that same text, without the cost of an excessively large amount of manpower, and thereby to generate information about the text using the extracted words or the like. The text information generating method and apparatus comprise: an attribute input section for inputting an artificial attribute; a discourse structure attribute generating section for generating a discourse structure attribute and a paragraph length ratio attribute; a combination attribute generating section for generating a combination attribute by freely combining the artificial attribute, the discourse structure attribute, and the paragraph length ratio attribute; an importance degree estimating section for estimating, for each attribute, an importance degree indicating the enhancement degree of correlation with the contents of the text; a text input interface; an important paragraph determining section for determining the important paragraphs from one or more paragraphs in the input text on the basis of the importance degree of each attribute; and a text output interface for outputting the information of the input text generated on the basis of the determination of the important paragraph determining section.
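The flow summarized in the abstract can be illustrated with a minimal Python sketch. Everything concrete in it is an assumption made for illustration and is not taken from the patent: the "matching pattern" is treated as the substring matched by a regular expression, a combination attribute is formed as a pair of existing attribute labels, and the importance degree of an attribute is approximated as the fraction of labeled training paragraphs carrying that attribute that were marked important. The function names are hypothetical.

```python
import re
from itertools import combinations


def paragraph_length_ratio(paragraph, pattern):
    """Characters in the paragraph divided by characters in the matched pattern.

    Assumption: the 'matching pattern' is the text matched by a regex.
    Returns None when nothing (or only an empty string) matches.
    """
    match = re.search(pattern, paragraph)
    if match is None or len(match.group(0)) == 0:
        return None
    return len(paragraph) / len(match.group(0))


def combination_attributes(attrs):
    """Build pairwise combination attributes from a set of attribute labels."""
    return {" & ".join(pair) for pair in combinations(sorted(attrs), 2)}


def estimate_importance(training):
    """Estimate an importance degree per attribute.

    training: list of (set_of_attributes, is_important) pairs.
    Assumption: importance degree = share of paragraphs carrying the
    attribute that were labeled important.
    """
    seen, hits = {}, {}
    for attrs, important in training:
        for attr in attrs:
            seen[attr] = seen.get(attr, 0) + 1
            hits[attr] = hits.get(attr, 0) + int(important)
    return {attr: hits[attr] / seen[attr] for attr in seen}


def important_paragraphs(paragraph_attrs, importance, threshold=0.5):
    """Return indices of paragraphs whose best attribute reaches the threshold."""
    selected = []
    for index, attrs in enumerate(paragraph_attrs):
        scores = [importance.get(attr, 0.0) for attr in attrs]
        if scores and max(scores) >= threshold:
            selected.append(index)
    return selected
```

In this sketch the threshold and the use of the maximum score are arbitrary design choices; a weighted sum or a learned classifier would fit the same outline.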
10 Claims
1. A text information generating apparatus, comprising:
attribute input section operatively connected to receive at least one artificial attribute associated with a paragraph;
discourse structure attribute generating section operatively connected to generate a discourse structure attribute related to a discourse structure that is associated with said paragraph and a paragraph length ratio attribute related to a ratio of a number of characters in said paragraph to the number of characters of a matching pattern matched with said paragraph;
combination attribute generating section operatively connected to generate a combination attribute based on at least two of the artificial attribute, the discourse structure attribute, and the paragraph length ratio attribute;
text input interface operatively connected to receive text;
importance degree estimating section operatively connected to estimate an importance degree indicating an enhancement degree of correlation between said paragraph and the text based on at least one of the artificial attribute, the discourse structure attribute, the paragraph length ratio, and the combination attribute;
important paragraph determining section operatively connected to determine an important paragraph having higher correlation with the text based on the estimated importance degree of each attribute from one or more paragraphs in the text; and
text output interface operatively connected to provide information of the text that is based on the determination of said important paragraph determining section.
Dependent claims: 4, 5, 6, 7
2. A text information generating apparatus comprising:
attribute input section operatively connected to receive at least one artificial attribute that is associated with a paragraph;
discourse structure attribute generating section operatively connected to generate a discourse structure attribute related to a discourse structure and associated with said paragraph and a paragraph length attribute related to a ratio of a number of characters of said paragraph to a number of characters of a matching pattern matched to said paragraph;
word attribute generating section operatively connected to generate word attribute related to words of said paragraph;
combination attribute generating section operatively connected to generate a combination attribute based on at least two of the artificial attribute, the discourse structure attribute, the paragraph length ratio attribute, and the word attribute;
text input interface operatively connected to receive text;
importance degree estimating section operatively connected to estimate an importance degree indicating an enhancement degree of correlation between said paragraph and the text based on at least one of the artificial attribute, the discourse structure attribute, the paragraph length ratio attribute, the word attribute, and the combination attribute;
important paragraph determining section operatively connected to determine, based on the estimated importance degree of each attribute, an important paragraph having a higher correlation with the text from one or more paragraphs in the text; and
text output interface operatively connected to provide information of the text that is based on the determination of said important paragraph determining section.
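As an illustration of the word attribute added in claim 2, the following Python sketch derives one attribute per distinct word of a paragraph and merges it with attributes from other sources. The tokenization rule (lowercased runs of ASCII letters and digits), the `word:` prefix, and both function names are assumptions for this example; the claim itself does not specify how word attributes are formed.

```python
import re


def word_attributes(paragraph):
    """Return one hypothetical 'word:<token>' attribute per distinct word."""
    tokens = re.findall(r"[a-z0-9]+", paragraph.lower())
    return {"word:" + token for token in tokens}


def all_attributes(artificial, structural, paragraph):
    """Merge artificial, structural, and word attributes into one set.

    'artificial' and 'structural' are sets of labels produced elsewhere.
    """
    return set(artificial) | set(structural) | word_attributes(paragraph)
```

The merged set could then be passed to whatever combination and importance estimation steps are in use.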
3. A text information generating apparatus comprising:
attribute input section operatively connected to receive at least one artificial attribute that is associated with a paragraph;
discourse structure attribute generating section operatively connected to generate a discourse structure attribute related to a discourse structure that may be associated with said paragraph and a paragraph length ratio attribute related to a ratio of a number of characters in said paragraph to the number of characters of a matching pattern matched with said paragraph;
combination attribute generating section operatively connected to generate a combination attribute based on at least two of the artificial attribute, the discourse structure attribute, and the paragraph length ratio attribute;
text input interface operatively connected to receive text;
importance degree estimating section operatively connected to estimate an importance degree indicating an enhancement degree of correlation between said paragraph and the text based on at least one of the artificial attribute, the discourse structure attribute, the paragraph length ratio, and the combination attribute, and to determine at least one surplus attribute from at least two of the artificial attribute, the discourse structure attribute, the paragraph length ratio, and the combination attribute;
surplus attribute deleting section operatively connected to delete the determined surplus attribute from the attributes utilized by said importance degree estimating section;
important paragraph determining section operatively connected to determine, from one or more paragraphs, an important paragraph having higher correlation with contents of text based on the estimated importance degree of the attribute not determined to be a surplus attribute; and
text output interface operatively connected to provide information of the text based on the determination of said important paragraph determining section.
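The surplus attribute step of claim 3 can be sketched as follows. The claim does not state how a surplus attribute is identified, so this example assumes one possible criterion: an attribute is surplus when its estimated importance degree lies within a small margin of a baseline (for instance, the overall share of important paragraphs), meaning it barely separates important paragraphs from the rest. The function names, the baseline, and the margin are hypothetical.

```python
def find_surplus(importance, baseline, margin=0.05):
    """Return attributes whose importance degree is close to the baseline.

    Assumption: closeness to the baseline marks an attribute as uninformative.
    """
    return {attr for attr, degree in importance.items()
            if abs(degree - baseline) <= margin}


def delete_surplus(importance, surplus):
    """Drop surplus attributes so later steps use only the remaining ones."""
    return {attr: degree for attr, degree in importance.items()
            if attr not in surplus}


# Usage with made-up importance degrees.
importance = {"heading": 0.9, "len_ratio_high": 0.52, "word:the": 0.5}
surplus = find_surplus(importance, baseline=0.5)
kept = delete_surplus(importance, surplus)
```

Other criteria, such as redundancy between a combination attribute and its component attributes, would fit the same two-step shape of determining and then deleting.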
8. A text information generating method, comprising:
receiving at least one artificial attribute that is associated with a paragraph;
generating a discourse structure attribute related to a discourse structure that is associated with said paragraph and a paragraph length ratio attribute related to a ratio of a number of characters in said paragraph to the number of characters of a matching pattern matched with said paragraph;
generating a combination attribute based on at least two of the artificial attribute, the discourse structure attribute, and the paragraph length ratio attribute;
receiving text;
estimating an importance degree indicating an enhancement degree of correlation between said paragraph and the text based on at least one of the artificial attribute, the discourse structure attribute, the paragraph length ratio, and the combination attribute;
determining an important paragraph having higher correlation with the text based on the estimated importance degree of each attribute from one or more paragraphs in the text; and
providing information of the text that is based on the determining.
9. A text information generating method comprising:
receiving at least one artificial attribute that is associated with a paragraph;
generating a discourse structure attribute related to a discourse structure and associated with said paragraph and a paragraph length attribute related to a ratio of a number of characters of said paragraph to a number of characters of a matching pattern matched to said paragraph;
generating a word attribute related to words of said paragraph;
generating a combination attribute based on at least two of the artificial attribute, the discourse structure attribute, the paragraph length ratio attribute, and the word attribute;
receiving text;
estimating an importance degree indicating an enhancement degree of correlation between said paragraph and the text based on at least one of the artificial attribute, the discourse structure attribute, the paragraph length ratio attribute, the word attribute, and the combination attribute;
determining, based on the estimated importance degree of each attribute, an important paragraph having a higher correlation with the text from one or more paragraphs in the text; and
providing information of the text that is based on the determination of said important paragraph determining section.
10. A text information generating method comprising:
receiving at least one artificial attribute that is associated with a paragraph;
generating a discourse structure attribute related to a discourse structure that may be associated with said paragraph and a paragraph length ratio attribute related to a ratio of a number of characters in said paragraph to the number of characters of a matching pattern matched with said paragraph;
generating a combination attribute based on at least two of the artificial attribute, the discourse structure attribute, and the paragraph length ratio attribute;
receiving text;
estimating an importance degree indicating an enhancement degree of correlation between said paragraph and the text based on at least one of the artificial attribute, the discourse structure attribute, the paragraph length ratio, and the combination attribute, and to determine at least one surplus attribute from at least two of the artificial attribute, the discourse structure attribute, the paragraph length ratio, and the combination attribute;
deleting the determined surplus attribute from the attributes utilized in the estimation;
determining, from one or more paragraphs, an important paragraph having higher correlation with contents of text based on the estimated importance degree of the attribute not determined to be a surplus attribute; and
providing information of the text based on the determining of said important paragraph.
Specification