Subtitle generation and retrieval combining document processing with voice processing
Abstract
Provides subtitle generation methods and apparatus that recognize voice in a presentation to generate subtitles thereof, and a retrieval apparatus for retrieving character strings by use of the subtitles. An apparatus of the present invention includes: an extraction unit for extracting text from presentation documents; an analysis unit for morphologically analyzing the text to decompose it into words; a generation unit for generating common keywords by assigning weights to the words; a registration unit for adding the common keywords to a voice recognition dictionary; a recognition unit for recognizing voice in a presentation; a record unit for recording the correspondence between page and time by detecting page switching events; a regeneration unit for regenerating the common keywords by further referring to the correspondence between page and time; a control unit for controlling the display of subtitles, common keywords, text and master subtitles; and a note generation unit for generating speaker notes from the subtitles.
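The keyword path described in the abstract (extract text, split it into words, weight the words, register the result in the recognition dictionary) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the abstract does not say how weights are assigned, so the title/body weights, the stopword list, the regex tokenizer standing in for morphological analysis, and the names `tokenize` and `common_keywords` are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical weights: the abstract only says words are weighted, not how.
TITLE_WEIGHT = 3.0
BODY_WEIGHT = 1.0
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "for", "is", "on", "with"}

def tokenize(text):
    """Stand-in for morphological analysis: lowercase alphabetic tokens."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]

def common_keywords(pages, top_n=3):
    """pages: list of (title, body) strings. Returns the top_n weighted words."""
    scores = Counter()
    for title, body in pages:
        for word in tokenize(title):
            scores[word] += TITLE_WEIGHT
        for word in tokenize(body):
            scores[word] += BODY_WEIGHT
    # Sort by descending score, then alphabetically so ties are deterministic.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:top_n]]

pages = [
    ("Subtitle generation", "Voice recognition produces a subtitle for each utterance."),
    ("Recognition dictionary", "Keywords are added to the recognition dictionary."),
]

# A plain set stands in for the voice recognition dictionary.
dictionary = set()
dictionary.update(common_keywords(pages))
```

A real recognizer would expose its own vocabulary interface; the set above only marks where registration would happen.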
23 Claims
1. An apparatus for recognizing voice in a presentation to generate a subtitle corresponding to the voice, comprising:
extraction means for extracting a keyword from document data used in the presentation; and
processing means for one of generating the subtitle and assisting in generating the subtitle, by use of the keyword extracted by the extraction means.
Dependent claims: 2, 3, 4, 5, 6, 7, 8.
9. An apparatus for retrieving a character string, comprising:
storage means for storing first text data obtained by recognizing voice in a presentation, second text data extracted from document data used in the presentation, and associated information of the first text data and the second text data; and
retrieval means for retrieving, by use of the associated information, the character string from text data composed of the first text data and the second text data.
Dependent claims: 10.
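One way to picture the retrieval apparatus of claim 9 is a store that keeps recognized subtitles (first text data) and slide text (second text data) together, keyed by page as the associated information. The sketch below is an assumption about one possible layout, not the claimed implementation; `PageRecord`, `retrieve`, and the case-insensitive substring match are hypothetical choices.

```python
from dataclasses import dataclass, field

@dataclass
class PageRecord:
    page: int                                       # associated information
    slide_text: str                                 # second text data (from the document)
    subtitles: list = field(default_factory=list)   # first text data (from recognition)

def retrieve(records, query):
    """Return (page, source, text) for every text that contains the query."""
    q = query.lower()
    hits = []
    for rec in records:
        if q in rec.slide_text.lower():
            hits.append((rec.page, "document", rec.slide_text))
        for line in rec.subtitles:
            if q in line.lower():
                hits.append((rec.page, "subtitle", line))
    return hits

records = [
    PageRecord(1, "Quarterly results", ["Welcome everyone", "Revenue grew this quarter"]),
    PageRecord(2, "Revenue by region", ["Europe led the growth"]),
]
hits = retrieve(records, "revenue")
```

Because each hit carries its page, a match in a subtitle can lead back to the slide shown while it was spoken, and a match in slide text can lead to the speech recorded for that slide.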
11. A method of causing a computer to combine a processing of a document having a plurality of pages with a processing of voice generated with reference to the document, comprising the steps of:
causing the computer to determine, among subtitles obtained by recognizing the voice, a specific subtitle obtained by recognizing voice generated with reference to a specific page of the document; and
causing the computer to store the correspondence between the specific subtitle and the specific page.
Dependent claims: 12, 13, 14, 15, 16.
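The abstract mentions recording the correspondence between page and time by detecting page switching events. Assuming each subtitle also carries a timestamp (an assumption, since the claim does not say how the specific subtitle is determined), the determination in claim 11 could be sketched as a lookup of each subtitle time in the sorted list of page switch times. The function name `assign_pages` and the data shapes are hypothetical.

```python
from bisect import bisect_right

def assign_pages(page_switches, subtitles):
    """Group subtitle texts by the page shown when each was spoken.

    page_switches: list of (start_seconds, page), sorted by start_seconds.
    subtitles: list of (seconds, text).
    """
    starts = [t for t, _ in page_switches]
    by_page = {page: [] for _, page in page_switches}
    for t, text in subtitles:
        # Index of the last page switch at or before time t.
        i = bisect_right(starts, t) - 1
        if i < 0:
            continue  # spoken before the first page was shown
        by_page[page_switches[i][1]].append(text)
    return by_page

switches = [(0.0, 1), (30.0, 2), (75.0, 3)]
spoken = [(5.0, "intro"), (30.0, "method"), (74.9, "details"), (80.0, "summary")]
correspondence = assign_pages(switches, spoken)
```

A subtitle spoken exactly at a switch time is attributed to the new page; that boundary rule is a design choice of this sketch.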
17. A program product which allows a computer to realize:
a function of extracting a keyword from document data used in a presentation; and
a function of generating a subtitle corresponding to voice in the presentation or assisting in generating the subtitle, by use of the extracted keyword.
Dependent claims: 18, 19, 20.
21. A program product which allows a computer to realize:
a function of determining, among subtitles obtained by recognizing voice generated with reference to a predetermined document, a specific subtitle obtained by recognizing voice generated with reference to a specific page of the document; and
a function of storing a correspondence between the specific subtitle and the specific page.
Dependent claims: 22, 23.
Specification