IDENTIFYING CORRESPONDING POSITIONS IN DIFFERENT REPRESENTATIONS OF A TEXTUAL WORK
Abstract
Described herein are techniques for determining corresponding positions between different representations of a textual work. In some of the techniques, portions of one or more representations may be processed. A determination of a corresponding position may be made in response to a request received from a user, such as a reader that desires to switch between representations. The request may indicate a position in one representation and the representation to which the user would like to switch. In response to receiving the request, one or more portions of one or more representations of a textual work may be processed. In some techniques, a corresponding position between different representations may be determined without processing the entirety of one or more representations of the textual work. For example, a corresponding position may be determined without processing an entire audio representation.
20 Claims
1. A method for use in identifying a target audio position, in an audio representation of a textual work, that corresponds to a first text position in an electronic representation of the textual work, the method comprising:
estimating a first estimated audio position in the audio representation based at least in part on the first text position in the electronic representation of the textual work;
performing automatic speech recognition (ASR) on a first audio segment appearing in the audio representation at the first estimated audio position to generate a first textual representation of the first audio segment;
identifying at least one second text position in the electronic representation of the textual work by searching the electronic representation of the textual work for text that matches the first textual representation; and
in response to determining that the at least one second text position does not comprise the first text position, estimating a second estimated position in the audio representation based at least in part on the at least one second text position.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
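The estimate, recognize, search, and re-estimate loop recited in claim 1 can be illustrated with a minimal sketch. It is not taken from the specification. It assumes a hypothetical `transcribe(seconds)` callable standing in for an ASR engine, a uniform narration rate for the initial estimate, and exact substring matching; the names `find_all` and `locate_audio_position` are illustrative only.

```python
from typing import Callable, List


def find_all(text: str, snippet: str) -> List[int]:
    """Return every character offset at which `snippet` occurs in `text`."""
    hits = []
    start = text.find(snippet)
    while start != -1:
        hits.append(start)
        start = text.find(snippet, start + 1)
    return hits


def locate_audio_position(
    text: str,
    target_pos: int,
    duration: float,
    transcribe: Callable[[float], str],  # hypothetical ASR stand-in
    max_rounds: int = 10,
) -> float:
    """Estimate the audio time (seconds) matching character `target_pos`."""
    # Assumption: narration speed is roughly uniform (seconds per character).
    rate = duration / len(text)
    estimate = target_pos * rate  # first estimated audio position
    for _ in range(max_rounds):
        estimate = min(max(estimate, 0.0), duration)
        heard = transcribe(estimate)  # ASR on the segment at the estimate
        hits = find_all(text, heard) if heard else []
        if not hits:
            break  # nothing recognizable here; keep the current estimate
        covering = [h for h in hits if h <= target_pos < h + len(heard)]
        if covering:
            # The recognized segment contains the target text position.
            return estimate + (target_pos - covering[0]) * rate
        # Otherwise re-estimate from the nearest matched text position.
        nearest = min(hits, key=lambda h: abs(h - target_pos))
        estimate += (target_pos - nearest) * rate
    return estimate
```

Only the short segments at each estimate are transcribed, so the loop never processes the entire audio representation. A practical system would likely need fuzzy matching, because ASR output rarely reproduces the source text verbatim.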
10. At least one computer-readable storage medium having encoded thereon computer-executable instructions that, when executed by at least one computer, cause the at least one computer to carry out a method of identifying a text position in an electronic version of a textual work that corresponds to a first audio position in an audio representation of the textual work, the method comprising:
performing automatic speech recognition (ASR) on a first audio segment appearing in the audio representation at the first audio position to generate a first textual representation of the first audio segment; and
identifying the text position by searching the electronic version of the textual work for text that matches the first textual representation.
Dependent claims: 11, 12, 13, 14, 15
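The audio-to-text direction of claim 10 can be sketched in a similar way. This is an illustration under stated assumptions, not the claimed implementation: `transcribe(seconds)` is again a hypothetical ASR stand-in, and matching is done on lowercased word sequences so that punctuation and capitalization, which ASR output usually lacks, do not prevent a match.

```python
import re
from typing import Callable, List, Tuple

WORD = re.compile(r"[A-Za-z0-9']+")


def tokenize(text: str) -> List[Tuple[str, int]]:
    """Return (lowercased word, character offset) pairs for `text`."""
    return [(m.group().lower(), m.start()) for m in WORD.finditer(text)]


def text_positions_for_audio(
    text: str,
    audio_pos: float,
    transcribe: Callable[[float], str],  # hypothetical ASR stand-in
) -> List[int]:
    """Return character offsets in `text` where the recognized words occur."""
    heard = [word for word, _ in tokenize(transcribe(audio_pos))]
    if not heard:
        return []
    tokens = tokenize(text)
    words = [word for word, _ in tokens]
    n = len(heard)
    # Slide a window of n words over the text and compare word sequences.
    return [
        tokens[i][1]
        for i in range(len(words) - n + 1)
        if words[i:i + n] == heard
    ]
```

The function returns every matching offset, because a short recognized phrase may occur more than once in the work. A caller could transcribe a longer segment, or compare candidates against a rough proportional estimate, to pick one.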
16. An apparatus comprising:
at least one processor; and
at least one computer-readable storage medium having encoded thereon executable instructions that, when executed by the at least one processor, cause the at least one processor to carry out a method of identifying a digital position in a digital representation of a textual work that corresponds to a hardcopy location in a hardcopy representation of at least a portion of the textual work, the method comprising:
performing a character recognition process on at least a portion of the hardcopy representation to generate a recognized textual representation; and
searching the digital representation for text matching at least some of the recognized textual representation to identify the digital position in the digital representation.
Dependent claims: 17, 18, 19, 20
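Claim 16 only requires a match on "at least some" of the recognized text, which suggests tolerance for character recognition errors. One way to sketch that, as an assumption rather than the patented method, is to search for short runs of consecutive words and let each hit vote for where the page begins. The `ocr` callable is a hypothetical stand-in for a character recognition engine, and `locate_hardcopy` is an illustrative name.

```python
from collections import Counter
from typing import Callable, Optional


def locate_hardcopy(
    digital: str,
    page_image: bytes,
    ocr: Callable[[bytes], str],  # hypothetical character recognition stand-in
    k: int = 3,
) -> Optional[int]:
    """Return the likely offset in `digital` where the scanned text begins.

    Assumes words in `digital` are separated by single spaces. The result is
    approximate when recognition errors change the length of a word.
    """
    words = ocr(page_image).split()
    votes: Counter = Counter()
    offset = 0  # offset of words[i] within the space-joined recognized text
    for i in range(len(words) - k + 1):
        shingle = " ".join(words[i:i + k])
        hit = digital.find(shingle)
        while hit != -1:
            # Each hit implies a start position for the whole scanned passage.
            votes[hit - offset] += 1
            hit = digital.find(shingle, hit + 1)
        offset += len(words[i]) + 1
    if not votes:
        return None
    return votes.most_common(1)[0][0]
```

Word runs that contain a misrecognized word simply fail to match and cast no vote, so a few errors do not prevent the passage from being located.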
Specification