Hyper video: information retrieval using text from multimedia
First Claim
1. A method for finding documents which relate to a portion of a temporal document wherein the temporal document is video or audio material, comprising:
- (a) in response to a signal of interest at a particular time during the temporal document, identifying a portion of the temporal document for which related documents are to be found;
(b) selecting text associated with the portion of the temporal document identified;
(c) finding the related document by use of information retieval techniques as applied to the selected text, wherein the related documents are accessed through the Internet and are selected from a collection of documents according to scores associated with the documents, said scores based on a ratio between the number of documents in the collection and, for a term in the selected text, the number of documents in the collection containing the term; and
(d) selecting the related documents from among a collection of documents which may be accessed through the Internet, by utilizing databases comprising information about the collection, wherein a score SD of a document D in the collection may be determined by crediting the document D, for each term T in the temporal portion of the document identified which occurs in the document D, with an amount proportional to Robertson'"'"'s term frequency TFTD and to IDFT where
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed is a method and device for selecting documents, such as Web pages or sites, for presentation to a user, in response to a user expression of interest, during the course of presentation to the user of a document, such as a video or audio selection, whose content varies with time. The method takes advantage of information retrieval techniques to select documents related to the portion of the temporal document in which the user has expressed interest. The method generates the search query to use to select documents by reference to text associated with the portion of the temporal document in which the user has expressed interest, as by using the closed caption test associated with the video, or by using speech recognition techniques.
77 Citations
22 Claims
-
1. A method for finding documents which relate to a portion of a temporal document wherein the temporal document is video or audio material, comprising:
-
(a) in response to a signal of interest at a particular time during the temporal document, identifying a portion of the temporal document for which related documents are to be found;
(b) selecting text associated with the portion of the temporal document identified;
(c) finding the related document by use of information retieval techniques as applied to the selected text, wherein the related documents are accessed through the Internet and are selected from a collection of documents according to scores associated with the documents, said scores based on a ratio between the number of documents in the collection and, for a term in the selected text, the number of documents in the collection containing the term; and
(d) selecting the related documents from among a collection of documents which may be accessed through the Internet, by utilizing databases comprising information about the collection, wherein a score SD of a document D in the collection may be determined by crediting the document D, for each term T in the temporal portion of the document identified which occurs in the document D, with an amount proportional to Robertson'"'"'s term frequency TFTD and to IDFT where - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A device for finding documents which relate to a portion of a temporal document wherein the temporal document is video or audio material, comprising:
-
(a) means for identifying a portion of the temporal document for which related documents are to be found, in response to a signal of interest at a particular time during the temporal document;
(b) means for selecting text associated with the portion of the temporal document identified;
(c) means for finding thee related documents by use of information retrieval techniques as applied to the selected text, wherein the related documents are accessed through the Internet and are selected from a collection of documents according to scores associated with the documents, said scores based on a ratio between the number of documents in the collection and, for a term in the selected text, the number of documents in the collection containing the term;
(d) means for selecting the related documents from among a collection of documents which may be accessed through the Internet, by utilizing databases comprising information about the collection wherein a score SD of document D in the collection may be determined by crediting the document D, for each term T in the temporal portion of the document identified which occurs in the document D, with an amount proportional to Robertson'"'"'s term frequency TFTD and to IDFT where
TFTD=NTD/(NTD+K1+K2*(LD/L0)), andNTD is the number of times the term T occurs in document D, LD is the length of document D, L0 is the average length of a document in the collection of documents indexed, K1 and K2 are constants, and - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification