Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data
DC CAFCFirst Claim
1. A system for acquiring and reviewing a body of information, wherein the body of information includes a plurality of segments, each segment representing a defined set of information in the body of information, the system comprising:
- means for acquiring data representing the body of information;
means for storing the acquired data;
first display means for generating a display of a first segment of the body of information from data that is part of the stored data;
means for comparing data representing a segment of the body of information to data representing a different segment of the body of information to determine whether, according to one or more predetermined criteria, the compared segments are related; and
second display means for generating a display of a portion of, or a representation of, a second segment of the body of information from data that is part of the stored data, wherein the second display means displays the portion or representation of the second segment in response to the display by the first display means of a first segment to which the second segment is related.
3 Assignments
Litigations
0 Petitions
Reexaminations
Accused Products
Abstract
The invention facilitates and enhances review of a body of information (that can be represented by a set of audio data, video data, text data or some combination of the three), enabling the body of information to be quickly reviewed to obtain an overview of the content of the body of information and allowing flexibility in the manner in which the body of information is reviewed. In a particular application of the invention, the content of audiovisual news programs is acquired from a first set of one or more information sources (e.g., television news programs) and text news stories are acquired from a second set of one or more information sources (e.g., on-line news services or news wire services). In such a particular application, the invention can enable the user to access the news stories of audiovisual news programs in a random manner so that the user can move quickly among news stories or news programs. The invention can also enable the user to quickly locate news stories pertaining to a particular subject. Additionally, when the user is observing a particular news story in a news program, the invention can identify and display related news stories. The invention can also enable the user to control the display of the news programs by, for example, speeding up the display, causing a summary of one or more news stories to be displayed, or pausing the display of the news stories. Additionally, the invention can indicate to the user which news story is currently being viewed, as well as which news stories have previously been viewed.
-
Citations
129 Claims
-
1. A system for acquiring and reviewing a body of information, wherein the body of information includes a plurality of segments, each segment representing a defined set of information in the body of information, the system comprising:
-
means for acquiring data representing the body of information;
means for storing the acquired data;
first display means for generating a display of a first segment of the body of information from data that is part of the stored data;
means for comparing data representing a segment of the body of information to data representing a different segment of the body of information to determine whether, according to one or more predetermined criteria, the compared segments are related; and
second display means for generating a display of a portion of, or a representation of, a second segment of the body of information from data that is part of the stored data, wherein the second display means displays the portion or representation of the second segment in response to the display by the first display means of a first segment to which the second segment is related. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
at least a portion of the body of information is represented by audiovisual data;
the first segment is represented by audiovisual data;
the first display means displays an audiovisual display of the first segment; and
the second segment is represented by audiovisual data.
-
-
4. A system as in claim 3, further comprising means for selecting a segment for which a portion or representation is displayed by the second display means, wherein selection of such segment causes the first display means to display an audiovisual display of the selected segment.
-
5. A system as in claim 1, wherein:
-
at least a portion of the body of information is represented by audiovisual data;
the first display means displays an audiovisual display of the first segment; and
the second display means displays a text display of a portion or representation of the second segment.
-
-
6. A system as in claim 1, wherein:
-
the first display means is an analog display device; and
the second display means is a digital display device.
-
-
7. A system as in claim 1, wherein:
-
the first display means is a television; and
the second display means is a computer display monitor.
-
-
8. A system as in claim 1, further comprising means for identifying the subject matter content of a segment of the body of information, wherein the means for comparing further comprises means for determining the similarity of the subject matter content of a segment to the subject matter content of a different segment, the predetermined criteria including a predefined degree of similarity with respect to which the relatedness of the compared segments is determined.
-
9. A system as in claim 8, wherein the means for determining the similarity of the subject matter of segments further comprises means for performing a relevance feedback method.
-
10. A system as in claim 1, wherein the means for acquiring data further comprises means for acquiring television broadcast signals.
-
11. A system as in claim 1, wherein the means for acquiring data further comprises means for acquiring radio broadcast signals.
-
12. A system as in claim 1, wherein the means for acquiring data further comprises means for acquiring computer-readable data files over a computer network from an information providing site that is part of that network.
-
13. A system as in claim 1, wherein the means for acquiring data further comprises:
-
means for acquiring television broadcast signals; and
means for acquiring computer-readable data files over a computer network from an information providing site that is part of that network.
-
-
14. A system as in claim 13, wherein:
-
the first segment is represented by data produced from the television broadcast signals; and
the second segment is represented by data from the computer-readable data files.
-
-
15. A system as in claim 1, further comprising means for identifying an instruction from a user to begin displaying at least some of the body of information, wherein the first display means begins displaying a segment in response to the user instruction.
-
16. A system as in claim 1, wherein the first and second display means are physically separate.
-
17. A system as in claim 1, wherein the means for storing the acquired data, the first display means and the second display means are interconnected to a conventional computer bus that enables the devices to communicate with each other such that the devices do not require wire communication over network communication lines to communicate with each other.
-
18. A system as in claim 1, wherein at least some of the acquired data is digital data, the means for acquiring data further comprising means for acquiring digital data.
-
19. A system as in claim 1, wherein at least some of the acquired data is analog data, the means for acquiring data further comprising means for acquiring analog data.
-
20. A method for acquiring and reviewing a body of information, wherein the body of information includes a plurality of segments, each segment representing a defined set of information in the body of information, the method comprising the steps of:
-
acquiring data representing the body of information;
storing the acquired data;
generating a display of a first segment of the body of information from data that is part of the stored data;
comparing data representing a segment of the body of information to data representing a different segment of the body of information to determine whether, according to one or more predetermined criteria, the compared segments are related; and
generating a display of a portion of, or a representation of, a second segment of the body of information from data that is part of the stored data, wherein the display of the portion or representation of the second segment is generated in response to the display of a first segment to which the second segment is related. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
the step of acquiring data representing the body of information further comprises the step of acquiring audiovisual data representing at least a portion of the body of information, wherein the first and second segments are represented by audiovisual data; and
the step of generating a display of a first segment of the body of information further comprises the step of generating an audiovisual display of the first segment.
-
-
23. A method as in claim 22, further comprising the step of identifying the selection of a second segment for which a portion or representation is being displayed, wherein selection of such second segment causes an audiovisual display of the selected second segment to be produced.
-
24. A method as in claim 20, wherein:
-
the step of acquiring data representing the body of information further comprises the step of acquiring audiovisual data representing at least a portion of the body of information;
the step of generating a display of a first segment of the body of information further comprises the step of generating an audiovisual display of the first segment; and
the step of generating a display of a portion of, or a representation of, a second segment of the body of information further comprises the step of generating a text display of the portion or representation of the second segment.
-
-
25. A method as in claim 20, wherein:
-
the step of generating a display of a first segment of the body of information further comprises the step of generating a display of the first segment on an analog display device; and
the step of generating a display of a portion of, or a representation of, a second segment of the body of information further comprises the step of generating a display of the portion or representation of the second segment on a digital display device.
-
-
26. A method as in claim 20, wherein:
-
the step of generating a display of the first segment on an analog display device further comprises the step of generating a display of the first segment on a television; and
the step of generating a display of the portion or representation of the second segment on a digital display device further comprises the step of generating a display of the portion or representation of the second segment on a computer display monitor.
-
-
27. A method as in claim 20, further comprising the step of identifying the subject matter content of a segment of the body of information, wherein the step of comparing further comprises the step of determining the similarity of the subject matter content of a segment to the subject matter content of a different segment, the predetermined criteria including a predefined degree of similarity with respect to which the relatedness of the compared segments is determined.
-
28. A method as in claim 27, wherein the step of determining the similarity of the subject matter of segments further comprises the step of performing a relevance feedback method.
-
29. A method as in claim 20, wherein the step of acquiring data further comprises the step of acquiring television broadcast signals.
-
30. A method as in claim 20, wherein the step of acquiring data further comprises the step of acquiring radio broadcast signals.
-
31. A method as in claim 20, wherein the step of acquiring data further comprises the step of acquiring computer-readable data files over a computer network from an information providing site that is part of that network.
-
32. A method as in claim 20, wherein the step of acquiring data further comprises the steps of:
-
acquiring television broadcast signals; and
acquiring computer-readable data files over a computer network from an information providing site that is part of that network.
-
-
33. A method as in claim 32, wherein:
-
the first segment is represented by data produced from the television broadcast signals; and
the second segment is represented by data from the computer-readable data files.
-
-
34. A method as in claim 20, further comprising the step of identifying an instruction from a user to begin displaying at least some of the body of information, wherein the display of a first segment is begun in response to the user instruction.
-
35. A method as in claim 20, wherein the first and second segments are displayed on physically separate display devices.
-
36. A method as in claim 20, wherein the steps of storing the acquired data, generating a display of a first segment of the body of information, and generating a display of a portion of, or a representation of, a second segment of the body of information are performed by devices interconnected to a conventional computer bus that enables the devices to communicate with each other such that the devices do not require wire communication over network communication lines to communicate with each other.
-
37. A method as in claim 20, wherein at least some of the acquired data is digital data, the step of acquiring data further comprising the step of acquiring digital data.
-
38. A method as in claim 20, wherein at least some of the acquired data is analog data, the step of acquiring data further comprising the step of acquiring analog data.
-
39. A method for categorizing according to subject matter an uncategorized segment of a body of information that includes a plurality of segments, each segment representing a defined set of information in the body of information, one or more segments of the body of information having previously been categorized by identifying each of the one or more segments with one or more subject matter categories, the method comprising the steps of:
-
determining the degree of similarity between the subject matter content of the uncategorized segment and the subject matter content of each of the previously categorized segments;
identifying one or more of the previously categorized segments as relevant to the uncategorized segment based upon the determined degrees of similarity of subject matter content between the uncategorized segment and the previously categorized segments; and
selecting one or more subject matter categories with which to identify the uncategorized segment based upon the subject matter categories used to identify the relevant previously categorized segments. - View Dependent Claims (40, 41, 42, 43, 44)
identifying a plurality of the previously categorized segments that are the most similar to the uncategorized segment;
determining the degree of similarity between each of the plurality of previously categorized segments and each other of the plurality of previously categorized segments;
for each pair of previously categorized segments of the plurality of previously categorized segments having greater than a predefined degree of similarity, eliminating one of the pair of previously categorized segments from the plurality of previously categorized segments, wherein the previously categorized segment or segments remaining after the step of eliminating are similar and distinct previously categorized segments; and
identifying one or more of the similar and distinct previously categorized segments as relevant previously categorized segments.
-
-
42. A method as in claim 39, wherein the step of selecting one or more subject matter categories further comprises selecting the most frequently occurring subject matter category or categories associated with the relevant previously categorized segments.
-
43. A method as in claim 39, wherein the uncategorized segment has been acquired from a first data source and the previously categorized segment or segments have been acquired from a second data source that is different than the first data source.
-
44. A method as in claim 43, wherein:
-
the data acquired from the first data source are television or radio broadcast signals; and
the data acquired from the second data source are computer-readable data files.
-
-
45. A method for determining whether a first set of information represented by a set of data of a first type is relevant to a second set of information represented by a set of data of a second type, the first and second sets of information being different from each other, the method comprising the steps of:
-
deriving a set of data of the second type from the set of data of the first type, the derived set of data of the second type also being representative of the first set of information;
determining the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information; and
determining whether the first set of information is relevant to the second set of information based upon the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information. - View Dependent Claims (46, 47, 48, 49, 50)
determining the degree of similarity between each set of data of the second type representing one of the plurality of sets of information and the derived set of data of the second type representing the first set of information;
identifying which, if any, of the sets of data of the second type representing one of the plurality of sets of information have greater than a predefined degree of similarity to the derived set of data of the second type representing the first set of information, the sets of data of the second type so identified being termed similar sets of data of the second type;
determining the degree of similarity between each similar set of data of the second type and each other similar set of data of the second type;
for each pair of similar sets of data of the second type having greater than a predefined degree of similarity, eliminating one of the pair of similar sets of data of the second type from the set of similar sets of data of the second type, wherein the set or sets of similar data of the second type remaining after the step of eliminating are similar and distinct sets of data of the second type; and
identifying the set or sets of information corresponding to one or more of the similar and distinct sets of data of the second type as relevant to the second set of information.
-
-
49. A method as in claim 48, wherein the step of identifying the relevant set or sets of information further comprises identifying no more than a predetermined number of relevant sets of information, the predetermined number of relevant sets of information corresponding to the sets of data of the second type having the greatest degree of similarity to the derived set of data of the second type.
-
50. A method as in claim 45, wherein the first type of data is analog data and the second type of data is digital data.
-
51. A method for identifying the boundaries of segments in a body of information, each segment comprising a contiguous related set of information in the body of information, wherein the body of information is represented by at least a set of text data and a set of video data, the method comprising the steps of:
-
performing a coarse partitioning method, the coarse partitioning method further comprising the steps of;
identifying time-stamped markers in the set of text data; and
determining approximate segment boundaries within the body of information as the times of occurrence of the time-stamp markers;
for each approximate segment boundary, specifying a range of time that includes the time of occurrence of the approximate segment boundary;
extracting subsets of video data from the set of video data that occur during the specified ranges of time;
performing a fine partitioning method to identify one or more breaks in the set of video data; and
selecting the best break that occurs in each subset of video data, the time of occurrence of the best break in each subset being designated as a boundary of a segment in the body of information. - View Dependent Claims (52, 53, 54, 55, 56, 57)
ascertaining a synchronization of the audio data and the video data; and
determining the location of the segment boundaries in the set of audio data using the previously determined location of the segment boundaries in the set of video data and the synchronization of the audio data and video data.
-
-
58. A method for identifying the boundaries of segments in a body of information, each segment comprising a contiguous related set of information in the body of information, wherein the body of information is represented by a set of text data, a set of video data, and a set of audio data, the method comprising the steps of:
-
performing a coarse partitioning method, the coarse partitioning method further comprising the steps of;
identifying time-stamped markers in the set of text data; and
determining approximate segment boundaries within the body of information as the times of occurrence of the time-stamp markers;
for each approximate segment boundary, specifying a range of time that includes the time of occurrence of the approximate segment boundary;
extracting subsets of audio data from the set of audio data that occur during the specified ranges of time;
performing a fine partitioning method to identify one or more breaks in the set of audio data;
selecting the best break that occurs in each subset of audio data, the time of occurrence of the best break in each subset being designated as a boundary of a segment in the body of information;
ascertaining a synchronization of the audio data and the video data; and
determining the location of the segment boundaries in the set of video data using the previously determined location of the segment boundaries in the set of audio data and the synchronization of the audio data and video data. - View Dependent Claims (59, 60, 61, 62)
-
-
63. A computer readable medium encoded with one or more computer programs for enabling acquisition and review of a body of information, wherein the body of information includes a plurality of segments, each segment representing a defined set of information in the body of information, comprising:
-
instructions for acquiring data representing the body of information;
instructions for storing the acquired data;
instructions for generating a display of a first segment of the body of information from data that is part of the stored data;
instructions for comparing data representing a segment of the body of information to data representing a different segment of the body of information to determine whether, according to one or more predetermined criteria, the compared segments are related; and
instructions for generating a display of a portion of, or a representation of, a second segment of the body of information from data that is part of the stored data, wherein the display of the portion or representation of the second segment is generated in response to the display of a first segment to which the second segment is related. - View Dependent Claims (64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81)
the instructions for acquiring data representing the body of information further comprise instructions for acquiring audiovisual data representing at least a portion of the body of information, wherein the first and second segments are represented by audiovisual data; and
the instructions for generating a display of a first segment of the body of information further comprise instruction for generating an audiovisual display of the first segment.
-
-
66. A computer readable medium as in claim 65, further comprising instructions for identifying the selection of a second segment for which a portion or representation is being displayed, wherein selection of such second segment causes an audiovisual display of the selected second segment to be produced.
-
67. A computer readable medium as in claim 63, wherein:
-
the instructions for acquiring data representing the body of information further comprise instructions for acquiring audiovisual data representing at least a portion of the body of information;
the instructions for generating a display of a first segment of the body of information further comprise instructions for generating an audiovisual display of the first segment; and
the instructions for generating a display of a portion of, or a representation of, a second segment of the body of information further comprise instructions for generating a text display of the portion or representation of the second segment.
-
-
68. A computer readable medium as in claim 63, wherein:
-
the instructions for generating a display of a first segment of the body of information further comprise instructions for generating a display of the first segment on an analog display device; and
the instructions for generating a display of a portion of, or a representation of, a second segment of the body of information further comprise instructions for generating a display of the portion or representation of the second segment on a digital display device.
-
-
69. A computer readable medium as in claim 63, wherein:
-
the instructions for generating a display of the first segment on an analog display device further comprise instructions for generating a display of the first segment on a television; and
the instructions for generating a display of the portion or representation of the second segment on a digital display device further comprise instructions for generating a display of the portion or representation of the second segment on a computer display monitor.
-
-
70. A computer readable medium as in claim 63, further comprising instructions for identifying the subject matter content of a segment of the body of information, wherein the instructions for comparing further comprise instructions for determining the similarity of the subject matter content of a segment to the subject matter content of a different segment, the predetermined criteria including a predefined degree of similarity with respect to which the relatedness of the compared segments is determined.
-
71. A computer readable medium as in claim 70, wherein the instructions for determining the similarity of the subject matter of segments further comprise instructions for performing a relevance feedback method.
-
72. A computer readable medium as in claim 63, wherein the instructions for acquiring data further comprise instructions for acquiring television broadcast signals.
-
73. A computer readable medium as in claim 63, wherein the instructions for acquiring data further comprise instructions for acquiring radio broadcast signals.
-
74. A computer readable medium as in claim 63, wherein the instructions for acquiring data further comprise instructions for acquiring computer-readable data files over a computer network from an information providing site that is part of that network.
-
75. A computer readable medium as in claim 63, wherein the instructions for acquiring data further comprise:
-
instructions for acquiring television broadcast signals; and
instructions for acquiring computer-readable data files over a computer network from an information providing site that is part of that network.
-
-
76. A computer readable medium as in claim 75, wherein:
-
the first segment is represented by data produced from the television broadcast signals; and
the second segment is represented by data from the computer-readable data files.
-
-
77. A computer readable medium as in claim 63, further comprising instructions for identifying an instruction from a user to begin displaying at least some of the body of information, wherein the display of a first segment is begun in response to the user instruction.
-
78. A computer readable medium as in claim 63, wherein the first and second segments are displayed on physically separate display devices.
-
79. A computer readable medium as in claim 63, wherein the instructions for storing the acquired data, generating a display of a first segment of the body of information, and generating a display of a portion of, or a representation of, a second segment of the body of information are executed by devices interconnected to a conventional computer bus that enables the devices to communicate with each other such that the devices do not require wire communication over network communication lines to communicate with each other.
-
80. A computer readable medium as in claim 63, wherein at least some of the acquired data is digital data, the instructions for acquiring data further comprising instructions for acquiring digital data.
-
81. A computer readable medium as in claim 63, wherein at least some of the acquired data is analog data, the instructions for acquiring data further comprising instructions for acquiring analog data.
-
82. A computer readable medium encoded with one or more computer programs for enabling categorization according to subject matter of an uncategorized segment of a body of information that includes a plurality of segments, each segment representing a defined set of information in the body of information, one or more segments having previously been categorized by identifying each of the one or more segments with one or more subject matter categories, comprising:
-
instructions for determining the degree of similarity between the subject matter content of the uncategorized segment and the subject matter content of each of the previously categorized segments;
instructions for identifying one or more of the previously categorized segments as relevant to the uncategorized segment based upon the determined degrees of similarity of subject matter content between the uncategorized segment and the previously categorized segments; and
instructions for selecting one or more subject matter categories with which to identify the uncategorized segment based upon the subject matter categories used to identify the relevant previously categorized segments. - View Dependent Claims (83, 84, 85, 86, 87)
instructions for identifying a plurality of the previously categorized segments that are the most similar to the uncategorized segment;
instructions for determining the degree of similarity between each of the plurality of previously categorized segments and each other of the plurality of previously categorized segments;
instructions for eliminating, for each pair of previously categorized segments of the plurality of previously categorized segments having greater than a predefined degree of similarity, one of the pair of previously categorized segments from the plurality of previously categorized segments, wherein the remaining previously categorized segment or segments are similar and distinct previously categorized segments; and
instructions for identifying one or more of the similar and distinct previously categorized segments as relevant previously categorized segments.
-
-
85. A computer readable medium as in claim 82, wherein the instructions for selecting one or more subject matter categories further comprise instructions for selecting the most frequently occurring subject matter category or categories associated with the relevant previously categorized segments.
-
86. A computer readable medium as in claim 82, wherein the uncategorized segment has been acquired from a first data source and the previously categorized segment or segments have been acquired from a second data source that is different than the first data source.
-
87. A computer readable medium as in claim 86, wherein:
-
the data acquired from the first data source are television or radio broadcast signals; and
the data acquired from the second data source are computer-readable data files.
-
-
88. A computer readable medium encoded with one or more computer programs for enabling determination of whether a first set of information represented by a set of data of a first type is relevant to a second set of information represented by a set of data of a second type, the first and second sets of information being different from each other, comprising:
-
instructions for deriving a set of data of the second type from the set of data of the first type, the derived set of data of the second type also being representative of the first set of information;
instructions for determining the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information; and
instructions for determining whether the first set of information is relevant to the second set of information based upon the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information. - View Dependent Claims (89, 90, 91, 92, 93)
instructions for determining the degree of similarity between each set of data of the second type representing one of the plurality of sets of information and the derived set of data of the second type representing the first set of information;
instructions for identifying which, if any, of the sets of data of the second type representing one of the plurality of sets of information have greater than a predefined degree of similarity to the derived set of data of the second type representing the first set of information, the sets of data of the second type so identified being termed similar sets of data of the second type;
instructions for determining the degree of similarity between each similar set of data of the second type and each other similar set of data of the second type;
instructions for eliminating, for each pair of similar sets of data of the second type having greater than a predefined degree of similarity, one of the pair of similar sets of data of the second type from the set of similar sets of data of the second type, wherein the remaining set or sets of similar data of the second type are similar and distinct sets of data of the second type; and
instructions for identifying the set or sets of information corresponding to one or more of the similar and distinct sets of data of the second type as relevant to the second set of information.
-
-
92. A computer readable medium as in claim 91, wherein the instructions for identifying the relevant set or sets of information further comprise instructions for identifying no more than a predetermined number of relevant sets of information, the predetermined number of relevant sets of information corresponding to the sets of data of the second type having the greatest degree of similarity to the derived set of data of the second type.
-
93. A computer readable medium as in claim 88, wherein the first type of data is analog data and the second type of data is digital data.
-
94. A computer readable medium encoded with one or more computer programs for enabling identification of the boundaries of segments in a body of information, each segment comprising a contiguous related set of information in the body of information, wherein the body of information is represented by at least a set of text data and a set of video data, comprising:
-
instructions for performing a coarse partitioning method, the coarse partitioning instructions further comprising;
instructions for identifying time-stamped markers in the set of text data; and
instructions for determining approximate segment boundaries within the body of information as the times of occurrence of the time-stamp markers;
instructions for specifying, for each approximate segment boundary, a range of time that includes the time of occurrence of the approximate segment boundary;
instructions for extracting subsets of video data from the set of video data that occur during the specified ranges of time;
instructions for performing a fine partitioning method to identify one or more breaks in the set of video data; and
instructions for selecting the best break that occurs in each subset of video data, the time of occurrence of the best break in each subset being designated as a boundary of a segment in the body of information. - View Dependent Claims (95, 96, 97, 98, 99, 100)
instructions for ascertaining a synchronization of the audio data and the video data; and
instructions for determining the location of the segment boundaries in the set of audio data using the previously determined location of the segment boundaries in the set of video data and the synchronization of the audio data and video data.
-
-
101. A system for categorizing according to subject matter an uncategorized segment of a body of information that includes a plurality of segments, each segment representing a defined set of information in the body of information, one or more segments of the body of information having previously been categorized by identifying each of the one or more segments with one or more subject matter categories, the system comprising:
-
means for determining the degree of similarity between the subject matter content of the uncategorized segment and the subject matter content of each of the previously categorized segments;
means for identifying one or more of the previously categorized segments as relevant to the uncategorized segment based upon the determined degrees of similarity of subject matter content between the uncategorized segment and the previously categorized segments; and
means for selecting one or more subject matter categories with which to identify the uncategorized segment based upon the subject matter categories used to identify the relevant previously categorized segments. - View Dependent Claims (102, 103, 104, 105, 106)
means for identifying a plurality of the previously categorized segments that are the most similar to the uncategorized segment;
means for determining the degree of similarity between each of the plurality of previously categorized segments and each other of the plurality of previously categorized segments;
means for eliminating, for each pair of previously categorized segments of the plurality of previously categorized segments having greater than a predefined degree of similarity, one of the pair of previously categorized segments from the plurality of previously categorized segments, wherein the remaining previously categorized segment or segments are similar and distinct previously categorized segments; and
means for identifying one or more of the similar and distinct previously categorized segments as relevant previously categorized segments.
-
-
104. A system as in claim 101, wherein the means for selecting one or more subject matter categories further comprises means for selecting the most frequently occurring subject matter category or categories associated with the relevant previously categorized segments.
-
105. A system as in claim 101, wherein the uncategorized segment has been acquired from a first data source and the previously categorized segment or segments have been acquired from a second data source that is different than the first data source.
-
106. A system as in claim 105, wherein:
-
the data acquired from the first data source are television or radio broadcast signals; and
the data acquired from the second data source are computer-readable data files.
-
-
107. A system for determining whether a first set of information represented by a set of data of a first type is relevant to a second set of information represented by a set of data of a second type, the first and second sets of information being different from each other, the system comprising:
-
means for deriving a set of data of the second type from the set of data of the first type, the derived set of data of the second type also being representative of the first set of information;
means for determining the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information; and
means for determining whether the first set of information is relevant to the second set of information based upon the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information. - View Dependent Claims (108, 109, 110, 111, 112)
means for determining the degree of similarity between each set of data of the second type representing one of the plurality of sets of information and the derived set of data of the second type representing the first set of information;
means for identifying which, if any, of the sets of data of the second type representing one of the plurality of sets of information have greater than a predefined degree of similarity to the derived set of data of the second type representing the first set of information, the sets of data of the second type so identified being termed similar sets of data of the second type;
means for determining the degree of similarity between each similar set of data of the second type and each other similar set of data of the second type;
means for eliminating, for each pair of similar sets of data of the second type having greater than a predefined degree of similarity, one of the pair of similar sets of data of the second type from the set of similar sets of data of the second type, wherein the remaining set or sets of similar data of the second type are similar and distinct sets of data of the second type; and
means for identifying the set or sets of information corresponding to one or more of the similar and distinct sets of data of the second type as relevant to the second set of information.
-
-
111. A system as in claim 110, wherein the means for identifying the relevant set or sets of information further comprises means for identifying no more than a predetermined number of relevant sets of information, the predetermined number of relevant sets of information corresponding to the sets of data of the second type having the greatest degree of similarity to the derived set of data of the second type.
-
112. A system as in claim 107, wherein the first type of data is analog data and the second type of data is digital data.
-
113. A computer readable medium encoded with one or more computer programs for identifying the boundaries of segments in a body of information, each segment comprising a contiguous related set of information in the body of information, wherein the body of information is represented by a set of text data, a set of video data, and a set of audio data, comprising:
-
instructions for performing a coarse partitioning method, the instructions for performing a coarse partitioning method further comprising;
instructions for identifying time-stamped markers in the set of text data; and
instructions for determining approximate segment boundaries within the body of information as the times of occurrence of the time-stamp markers;
instructions for specifying, for each approximate segment boundary, a range of time that includes the time of occurrence of the approximate segment boundary;
instructions for extracting subsets of audio data from the set of audio data that occur during the specified ranges of time;
instructions for performing a fine partitioning method to identify one or more breaks in the set of audio data;
instructions for selecting the best break that occurs in each subset of audio data, the time of occurrence of the best break in each subset being designated as a boundary of a segment in the body of information;
instructions for ascertaining a synchronization of the audio data and the video data; and
instructions for determining the location of the segment boundaries in the set of video data using the previously determined location of the segment boundaries in the set of audio data and the synchronization of the audio data and video data. - View Dependent Claims (114, 115, 116, 117)
-
-
118. A system for identifying the boundaries of segments in a body of information, each segment comprising a contiguous related set of information in the body of information, wherein the body of information is represented by at least a set of text data and a set of video data, the system comprising:
-
means for performing a coarse partitioning method, the means for performing a coarse partitioning method further comprising;
means for identifying time-stamped markers in the set of text data; and
means for determining approximate segment boundaries within the body of information as the times of occurrence of the time-stamp markers;
means for specifying, for each approximate segment boundary, a range of time that includes the time of occurrence of the approximate segment boundary;
means for extracting subsets of video data from the set of video data that occur during the specified ranges of time;
means for performing a fine partitioning method to identify one or more breaks in the set of video data; and
means for selecting the best break that occurs in each subset of video data, the time of occurrence of the best break in each subset being designated as a boundary of a segment in the body of information. - View Dependent Claims (119, 120, 121, 122, 123, 124)
means for ascertaining a synchronization of the audio data and the video data; and
means for determining the location of the segment boundaries in the set of audio data using the previously determined location of the segment boundaries in the set of video data and the synchronization of the audio data and video data.
-
-
125. A system for identifying the boundaries of segments in a body of information, each segment comprising a contiguous related set of information in the body of information, wherein the body of information is represented by a set of text data, a set of video data, and a set of audio data, the system comprising:
-
means for performing a coarse partitioning method, the means for performing a coarse partitioning method further comprising;
means for identifying time-stamped markers in the set of text data; and
means for determining approximate segment boundaries within the body of information as the times of occurrence of the time-stamp markers;
means for specifying, for each approximate segment boundary, a range of time that includes the time of occurrence of the approximate segment boundary;
means for extracting subsets of audio data from the set of audio data that occur during the specified ranges of time;
means for performing a fine partitioning method to identify one or more breaks in the set of audio data;
means for selecting the best break that occurs in each subset of audio data, the time of occurrence of the best break in each subset being designated as a boundary of a segment in the body of information;
means for ascertaining a synchronization of the audio data and the video data; and
means for determining the location of the segment boundaries in the set of video data using the previously determined location of the segment boundaries in the set of audio data and the synchronization of the audio data and video data. - View Dependent Claims (126, 127, 128, 129)
-
Specification