Method and system for information discovery and text analysis
First Claim
1. A method for searching one or more text sources including temporally-ordered data objects, the method comprising the steps of:
- providing access to the one or more text sources, each text source including one or more temporally-ordered data objects;
obtaining or generating a search query based on one or more search tokens and one or more time intervals;
obtaining or generating time data associated with the temporally-ordered data objects included in the one or more text sources;
identifying one or more data objects related to the search query in the one or more text sources;
generating one or more popularity curves based on frequency of the identified data objects corresponding to one or more of the search tokens in the one or more time intervals and the time data, the popularity curve exhibiting fluctuations of popularity over the one or more time intervals;
identifying a burst in the one or more popularity curves, the burst occurring in one or more burst time intervals;
identifying data objects related to the search query in the one or more burst time intervals as burst data objects;
for each of the burst data objects, assigning an authoritative index to the each burst data objects, the authoritative index being computed from a frequency of content related to the search query contributed by an author of the each burst data object;
ranking the burst data objects according to the authoritative index to obtain the top-i ranked burst data objects, i being an integer; and
providing the top-i ranked burst data objects as authoritative data objects for user selection and viewing;
wherein each data object comprises at least one of a blog, document, posting, article, email and message.
6 Assignments
0 Petitions
Accused Products
Abstract
A method for searching text sources including temporally-ordered data objects, such as a blog, is provided including the steps of: (i) providing access to text sources, each text source including temporally-ordered data objects; (ii) obtaining or generating a search query based on terms and time intervals; (iii) obtaining or generating time data associated with the data objects; (iv) identifying data objects based on the search query; and (v) generating popularity curves based on the frequency of data objects corresponding to one or more of the search terms in the one or more time intervals. A system and computer program for text source searching is also provided.
-
Citations
22 Claims
-
1. A method for searching one or more text sources including temporally-ordered data objects, the method comprising the steps of:
-
providing access to the one or more text sources, each text source including one or more temporally-ordered data objects; obtaining or generating a search query based on one or more search tokens and one or more time intervals; obtaining or generating time data associated with the temporally-ordered data objects included in the one or more text sources; identifying one or more data objects related to the search query in the one or more text sources; generating one or more popularity curves based on frequency of the identified data objects corresponding to one or more of the search tokens in the one or more time intervals and the time data, the popularity curve exhibiting fluctuations of popularity over the one or more time intervals; identifying a burst in the one or more popularity curves, the burst occurring in one or more burst time intervals; identifying data objects related to the search query in the one or more burst time intervals as burst data objects; for each of the burst data objects, assigning an authoritative index to the each burst data objects, the authoritative index being computed from a frequency of content related to the search query contributed by an author of the each burst data object; ranking the burst data objects according to the authoritative index to obtain the top-i ranked burst data objects, i being an integer; and providing the top-i ranked burst data objects as authoritative data objects for user selection and viewing; wherein each data object comprises at least one of a blog, document, posting, article, email and message. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for searching one or more text sources including temporally-ordered data objects, the system comprising:
-
a computer connected to the one or more text sources; and a search term definition utility linked to the computer or executing on the computer; the computer and the search term definition utility being configured to cooperate with each other and being operable to; provide access to the one or more text sources, each text source including one or more temporally-ordered data objects; obtain or generate a search query based on one or more search tokens and one or more time intervals; obtain or generate time data associated with the temporally-ordered data objects included in the one or more text sources; identify one or more data objects related to the search query in the one or more text sources; generate one or more popularity curves based on the frequency of the identified data objects corresponding to one or more of the search tokens in the one or more time intervals and the time data; identify a burst in the one or more popularity curves, the burst occurring in one or more burst time intervals; identify data objects related to the search query in the one or more burst time intervals as burst data objects; for each of the burst data objects, assign an authoritative index to the each burst data objects, the authoritative index being computed from a frequency of content related to the search query contributed by an author of the each burst data object; rank the burst data objects according to the authoritative index to obtain the top-i ranked burst data objects, i being an integer; and provide the top-i ranked burst data objects as authoritative data objects for user selection and viewing; wherein each data object comprises at least one of a blog, document, posting, article, email and message. - View Dependent Claims (12)
-
-
13. A computer software product for use on a computer system, the computer software product comprising:
-
a computer readable non-transitory storage medium, computer program code means stored on the computer readable non-transitory storage medium, the computer program code means comprising encoded instructions, wherein the encoded instructions comprise; providing access to the one or more text sources, each text source including one or more temporally-ordered data objects; obtaining or generating a search query based on one or more search tokens and one or more time intervals; obtaining or generating time data associated with the temporally-ordered data objects included in the one or more text sources; identifying one or more data objects related to the search query in the one or more text sources; generating one or more popularity curves based on frequency of the identified data objects corresponding to one or more of the search tokens in the one or more time intervals and the time data, the popularity curve exhibiting fluctuations of popularity over the one or more time intervals; identifying a burst in the one or more popularity curves, the burst occurring in one or more burst time intervals; identifying data objects related to the search query in the one or more burst time intervals as burst data objects; for each of the burst data objects, assigning an authoritative index to the each burst data objects, the authoritative index being computed from a frequency of content related to the search query contributed by an author of the each burst data object; ranking the burst data objects according to the authoritative index to obtain the top-i ranked burst data objects, i being an integer; and providing the top-i ranked burst data objects as authoritative data objects for user selection and viewing; wherein each data object comprises at least one of a blog, document, posting, article, email and message. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification