System and method for aggregating web feeds relevant to a geographical locale from multiple sources
First Claim
1. A method of collecting feeds relevant to a geographical locale from multiple sources to be electronically published to a user through a user device, comprising:
- receiving a web feed having source metadata that includes a geographic location identifier, and a plurality of tags including title data, summary data, and text data and in a source format;
generating mapping information for mapping the tags in the source format to a normalized data structure having a title field, a summary field, and a text field, the mapping information generated from (i) a probabilistic model using stored statistical information to assign the plurality of tags to the title field, the summary field and the text field of the normalized data structure, and (ii) heuristic data that assigns the plurality of tags to the title field, the summary field and the text field of the normalized data structure based on a size of each of the plurality of tags;
parsing the web feed according to the mapping information generate a normalized web feed from the web feed, the normalized web feed having the normalized data structure specified by the mapping information;
determining a relevant geographical locale for the normalized web feed by parsing text data in the text field of the normalized web feed to detect location keywords therein;
when the relevant geographic locale differs from the geographic location identifier in the source metadata, identifying a geographic location for the normalized web feed based exclusively on the location keywords;
performing filtering on the normalized web feed to determine whether the normalized web feed includes qualified content for publication; and
publishing the qualified content to be accessible to the user on the user device through a user interface on which the qualified content is depicted as being relevant to the geographical location.
12 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for aggregating web feeds relevant to a geographical locale from multiple sources are disclosed. One embodiment includes, aggregating a web feed using identifier information included in source metadata, parsing the web feed according to mapping information in the source metadata to generate a normalized web feed from the web feed having a data structure specified by the mapping information, determining the geographical locale that is relevant to the web feed, performing filtering on the normalized web feed to determine whether the normalized web feed includes qualified content for publication, and/or publishing the qualified content to be accessible to the user on the user device through a user interface on which the qualified content is depicted as being relevant to the geographical locale.
113 Citations
37 Claims
-
1. A method of collecting feeds relevant to a geographical locale from multiple sources to be electronically published to a user through a user device, comprising:
-
receiving a web feed having source metadata that includes a geographic location identifier, and a plurality of tags including title data, summary data, and text data and in a source format; generating mapping information for mapping the tags in the source format to a normalized data structure having a title field, a summary field, and a text field, the mapping information generated from (i) a probabilistic model using stored statistical information to assign the plurality of tags to the title field, the summary field and the text field of the normalized data structure, and (ii) heuristic data that assigns the plurality of tags to the title field, the summary field and the text field of the normalized data structure based on a size of each of the plurality of tags; parsing the web feed according to the mapping information generate a normalized web feed from the web feed, the normalized web feed having the normalized data structure specified by the mapping information; determining a relevant geographical locale for the normalized web feed by parsing text data in the text field of the normalized web feed to detect location keywords therein; when the relevant geographic locale differs from the geographic location identifier in the source metadata, identifying a geographic location for the normalized web feed based exclusively on the location keywords; performing filtering on the normalized web feed to determine whether the normalized web feed includes qualified content for publication; and publishing the qualified content to be accessible to the user on the user device through a user interface on which the qualified content is depicted as being relevant to the geographical location. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method performed by a computer in communication with a machine-readable storage medium having stored thereon a set of instructions which when executed, causes a processor of the computer to identify and publish web feeds relevant to a geographical locale collected from multiple sources, the method comprising:
-
receiving a web feed having source metadata that includes a geographic location identifier, and a plurality of tags including title data, summary data, and text data and in a source format; generating mapping information for mapping the tags in the source format to a normalized data structure having a title field, a summary field, and a text field, the mapping information generated from (i) a probabilistic model using stored statistical information to assign the plurality of tags to the title field, the summary field and the text field of the normalized data structure, and (ii) heuristic data that assigns the plurality of tags to the title field, the summary field and the text field of the normalized data structure based on a size of each of the plurality of tags; parsing the web feed according to the mapping information, the source metadata and the plurality of tags; creating a normalized web feed from the web feed, the normalized web feed having the normalized data structure specified by the mapping information; determining a relevant geographic locale corresponding to the normalized web feed by parsing text data in the text field of the normalized web feed to detect location keywords therein when the relevant geographic locale differs from the geographic location identifier in the source metadata, identifying the geographic location for the normalized web feed based exclusively on the location keywords; publishing valid content in the normalized web feed such that the valid content is accessible to the user through a user interface presented on a user computing device on which the qualified content is depicted as being associated with the geographical location; determining trending data of the valid content; and prioritizing placement of the valid content in the user interface first according to the geographic location and then according to the trending data. - View Dependent Claims (21, 22)
-
-
23. A system for publishing web feeds relevant to a geographical locale collected from multiple sources, the system, comprising:
-
one or more client devices connected via a network to a host server; a listing repository to receive and store web feeds, each web feed having source metadata that includes a geographic location identifier, and a plurality of tags including title data, summary data, and text data and in a source format; an aggregator engine, coupled to the listing repository, for aggregating the web feeds using the geographic location identifier in the source metadata; a normalization module in the aggregator engine for generating mapping information for mapping the tags in the source format to a normalized data structure having a title field, a summary field, and a text field, the mapping information generated from (i) a probabilistic model using stored statistical information to assign the plurality of tags to the title field, the summary field and the text field of the normalized data structure, and (ii) heuristic data that assigns the plurality of tags to the title field, the summary field and the text field of the normalized data structure based on a size of each of the plurality of tags; and
the normalization module further for parsing the web feeds to generate normalized web feeds having the a normalized data structure;a parser for determining a relevant geographic locale corresponding to the normalized web feed by parsing text data in the text field of the normalized web feeds to detect location keywords therein and when the relevant geographic locale differs from the geographic location identifier in the source metadata, identifying the geographic location for the normalized web feeds based exclusively on the location keywords; a filter module in the aggregator engine that filters the normalized web feeds to determine whether the normalized web feeds include qualified stories for publication; multiple content repositories each coupled to the aggregator engine for storing the qualified stories; and a publish server coupled to the multiple content repositories for publishing the qualified stories accessible to a user through a user interface on which the qualified content is depicted as being relevant to the geographical location. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30)
-
-
31. A method for collecting feeds relevant to a geographical locale from multiple sources to be electronically published to a user through a user device, comprising:
-
receiving a web feed having source metadata that includes a geographic location identifier, and a plurality of tags including title data, summary data, and text data and in a source format; generating mapping information for mapping the tags in the source format to a normalized data structure having a title field, a summary field, and a text field, the mapping information generated from (i) a probabilistic model using stored statistical information to assign the plurality of tags to the title field, the summary field and the text field of the normalized data structure, and (ii) heuristic data that assigns the plurality of tags to the title field, the summary field and the text field of the normalized data structure based on a size of each of the plurality of tags; parsing the web feed according to the mapping information to generate a normalized web feed from the web feed, the normalized web feed having the normalized data structure specified by the mapping information; determining a relevant geographical locale for the normalized web feed by parsing text data in the text field of the normalized web feed to detect location keywords therein; and
when the relevant geographical locale differs from the geographic location identifier in the source metadata, identifying the relevant geographical locale as a relevant geographic location for the normalized web feed based on the location keywords;filtering the normalized web feed to determine whether the normalized web feed includes qualified content for publication based on the relevant geographic location; and publishing the qualified content to the user through a user interface wherein the qualified content is depicted as being from the relevant geographic location.
-
-
32. A method of using a device to access feeds that are relevant to a geographical locale aggregated from multiple sources through a host site, comprising:
-
receiving a first set of web feeds and a second set of web feeds, each web feed of the first and second sets of web feeds having source metadata that includes a geographic location identifier, and a plurality of tags including title data, summary data, and text data and in a source format; generating mapping information for mapping the tags in the source format to a normalized data structure having a title field, a summary field, and a text field, the mapping information generated from (i) a probabilistic model using stored statistical information to assign the plurality of tags to the title field, the summary field and the text field of the normalized data structure, and (ii) heuristic data that assigns the plurality of tags to the title field, the summary field and the text field of the normalized data structure based on a size of each of the plurality of tags; determining relevant geographic locales corresponding to the first set of web feeds and the second set of web feeds by parsing text data in the text fields of the web feeds to detect location keywords therein; and when the relevant geographical locales differ from the geographic location identifier in the source metadata identifying the geographic locations of the web feeds based exclusively on the location keywords; retrieving first relevant web feeds from the first set and second set of web feeds relevant to a first geographical locale; when the first geographic locale corresponds to a geographic locale of a user, displaying the first relevant web feeds on a user interface when the user visits the host site; in response to the user selecting a particular feed in the first relevant feeds, opening a link to the particular feed in a second user interface such that the user views content of the particular feed on a provider site of the content; wherein, the second user interface includes a widget having a sharing feature allowing the user to share the content with other users; retrieving trended feeds related to a detected trend in response to the user selecting the detected trend and displaying the trended feeds in the user interface; wherein, the trended feeds are also relevant to the first geographical locale based on the location keywords; and in response to the user selecting a second geographical locale through the user interface, retrieving second web feeds relevant to the second geographical locale based on the location keywords and displaying the second relevant web feeds in the second user interface. - View Dependent Claims (33, 34, 35, 36, 37)
-
Specification