Method and system for classification of venue by analyzing data from venue website
First Claim
1. A method for classifying a venue by analyzing venue data from a venue website, comprising:
- receiving preliminary venue-related data including a venue URL;
scanning the venue website to retrieve venue data;
retrieving verifiable venue data from the venue data, the verifiable venue data being a subset of the venue data;
analyzing, using a computer, the verifiable venue data by comparing the verifiable venue data to the preliminary venue-related data;
determining a probability level for the venue URL from the comparison;
if the probability level for the venue URL is equal or greater than a first probability level, determining the number of selected attributes in the venue data;
determining the percentage of the attribute representation from the total number of preselected attributes in the venue data; and
classifying the venue based on the percentage of the attribute representation.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system classifies a venue by analyzing venue data from a venue website. The method includes receiving preliminary venue-related data. The method includes scanning the venue website to retrieve venue data, wherein scanning the venue website includes retrieving the venue data from HTML pages, text documents, PDF documents, and images. The method includes retrieving verifiable venue data from the venue data. The verifiable venue data is a subset of the venue data. The method includes analyzing the verifiable venue data by comparing the verifiable venue data to the preliminary venue-related data and determining a probability level for the venue URL from the comparison. If the probability level for the venue URL is equal or greater than a first probability level, the venue website data is further analyzed to extract attributes and attribute counts in a robust and context-sensitive way. The method includes determining the percentage of the attribute representation from the total number of preselected attributes in the venue data and classifying the venue based on the percentage of the attribute representation.
-
Citations
24 Claims
-
1. A method for classifying a venue by analyzing venue data from a venue website, comprising:
-
receiving preliminary venue-related data including a venue URL; scanning the venue website to retrieve venue data; retrieving verifiable venue data from the venue data, the verifiable venue data being a subset of the venue data; analyzing, using a computer, the verifiable venue data by comparing the verifiable venue data to the preliminary venue-related data; determining a probability level for the venue URL from the comparison; if the probability level for the venue URL is equal or greater than a first probability level, determining the number of selected attributes in the venue data; determining the percentage of the attribute representation from the total number of preselected attributes in the venue data; and classifying the venue based on the percentage of the attribute representation. - View Dependent Claims (2, 3, 4)
-
-
5. A computer-implemented method for determining attributes of a venue, the method comprising steps to:
-
analyze first data associated with a first venue to identify a first set of venue attributes associated with the first venue; analyze second data associated with a second venue to identify a second set of venue attributes associated with the second venue; compare, using a computing device, the first set of venue attributes with the second set of venue attributes; and determine, based on comparing the first set and the second set, a level of similarity between the first venue and the second venue. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A system for determining attributes of a venue, the system comprising one or more processors that are operable to:
-
analyze first data associated with a first venue to identify a first set of venue attributes associated with the first venue; analyze second data associated with a second venue to identify a second set of venue attributes associated with the second venue; compare, using a computing device, the first set of venue attributes with the second set of venue attributes; and determine, based on comparing the first set and the second set, a level of similarity between the first venue and the second venue. - View Dependent Claims (22)
-
-
23. A computer program product comprising a non-transitory computer usable medium having a computer readable program code embodied therein, said computer readable program code adapted to be executed to implement a method for determining attributes of a venue, the method comprising steps to:
-
analyze first data associated with a first venue to identify a first set of venue attributes associated with the first venue; analyze second data associated with a second venue to identify a second set of venue attributes associated with the second venue; compare, using a computing device, the first set of venue attributes with the second set of venue attributes; and determine, based on comparing the first set and the second set, a level of similarity between the first venue and the second venue. - View Dependent Claims (24)
-
Specification