METHOD AND SYSTEM FOR CLASSIFICATION OF VENUE BY ANALYZING DATA FROM VENUE WEBSITE
First Claim
1. A method for classifying a venue by analyzing venue data from a venue website, comprising:
- receiving preliminary venue-related data including a venue URL;
scanning the venue website to retrieve venue data;
retrieving verifiable venue data from the venue data, the verifiable venue data being a subset of the venue data;
analyzing the verifiable venue data by comparing the verifiable venue data to the preliminary venue-related data;
determining a probability level for the venue URL from the comparison;
if the probability level for the venue URL is equal or greater than a first probability level, determining the number of selected attributes in the venue data;
determining the percentage of the attribute representation from the total number of preselected attributes in the venue data;
classifying the venue based on the percentage of the attribute representation.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system classifies a venue by analyzing venue data from a venue website. The method includes receiving preliminary venue-related data. The method includes scanning the venue website to retrieve venue data, wherein scanning the venue website includes retrieving the venue data from HTML pages, text documents, PDF documents, and images. The method includes retrieving verifiable venue data from the venue data. The verifiable venue data is a subset of the venue data. The method includes analyzing the verifiable venue data by comparing the verifiable venue data to the preliminary venue-related data and determining a probability level for the venue URL from the comparison. If the probability level for the venue URL is equal or greater than a first probability level, the venue website data is further analyzed to extract attributes and attribute counts in a robust and context-sensitive way. The method includes determining the percentage of the attribute representation from the total number of preselected attributes in the venue data and classifying the venue based on the percentage of the attribute representation.
-
Citations
24 Claims
-
1. A method for classifying a venue by analyzing venue data from a venue website, comprising:
-
receiving preliminary venue-related data including a venue URL; scanning the venue website to retrieve venue data; retrieving verifiable venue data from the venue data, the verifiable venue data being a subset of the venue data; analyzing the verifiable venue data by comparing the verifiable venue data to the preliminary venue-related data; determining a probability level for the venue URL from the comparison; if the probability level for the venue URL is equal or greater than a first probability level, determining the number of selected attributes in the venue data; determining the percentage of the attribute representation from the total number of preselected attributes in the venue data; classifying the venue based on the percentage of the attribute representation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method for classifying a venue by analyzing venue data from a venue website, comprising:
-
receiving venue data from the venue website; determining the number of preselected attributes in the venue data; determining a percentage of each preselected attribute representation from the total number of preselected attributes in the venue data; assigning a classifier factor for each preselected attribute in the venue data; classifying the venue based on the percentage of each attribute representation and the classifier factor. - View Dependent Claims (16, 17, 18)
-
-
19. A method for classifying a venue by analyzing selected attributes in a venue website, comprising:
-
entering the venue website using a URL corresponding to the website; receiving venue data from the venue website; determining the number of selected attributes in the venue website; determining a percentage of each selected attribute representation from the total number of preselected attributes in the URL data; assigning a classifier factor for each selected attribute in the URL data; classifying the venue based on the percentage of each attribute representation and the classifier factor. - View Dependent Claims (20, 21, 22)
-
-
23. A system for classifying a venue by analyzing venue data from a venue website, comprising:
-
a server having a central processing unit, the server receiving the venue data from the venue website; a venue classification application having program code for executing a plurality of steps to analyze the venue data and classify the venue data; a communication network enabling the server to access the venue website to receive the venue data; the central processing unit executing the steps of; determining the number of selected attributes in the venue data; determining a percentage of each selected attribute representation from the total number of selected attributes in the venue data; assigning a classifier factor for each selected attribute in the venue data; classifying the venue based on the percentage of each attribute representation and the classifier factor. - View Dependent Claims (24)
-
Specification