AUTOMATIC CLASSIFICATION OF SEGMENTED PORTIONS OF WEB PAGES
First Claim
Patent Images
1. A method comprising:
- with one or more special purpose computing devices coupled to a memory;
for at least one of a plurality of segmented portions associated with at least one of a plurality of displayable web pages as represented by one or more digital signals of one or more data files stored in said memory, using one or more machine learned models to;
identify one or more feature properties associated with said segmented portion within said one or more data files, and/or otherwise inferable from said one or more data files, andclassify said at least one of said plurality of segmented portions as being at least one of a plurality of segment types based, at least in part, on said one or more identified feature properties; and
storing one or more digital signals in said memory as part of an index associated with said plurality of segmented portions, said index being based, at least in part, on said segment type.
10 Assignments
0 Petitions
Accused Products
Abstract
Exemplary methods and apparatuses are provided which may be used for classifying and indexing segmented portions of web pages and providing related information for use in information extraction and/or information retrieval systems.
-
Citations
20 Claims
-
1. A method comprising:
with one or more special purpose computing devices coupled to a memory; for at least one of a plurality of segmented portions associated with at least one of a plurality of displayable web pages as represented by one or more digital signals of one or more data files stored in said memory, using one or more machine learned models to; identify one or more feature properties associated with said segmented portion within said one or more data files, and/or otherwise inferable from said one or more data files, and classify said at least one of said plurality of segmented portions as being at least one of a plurality of segment types based, at least in part, on said one or more identified feature properties; and storing one or more digital signals in said memory as part of an index associated with said plurality of segmented portions, said index being based, at least in part, on said segment type. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
11. An apparatus comprising:
-
memory having stored therein one or more digital signals representing at least one data file associated with at least one displayable web page; at least one processing unit coupled to said memory and programmed with instructions to; for at least one of a plurality of segmented portions associated with said displayable web page, use one or more machine learned models to; identify one or more feature properties associated with said segmented portion within said one or more data files, and/or otherwise inferable from said one or more data files, and classify said at least one of said plurality of segmented portions as being at least one of a plurality of segment types based, at least in part, on said one or more identified feature properties; and establish an index in said memory, said index associated with said plurality of segmented portions and being based, at least in part, on said segment type. - View Dependent Claims (12, 13, 14, 15)
-
-
16. An article comprising:
a computer readable medium having computer implementable instructions stored thereon which if implemented by one or more processing units in a computing device operatively transform the computing device into a special purpose device to; for at least one of a plurality of segmented portions associated with at least one of a plurality of displayable web pages as represented by one or more digital signals of one or more data files stored in a memory, use one or more machine learned models to; identify one or more feature properties associated with said segmented portion within said one or more data files, and/or otherwise inferable from said one or more data files, and classify said at least one of said plurality of segmented portions as being at least one of a plurality of segment types based, at least in part, on said one or more identified feature properties; and establish one or more digital signals representing an index within a memory coupled to said one or more processing units, said index associated with said plurality of segmented portions and being based, at least in part, on said segment type. - View Dependent Claims (17, 18, 19, 20)
Specification