Updating taxonomy based on webpage
First Claim
Patent Images
1. A computer-implemented method comprising:
- extracting, by a computing device, structured content from a website associated with a stored taxonomy, the structured content including at least a first menu and a second menu, contents of the second menu depending on a selection from contents of the first menu;
determining a recent taxonomy by applying category rules to the structured content, the category rules being customized for a structure of a page of the website including the first menu and the second menu, the category rules dictating that contents of the second menu represent subcategories of at least one known category represented by contents of the first menu, the recent taxonomy including multiple known subcategories of the known category and a new subcategory represented by a new item on the second menu; and
updating the stored taxonomy based on the determined recent taxonomy by adding the new subcategory to the stored taxonomy.
2 Assignments
0 Petitions
Accused Products
Abstract
According to an example implementation, a computer-implemented method may include extracting, by a computing device, structured content from a website, determining a recent taxonomy by applying category rules to the structured content, the recent taxonomy including multiple categories and a new category, and updating a stored taxonomy based on the determined recent taxonomy by adding the new category to the stored taxonomy.
28 Citations
26 Claims
-
1. A computer-implemented method comprising:
-
extracting, by a computing device, structured content from a website associated with a stored taxonomy, the structured content including at least a first menu and a second menu, contents of the second menu depending on a selection from contents of the first menu; determining a recent taxonomy by applying category rules to the structured content, the category rules being customized for a structure of a page of the website including the first menu and the second menu, the category rules dictating that contents of the second menu represent subcategories of at least one known category represented by contents of the first menu, the recent taxonomy including multiple known subcategories of the known category and a new subcategory represented by a new item on the second menu; and updating the stored taxonomy based on the determined recent taxonomy by adding the new subcategory to the stored taxonomy. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method comprising:
-
extracting, by a computing device, structured content from a website associated with a stored taxonomy, the structured content of the website including a table and table entries within the table; determining a recent taxonomy by applying category rules to the structured content, the recent taxonomy including multiple known subcategories of a known category and a new subcategory of the known category, the category rules being customized for the structure of the website to dictate that the table entries indicate subcategories of the known category and a table entry within the table must be the new subcategory; and updating the stored taxonomy based on the determined recent taxonomy by adding the new subcategory of the known category to the stored taxonomy. - View Dependent Claims (18, 19, 20, 21, 22)
-
-
23. A non-transitory computer-readable medium including executable code tangibly embodied thereon, the executable code being configured to, when executed, cause a data processing apparatus to:
-
extract structured content from a website associated with a stored taxonomy, the structured content including at least a first menu and a second menu, contents of the second menu depending on a selection from contents of the second menu; determine a recent taxonomy by applying category rules to the structured content, the category rules being customized for a structure of a page of the website including the first menu and the second menu, the category rules dictating that contents of the second menu represent subcategories of at least one known category represented by contents of the first menu, the recent taxonomy including multiple known subcategories of the known category and a new subcategory represented by a new item on the second menu; and update the stored taxonomy based on the determined recent taxonomy by adding the new subcategory to the stored taxonomy. - View Dependent Claims (24)
-
-
25. An apparatus comprising:
-
at least one processor; and at least one memory device, the at least one memory device comprising executable code stored thereon that, when executed by the at least one processor, is configured to cause the apparatus to; extract structured content from a website associated with a stored taxonomy, the structured content including at least a first menu and a second menu, contents of the second menu depending on a selection from contents of the first menu; determine a recent taxonomy by applying category rules to the structured content, the category rules being customized for a structure of a page of the website including the first menu and the second menu, the category rules dictating that contents of the second menu represent subcategories of at least one known category represented by contents of the first menu, the recent taxonomy including multiple known subcategories of the known category and a new subcategory represented by a new item on the second menu; and update the stored taxonomy based on the determined recent taxonomy by adding the new subcategory to the stored taxonomy. - View Dependent Claims (26)
-
Specification