Hidden text detection for search result scoring
Abstract
A method for creating a hidden text data index for ranking computerized query search results that includes generating a render tree based on a document object model (DOM) tree for a web page. The render tree includes nodes that correspond to text that will be visually displayed by a client device when executed. The method includes comparing nodes corresponding to text of the DOM tree with the nodes corresponding to text of the render tree to identify the nodes in the DOM tree that will not be visually displayed when executed by the client device. The method also includes creating a hidden text data index for the nodes corresponding to text of the DOM tree not in the render tree. The hidden text data index identifies nodes corresponding to text of the DOM tree as hidden that will not be visually displayed when executed by the client device.
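The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The `DomNode` class, the `build_render_tree` helper, and the sample page are hypothetical, and the sketch assumes that only `display: none` removes a node from the render tree; a real rendering engine takes many more factors into account.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class DomNode:
    """Hypothetical, simplified DOM node: an element or a text node."""
    node_id: int
    tag: str                      # "#text" marks a text node
    text: str = ""
    style: Dict[str, str] = field(default_factory=dict)
    children: List["DomNode"] = field(default_factory=list)


def build_render_tree(node: DomNode) -> Optional[DomNode]:
    """Copy the DOM tree, dropping any subtree styled `display: none`.

    Assumption: this single rule stands in for a full rendering engine.
    """
    if node.style.get("display") == "none":
        return None
    kept = [r for r in (build_render_tree(c) for c in node.children) if r is not None]
    return DomNode(node.node_id, node.tag, node.text, dict(node.style), kept)


def text_node_ids(node: Optional[DomNode]) -> Set[int]:
    """Collect the ids of all text nodes in a tree."""
    if node is None:
        return set()
    ids = {node.node_id} if node.tag == "#text" else set()
    for child in node.children:
        ids |= text_node_ids(child)
    return ids


def hidden_text_ids(dom: DomNode) -> Set[int]:
    """Text nodes present in the DOM tree but absent from the render tree."""
    return text_node_ids(dom) - text_node_ids(build_render_tree(dom))


# Sample page: one visible paragraph and one hidden block of repeated keywords.
page = DomNode(1, "body", children=[
    DomNode(2, "p", children=[DomNode(3, "#text", "Visible paragraph")]),
    DomNode(4, "div", style={"display": "none"}, children=[
        DomNode(5, "#text", "cheap flights cheap flights"),
    ]),
])

print(hidden_text_ids(page))  # {5}
```

Comparing sets of node ids keeps the example short; it relies on the assumption that render tree nodes retain the ids of the DOM nodes they were built from.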
20 Claims
1. A method comprising:
- generating, by a system of one or more computers, a first data structure based on a web page, where the first data structure includes nodes corresponding to text that will be visually displayed from the web page and nodes corresponding to text that will not be visually displayed from the web page when the web page is rendered for display by a client device;
- generating, by the system of one or more computers, a second data structure for the web page based on the first data structure, the second data structure including nodes that correspond to text that will be visually displayed when the web page is rendered for display by the client device;
- comparing, by the system of one or more computers, nodes corresponding to text of the first data structure with nodes corresponding to text of the second data structure to identify the nodes corresponding to text that will not be visually displayed from the web page when the web page is rendered for display by the client device; and
- generating, by the system of one or more computers, weighting factors for the nodes corresponding to text that will not be visually displayed from the web page when the web page is rendered for display by the client device based on the nodes corresponding to text that will not be visually displayed from the web page.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9.
10. A system comprising:
a processor, and a memory, the memory including instructions that when executed cause the processor to:
- generate a first data structure based on a web page, where the first data structure includes nodes corresponding to text that will be visually displayed from the web page and text that will not be visually displayed from the web page when the web page is rendered for display by a client device;
- generate a second data structure for the web page based on the first data structure, the second data structure including nodes that correspond to text that will be visually displayed when the web page is rendered for display by the client device;
- generate a hidden text data index for the web page based on the first data structure and the second data structure, the hidden text data index identifying nodes that correspond to text of the first data structure as hidden that will not be visually displayed when the web page is rendered for display by the client device; and
- generate weighting factors for the nodes corresponding to text that will not be visually displayed from the web page when the web page is rendered for display by the client device based on the nodes corresponding to text that will not be visually displayed from the web page.
Dependent claims: 11, 12, 13, 14.
15. A method comprising:
- generating, by a system of one or more computers, a document object model (DOM) tree for a first web page, where the DOM tree includes nodes corresponding to text that will be visually displayed and nodes corresponding to text that will not be visually displayed when the first web page is rendered for display by a client device;
- generating, by the system of one or more computers, a render tree based on the DOM tree, the render tree including nodes that correspond to text that will be visually displayed by the client device when the first web page is rendered for display by the client device;
- comparing, by the system of one or more computers, nodes corresponding to text of the DOM tree with nodes corresponding to text of the render tree to identify the nodes corresponding to text of the DOM tree that will not be visually displayed when the first web page is rendered for display by the client device; and
- creating, by the system of one or more computers, a hidden text data index based on the DOM tree and the render tree, the hidden text data index identifying nodes corresponding to text of the DOM tree as hidden that will not be visually displayed when the first web page is rendered for display by the client device and including a weighting factor associated with the identified nodes.
Dependent claims: 16, 17, 18, 19, 20.
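As a rough illustration of the hidden text data index recited in claim 15, the Python sketch below records each hidden text node together with a weighting factor and applies that factor when counting query-term matches. The claims do not say how the weighting factor is computed or used, so the constant `HIDDEN_WEIGHT`, the `HiddenTextEntry` record, and the `weighted_term_count` scoring function are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Dict, Set

# Assumed value: the claims do not specify how weighting factors are chosen.
HIDDEN_WEIGHT = 0.25


@dataclass(frozen=True)
class HiddenTextEntry:
    """Hypothetical index record for one hidden text node."""
    node_id: int
    text: str
    weight: float


def build_hidden_text_index(
    dom_text: Dict[int, str],
    rendered_ids: Set[int],
    hidden_weight: float = HIDDEN_WEIGHT,
) -> Dict[int, HiddenTextEntry]:
    """Index every DOM text node whose id is missing from the render tree."""
    return {
        node_id: HiddenTextEntry(node_id, text, hidden_weight)
        for node_id, text in dom_text.items()
        if node_id not in rendered_ids
    }


def weighted_term_count(
    term: str,
    dom_text: Dict[int, str],
    index: Dict[int, HiddenTextEntry],
) -> float:
    """Count occurrences of a term, scaling hidden nodes by their weight."""
    total = 0.0
    for node_id, text in dom_text.items():
        weight = index[node_id].weight if node_id in index else 1.0
        total += weight * text.lower().split().count(term.lower())
    return total


# Node 3 is rendered; node 5 exists in the DOM only, so it is indexed as hidden.
dom_text = {3: "Cheap flights to Oslo", 5: "cheap cheap cheap flights"}
index = build_hidden_text_index(dom_text, rendered_ids={3})
score = weighted_term_count("cheap", dom_text, index)
print(score)  # 1.75: one visible match plus three hidden matches at 0.25 each
```

A single constant weight is the simplest choice; a per-node weight could be stored in the same `weight` field without changing the index layout.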
Specification