Mechanism for Separating Content from Noisy Context in Template-Based Documents for Search Indexing
First Claim
Patent Images
1. A computer-implemented method, comprising:
- selecting, by a server computer, a plurality of documents for index comparison;
identifying, by the server computer, one or more identical elements found in each of the plurality of documents; and
removing, by the server computer, the one or more identical elements from consideration in an indexing process of the plurality of documents.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, a mechanism for separating content from noisy context in template-based documents for search indexing is disclosed. In one embodiment, a method includes selecting a plurality of documents for index comparison, identifying one or more identical elements found in each of the plurality of documents, and removing the one or more identical elements from consideration in an indexing process of the plurality of documents.
25 Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
selecting, by a server computer, a plurality of documents for index comparison; identifying, by the server computer, one or more identical elements found in each of the plurality of documents; and removing, by the server computer, the one or more identical elements from consideration in an indexing process of the plurality of documents. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system, comprising:
-
a network interface to receive a plurality of documents for index comparison; and a comparison module communicably coupled to the network interface to receive the plurality of documents, the comparison module operable to; identify one or more identical elements found in each of the plurality of documents; and remove the one or more identical elements from consideration in an indexing process of the plurality of documents. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. An article of manufacture comprising a machine-readable storage medium including data that, when accessed by a machine, cause the machine to perform operations comprising:
-
selecting a plurality of documents for index comparison; identifying one or more identical elements found in each of the plurality of documents; and removing the one or more identical elements from consideration in an indexing process of the plurality of documents. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification