Normalizing document metadata using directory services
First Claim
1. In a computerized environment, a method of normalizing document data to improve the results of search requests, the method comprising the acts of:
- receiving a document containing document data;
parsing the document data into one or more document segments;
identifying at least one of the one or more document segments as an alias that correlates with a document datum found in an alias directory service; and
associating the received document with the document alias so that, upon request for the document datum through a search engine, the received document is returned to the requester by association of the document datum with the alias.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention provides methods, systems, and computer program products for normalizing document search terms through use of an alias database, as may be found in an alias relationship file, such as a directory service. A gatherer module receives as input (or crawls through) several documents in series or in parallel and can recognize data segments as related to one of the aliases in the alias relationship file. The gatherer then associates the document appropriately so that a search engine may find all documents associated with a search term, regardless of whether the term has undergone several name changes (various aliases) over the course of time. Accordingly, a user may then search for a person'"'"'s name, and receive as a search result all documents listing the person'"'"'s name, as well as documents listing, for example, only the person'"'"'s email address.
-
Citations
28 Claims
-
1. In a computerized environment, a method of normalizing document data to improve the results of search requests, the method comprising the acts of:
-
receiving a document containing document data;
parsing the document data into one or more document segments;
identifying at least one of the one or more document segments as an alias that correlates with a document datum found in an alias directory service; and
associating the received document with the document alias so that, upon request for the document datum through a search engine, the received document is returned to the requester by association of the document datum with the alias. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. In a computerized environment, a method of normalizing document data to improve the results of search requests, the method comprising:
-
an act of receiving a document containing document data;
an act of parsing the document data into one or more document segments; and
a step for normalizing document metadata used as a reference by a search engine by maintaining one or more relationships between a search term and an alternate search term, a search term property or alternative search term property. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A computer program product having computer-executable instructions for performing a method of normalizing document data to improve the results of search requests, the method comprising the acts of:
-
receiving a document containing document data;
parsing the document data into one or more document segments;
identifying at least one of the one or more document segments as an alias for a document datum found in an alias directory service; and
associating the received document with the document alias so that, upon request for the document datum through a search engine, the received document is returned to the requester by association of the document datum with the alias. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A computer program product having computer-executable instructions for performing a method of normalizing document data to improve the results of search requests, the method comprising:
-
an act of receiving a document containing document data;
an act of parsing the document data into one or more document segments; and
a step for normalizing document metadata used as a reference by a search engine by maintaining one or more relationships between a search term and an alternate search term, a search term property or alternative search term property. - View Dependent Claims (25, 26, 27, 28)
-
Specification