Enhancing cluster analysis using document metadata
First Claim
Patent Images
1. A system comprising:
- one or more processors;
one or more memories storing program instructions that the one or more processors execute;
computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to receive a search query comprising of at least one search criteria, wherein the at least one search criteria is a text string;
computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to perform a keyword search against an standard index, wherein the standard index is comprised of keywords obtained from a document within a standard cluster, wherein the standard cluster is a document cluster sharing a plurality of keywords;
computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to execute an enhanced search against an enhanced index, wherein the enhanced index is comprised of metadata associated with an enhanced cluster, wherein the enhanced cluster is a document cluster associated with the metadata, wherein the metadata is a user defined metadata;
computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to aggregate the standard cluster and the enhanced cluster into separate merged documents, wherein the separate merged documents are documents comprising of the standard cluster and enhanced cluster contents; and
computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to execute a ranking algorithm on each separate merged document to obtain a final ranking of content within the single document, wherein the ranking algorithm applies a ranking formula, wherein the ranking formula comprises sum of product of a tag line presence value with a hit weight and product of a cluster rank with the hit weight subtracted from one.
1 Assignment
0 Petitions
Accused Products
Abstract
A search query including search criteria can be received. The search criteria can be a text string. An enhanced search against an enhanced index can be executed. The enhanced index can be metadata associated with an enhanced cluster. The enhanced cluster can be a document cluster associated with the metadata. The enhanced cluster can be aggregated into a merged document. The merged document can be a document including the enhanced cluster contents. The ranking algorithm can be executed on the merged document to obtain a final ranking of content within the single document.
-
Citations
17 Claims
-
1. A system comprising:
-
one or more processors; one or more memories storing program instructions that the one or more processors execute; computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to receive a search query comprising of at least one search criteria, wherein the at least one search criteria is a text string; computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to perform a keyword search against an standard index, wherein the standard index is comprised of keywords obtained from a document within a standard cluster, wherein the standard cluster is a document cluster sharing a plurality of keywords; computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to execute an enhanced search against an enhanced index, wherein the enhanced index is comprised of metadata associated with an enhanced cluster, wherein the enhanced cluster is a document cluster associated with the metadata, wherein the metadata is a user defined metadata; computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to aggregate the standard cluster and the enhanced cluster into separate merged documents, wherein the separate merged documents are documents comprising of the standard cluster and enhanced cluster contents; and computer usable program code stored in at least one of the memories, which when the computer usable program code is executed by the one or more processors the system is operable to execute a ranking algorithm on each separate merged document to obtain a final ranking of content within the single document, wherein the ranking algorithm applies a ranking formula, wherein the ranking formula comprises sum of product of a tag line presence value with a hit weight and product of a cluster rank with the hit weight subtracted from one. - View Dependent Claims (2, 3, 4)
-
-
5. A computer program product comprising a non-transitory computer readable storage medium having computer usable program code embodied therewith, the computer usable program code comprising:
-
computer usable program code stored in a non-transitory storage medium, when said computer usable program code is executed by a processor it is operable to receive a search query comprising of at least one search criteria, wherein the at least one search criteria is a text string; computer usable program code stored in a non-transitory storage medium, when said computer usable program code is executed by a processor it is operable to perform a keyword search against an standard index, wherein the standard index is comprised of keywords obtained from a document within a standard cluster, wherein the standard cluster is a document cluster sharing a plurality of keywords; computer usable program code stored in a non-transitory storage medium, when said computer usable program code is executed by a processor it is operable to execute an enhanced search against an enhanced index, wherein the enhanced index is comprised of metadata associated with an enhanced cluster, wherein the enhanced cluster is a document cluster associated with the metadata, wherein the metadata is a user defined metadata; computer usable program code stored in a non-transitory storage medium, when said computer usable program code is executed by a processor it is operable to aggregate the standard cluster and the enhanced cluster into separate merged documents, wherein the separate merged documents are documents comprising of the standard cluster and enhanced cluster contents; and computer usable program code stored in a non-transitory storage medium, when said computer usable program code is executed by a processor it is operable to execute a ranking algorithm on each separate merged document to obtain a final ranking of content within the single document, wherein the ranking algorithm applies a ranking formula, wherein the ranking formula comprises sum of product of a tag line presence value with a hit weight and product of a cluster rank with the hit weight subtracted from one. - View Dependent Claims (6, 7, 8, 9, 10)
-
-
11. A system comprising hardware comprising a processer and software stored on a non-transitory storage medium, wherein programmatic instructions of the software are executable on the hardware causing the system to:
-
receive a search query comprising of at least one search criteria, wherein the at least one search criteria is a text string; perform a keyword search against an standard index, wherein the standard index is comprised of keywords obtained from a document within a standard cluster, wherein the standard cluster is a document cluster sharing a plurality of keywords; execute an enhanced search against an enhanced index, wherein the enhanced index is comprised of metadata associated with an enhanced cluster, wherein the enhanced cluster is a document cluster associated with the metadata, wherein the metadata is a user defined metadata; aggregate the enhanced cluster each into a merged document, wherein the merged document is a document comprising of the enhanced cluster contents; and execute a ranking algorithm on the merged document to obtain a final ranking of content within the single document, wherein the ranking algorithm applies a ranking formula, wherein the ranking formula comprises sum of product of a tag line presence value with a hit weight and product of a cluster rank with the hit weight subtracted from one. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
Specification