Efficient storage of related sparse data in a search index
First Claim
1. A computer-implemented method to store related sparse data within a search index, the method comprising:
- detecting sparse data within content to be indexed, the sparse data comprising a data point related to an entity by determining if the content matches one or more classification rules;
determining a number of matches and a confidence of the matches for the one or more classification rules;
in response to a determination that there is at least one match, creating an integer array for the data point such that a value associated with the data point is one or more of a count value and a confidence value based on the determined number and confidence of the matches;
in response to receiving a request from a user to perform a query on the search index that includes the data point, transforming the query to a corresponding range based on an identifier of the entity; and
upon execution of the query, providing query results to the user.
2 Assignments
0 Petitions
Accused Products
Abstract
Sparse data may be stored in a search index such that one or more data points related to a number of entities may be queriable. Upon detection of a data point related to an entity, an integer array may be created for the data point to store within a single column of the search index, the integer array including an identifier of the entity and a value associated with the data point. In response to receiving a request from a user to perform a query on the search index that includes the data point, the query may be transformed to a corresponding range based on the identifier of the entity. Upon execution of the query, the identifier may be removed and query results may be provided to the user as another array with a user-compatible identifier for the entity such that the results may be understood by the user.
-
Citations
15 Claims
-
1. A computer-implemented method to store related sparse data within a search index, the method comprising:
-
detecting sparse data within content to be indexed, the sparse data comprising a data point related to an entity by determining if the content matches one or more classification rules; determining a number of matches and a confidence of the matches for the one or more classification rules; in response to a determination that there is at least one match, creating an integer array for the data point such that a value associated with the data point is one or more of a count value and a confidence value based on the determined number and confidence of the matches; in response to receiving a request from a user to perform a query on the search index that includes the data point, transforming the query to a corresponding range based on an identifier of the entity; and upon execution of the query, providing query results to the user. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system configured to store related sparse data within a search index, the system comprising:
-
a communication module configured to transmit data between one or more servers of the system and one or more client devices associated with the system; and at least one server comprising; a memory; and a processor coupled to the memory and the communication interface, the processor configured to execute a search application, wherein the search application is configured to execute a classification engine and a query engine, wherein the classification engine is configured to; determine if content to be indexed matches one or more classification rules to detect sparse data within the content, the sparse data comprising a data point related to an entity determine a number of matches and a confidence of the matches for the one or more classification rules; and in response to a determination that there is at least one match, creating an inter array for the data point such that a value associated with the data point is one or more of a count value and a confidence value based on the determined number and confidence of the matches; and the query engine is configured to; transform a query to a corresponding range based on an identifier of the entity in response to a receipt of a request from a user to perform a query on the search index that includes the data point; and upon execution of the query, provide query results to the user. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A hardware computer-readable memory device with instructions stored thereon to store related sparse data within a search index, the instructions comprising:
-
detecting sensitive data within content to be indexed, the sensitive data comprising a data point related to an entity by determining if the content matches one or more classification rules; determining a number of matches and a confidence of the matches for the one or more classification rules; in response to a determination that there is at least one match, creating an integer array for the data point such that a value associated with the data point is one or more of a count value and a confidence value based on the determined number and confidence of the matches; in response to receiving a request from a user to perform a query on the search index that includes the data point, transforming the query to a corresponding range based on an identifier of the entity; and upon execution of the query; removing the identifier of the entity from the integer array; and providing query results to the user as another array with a user compatible identifier for the entity. - View Dependent Claims (14, 15)
-
Specification