
SYSTEM AND METHOD FOR MULTI-DIMENSIONAL AGGREGATION OVER LARGE TEXT CORPORA

  • US 20080228718A1
  • Filed: 03/15/2007
  • Published: 09/18/2008
  • Est. Priority Date: 03/15/2007
  • Status: Active Grant
First Claim

1. A method for retrieving data from an inverted list index within a computer system, wherein the index comprises annotated postings, the method consisting of:

    receiving a query in a system;

    converting the query into a query language;

    scanning at least one list of postings for data from the query;

    aggregating the data in the list, thereby resulting in an aggregated list, wherein the aggregating includes:

    recording the occurrence of unique values from the list;

    mapping the values using a user-provided definition to an alternate value;

    grouping the values by a user-provided mapping of values to groups;

    recording and mutating data associated with the unique value in the list;

    relating the recorded data values with other values in the index; and

    returning the requested data from the aggregated list in a return format,

    wherein the annotated postings contain per-document identification, per-occurrence identification, and per-occurrence related data,

    wherein alternately per-occurrence related data is accessible using per-document identification and per-occurrence identification,

    wherein the unique value is the result of a computation on a pre-existing value,

    wherein recording data associated with the unique value takes place during query processing,

    wherein mutating data associated with the unique value includes numeric calculations, referential mappings, or any other deterministic process.
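The steps recited in the claim can be illustrated with a small sketch. This is not the patented implementation; it is one possible reading, under stated assumptions. The `Posting` class, the `aggregate` function, the `derive`, `remap`, and `group_of` parameters, and the sample `index` are all hypothetical names introduced here for illustration. The sketch assumes that the per-occurrence related data is a single number and that the "return format" is a plain dictionary.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, Iterable


@dataclass(frozen=True)
class Posting:
    """One annotated posting (hypothetical layout)."""
    doc_id: int      # per-document identification
    position: int    # per-occurrence identification
    payload: float   # per-occurrence related data (assumed numeric)


def aggregate(
    postings: Iterable[Posting],
    derive: Callable[[float], Hashable],      # computes the unique value from a pre-existing value
    remap: Callable[[Hashable], Hashable],    # user-provided mapping to an alternate value
    group_of: Callable[[Hashable], Hashable], # user-provided mapping of values to groups
) -> Dict[Hashable, dict]:
    """Scan one postings list once and aggregate it during query processing."""
    buckets = defaultdict(lambda: {"count": 0, "total": 0.0, "docs": set()})
    for p in postings:
        value = remap(derive(p.payload))   # unique value, then its alternate form
        bucket = buckets[group_of(value)]  # group chosen by the user-provided mapping
        bucket["count"] += 1               # record the occurrence
        bucket["total"] += p.payload       # mutate associated data (a numeric calculation)
        bucket["docs"].add(p.doc_id)       # relate the value to other index data (documents)
    # Return format: plain dictionaries with sorted document lists.
    return {
        group: {"count": b["count"], "total": b["total"], "docs": sorted(b["docs"])}
        for group, b in buckets.items()
    }


# Hypothetical index with a single postings list.
index = {
    "price": [
        Posting(doc_id=1, position=0, payload=4.0),
        Posting(doc_id=1, position=7, payload=12.0),
        Posting(doc_id=2, position=3, payload=15.0),
        Posting(doc_id=3, position=1, payload=4.5),
    ]
}

summary = aggregate(
    index["price"],
    derive=lambda x: int(x // 10) * 10,   # floor to the nearest ten
    remap=lambda v: f"{v}-{v + 9}",       # label such as "0-9" or "10-19"
    group_of=lambda label: "low" if label == "0-9" else "high",
)
print(summary)
```

In this sketch, all aggregation happens in a single pass over the postings list, which is one way to read "recording data associated with the unique value takes place during query processing". The three callables stand in for the user-provided definitions the claim mentions; any other deterministic function could be substituted.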
