×

SYSTEM AND METHOD FOR MULTITHREADED TEXT INDEXING FOR NEXT GENERATION MULTI-CORE ARCHITECTURES

  • US 20110252033A1
  • Filed: 04/09/2010
  • Published: 10/13/2011
  • Est. Priority Date: 04/09/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method for indexing documents, comprising:

  • generating a single document hash table in storage memory for a single document using an index construction in a multithreaded and scalable configuration wherein multiple threads are each assigned work to reduce synchronization between threads wherein generating a single document hash table includes;

    partitioning the single document a plurality of subparts and indexing strings of partitioned subparts of the single document to create a minor hash table for each subpart;

    generating a document level hash table from the minor hash tables;

    updating a stream level hash table for the strings which maps every string to a global identifier; and

    generating a term reordered array from the document level hash table.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×