System and method for indexing streams containing unstructured text data

  • US 9,262,511 B2
  • Filed: 03/15/2013
  • Issued: 02/16/2016
  • Est. Priority Date: 07/30/2012
  • Status: Active Grant
First Claim

1. A method for indexing data, comprising the steps of:

  • receiving data-streams, wherein the data-streams comprise data-elements;

    storing the data-elements of the received data-streams, wherein the stored data-elements are stored via one or more processors in block-stores;

    allocating the stored data-elements to data-blocks of the block-stores, wherein the stored data-elements are allocated via the one or more processors to the data-blocks;

    further allocating the block-allocated data-elements to events of the data-blocks, wherein each of the data-blocks comprise one or more events, wherein each of the events comprise the block-allocated data-elements of the corresponding data-block, wherein the block-allocated data-elements are allocated via the one or more processors to the events;

    splitting the event-allocated data-elements into terms, wherein the event-allocated data-elements are split via the one or more processors into the terms;

    calculating term frequencies of each term in each of the events, wherein the term frequencies are calculated via the one or more processors;

    calculating block-level term frequency data for the event-allocated data-elements stored in the corresponding data-block based on the term frequencies, wherein the block-level term frequency data is calculated via the one or more processors; and

    generating tree index structures for the event-allocated data-elements based on the block-level term frequency data, wherein the tree index structures comprise Y-tree index structures, wherein the terms are used in the Y-tree index structures as keys, wherein the tree index structures are calculated via the one or more processors.
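
The steps recited in the claim can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the tokenizer, the function names, and the `SortedTermIndex` class are all hypothetical, and a sorted term list with per-term postings stands in for the Y-tree index structures, which are not reproduced here. Blocks are assumed to arrive as a mapping from a block identifier to a list of event strings.

```python
from bisect import bisect_left, insort
from collections import Counter
import re


def split_into_terms(event_text):
    """Split one event into lowercase alphanumeric terms (assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+", event_text.lower())


def event_term_frequencies(event_text):
    """Count how often each term occurs within a single event."""
    return Counter(split_into_terms(event_text))


def block_term_frequencies(events):
    """Aggregate per-event term frequencies into block-level counts."""
    total = Counter()
    for event in events:
        total.update(event_term_frequencies(event))
    return total


class SortedTermIndex:
    """Hypothetical stand-in for a tree index keyed by terms.

    Keeps terms in sorted order and maps each term to
    {block_id: block-level frequency}. This is NOT a Y-tree.
    """

    def __init__(self):
        self._terms = []      # sorted list of distinct terms (the keys)
        self._postings = {}   # term -> {block_id: frequency}

    def add_block(self, block_id, block_tf):
        for term, freq in block_tf.items():
            if term not in self._postings:
                insort(self._terms, term)
                self._postings[term] = {}
            self._postings[term][block_id] = freq

    def lookup(self, term):
        """Return {block_id: frequency} for an exact term."""
        return dict(self._postings.get(term, {}))

    def terms_with_prefix(self, prefix):
        """Return all indexed terms starting with the given prefix."""
        i = bisect_left(self._terms, prefix)
        matches = []
        while i < len(self._terms) and self._terms[i].startswith(prefix):
            matches.append(self._terms[i])
            i += 1
        return matches


def build_index(blocks):
    """Build the term-keyed index from {block_id: [event, ...]}."""
    index = SortedTermIndex()
    for block_id, events in blocks.items():
        index.add_block(block_id, block_term_frequencies(events))
    return index
```

Looking up a term returns the blocks that contain it together with the block-level frequency, so a query can skip blocks where the term never occurs before scanning individual events.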
