Relational database system for storing nodes of a hierarchical index of multi-dimensional data in a first module and metadata regarding the index in a second module

US 6,505,205 B1
Filed: 01/03/2002
Issued: 01/07/2003
Est. Priority Date: 05/29/1999
Status: Expired due to Term

Abstract

A system and method for indexing and storing multi-dimensional or multi-attribute data. Data items are recursively sorted in a selected dimension (e.g., the dimension having the greatest variance) and divided until each subdivision fits into a leaf node having a specified fanout. Intermediate nodes and a root node are constructed to complete the index. Each node of the index is stored in a database as a separate object or record and may include a unique identifier of the node, an identifier of a parent and/or a sibling node, and an entry for each child of the node, where a child may be a data item or another node. Each record entry for a child includes an associated bounding area encompassing descendant data items. Another database table or module may store information about the index, such as the dimensionality of the data, the index fanout and an identifier of a root of the index.
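
The bottom-up step described in the abstract (leaf nodes first, then intermediate nodes and a root, each child entry carrying a bounding area) can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `Node`, `enclose` and `build_index`, the sequential node identifiers, and the choice to group consecutive nodes under a parent are all assumptions made for illustration, and the input clusters are assumed to be already divided so that none exceeds the fanout.

```python
from dataclasses import dataclass
from itertools import count

@dataclass
class Node:
    node_id: int
    is_leaf: bool
    entries: list  # (child identifier, (lower corner, upper corner)) pairs

def enclose(boxes):
    """Smallest axis-aligned box covering every box in `boxes`."""
    dims = range(len(boxes[0][0]))
    lower = tuple(min(b[0][d] for b in boxes) for d in dims)
    upper = tuple(max(b[1][d] for b in boxes) for d in dims)
    return lower, upper

def build_index(clusters, fanout):
    """clusters: non-empty list of non-empty lists of (item_id, point) pairs."""
    assert fanout >= 2 and clusters
    ids = count(1)  # hypothetical identifier generator
    # Leaf level: a data item's bounding area is the point itself.
    level = [Node(next(ids), True, [(item_id, (pt, pt)) for item_id, pt in c])
             for c in clusters]
    nodes = {n.node_id: n for n in level}
    # Group up to `fanout` nodes under each parent until one root remains.
    while len(level) > 1:
        parents = []
        for i in range(0, len(level), fanout):
            group = level[i:i + fanout]
            entries = [(n.node_id, enclose([box for _, box in n.entries]))
                       for n in group]
            parents.append(Node(next(ids), False, entries))
        nodes.update({n.node_id: n for n in parents})
        level = parents
    return level[0], nodes

clusters = [
    [(101, (0, 0)), (102, (1, 1))],
    [(103, (5, 5)), (104, (6, 7))],
    [(105, (9, 0))],
]
root, nodes = build_index(clusters, fanout=2)
```

In this sketch each entry in a parent covers every data item reachable through the corresponding child, which is the property the abstract attributes to a bounding area.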

456 Citations

28 Claims

1. A method of storing a hierarchical index of multi-dimensional data in a database on a computer system, comprising:
- constructing a first module in a first relational database;
  
  inserting a row in said first module for a hierarchical index of multi-dimensional data, wherein said row comprises;
  
  an identifier of a root node of said index;
  
  a node capacity of said index; and
  
  a measure of the dimensionality of said multi-dimensional data;
  
  constructing a second module in a second relational database;
  
  inserting a row in said second module for a first node of said index, wherein said row comprises one or more of;
  
  a first identifier of said first node;
  
  a location identifier configured to identify a storage location of said first node;
  
  a parent_node identifier of a parent node of said first node;
  
  a parent_location identifier configured to identify a storage location of said parent node;
  
  a sibling identifier of a sibling node of said first node;
  
  one or more entries, wherein each entry comprises;
  
  an identifier of a child of said first node, wherein said child is either a child data item or a child node; and
  
  a bounding area encompassing either said child data item or a set of data items accessible through said child node; and
  
  a count of the number of said one or more entries; and
  
  storing the multi-dimensional data in a third module in a third relational database.
- Dependent claims (2, 3, 4):
  - 2. The method of claim 1, wherein said first database comprises one or more of said second database and said third database.
  - 3. The method of claim 1, wherein said inserting a row in said second module is performed for every node in said index.
  - 4. The method of claim 1, wherein said parent node of said first node is the root node of said hierarchical index.
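
As an illustration of the three modules recited in claim 1, the following sketch uses Python's standard `sqlite3` module with one table per module in a single in-memory database (the arrangement claim 2 permits). The table and column names, the two-dimensional `x`/`y` data layout, and the JSON encoding of child entries are assumptions chosen for brevity, not details taken from the patent. The location identifier is not stored explicitly here; SQLite's implicit row identifier would play that role.

```python
import json
import sqlite3

def create_index_store():
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        -- first module: one row of metadata per index
        CREATE TABLE index_metadata (
            index_name     TEXT PRIMARY KEY,
            root_node_id   INTEGER,
            node_capacity  INTEGER,
            dimensionality INTEGER
        );
        -- second module: one row per index node
        CREATE TABLE index_nodes (
            node_id        INTEGER PRIMARY KEY,
            parent_node_id INTEGER,
            sibling_id     INTEGER,
            entry_count    INTEGER,
            entries        TEXT   -- JSON list of {child, bounds}
        );
        -- third module: the multi-dimensional data itself
        CREATE TABLE data_items (
            item_id INTEGER PRIMARY KEY,
            x REAL,
            y REAL
        );
    """)
    return conn

def store_leaf(conn, node_id, parent_id, items):
    """items: list of (item_id, x, y) tuples indexed by one leaf node."""
    conn.executemany("INSERT INTO data_items VALUES (?, ?, ?)", items)
    # A data item's bounding area degenerates to the point itself.
    entries = [{"child": i, "bounds": [x, y, x, y]} for i, x, y in items]
    conn.execute(
        "INSERT INTO index_nodes VALUES (?, ?, ?, ?, ?)",
        (node_id, parent_id, None, len(entries), json.dumps(entries)),
    )

conn = create_index_store()
conn.execute("INSERT INTO index_metadata VALUES (?, ?, ?, ?)",
             ("demo_idx", 1, 4, 2))
store_leaf(conn, 1, None, [(10, 0.0, 0.0), (11, 2.0, 3.0)])

meta = conn.execute(
    "SELECT root_node_id, node_capacity, dimensionality FROM index_metadata"
).fetchone()
node = conn.execute(
    "SELECT entry_count, entries FROM index_nodes WHERE node_id = 1"
).fetchone()
```

Keeping the metadata in its own table means the root identifier can change (for example after a root split) without touching any node row other than the ones involved in the split.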

5. A computer-implemented method of constructing a hierarchical index from a set of multi-dimensional data, comprising:
- (a) calculating a number of members of a set of multi-dimensional data items;
  
  (b) determining whether said number of members exceeds a node capacity of a hierarchical index configured to index said data items;
  
  (c) determining a variance of values in one or more dimensions of said data items;
  
  (d) identifying a first dimension of said one or more multi dimensions in which to divide said set of multi-dimensional data items;
  
  (e) sorting said data items in said first dimension;
  
  (f) dividing said sorted data items in said first dimension into two or more subsets;
  
  (g) repeating (a)-(f) for each said subset in order to divide said set of data items into a plurality of data item clusters, wherein each cluster comprises a number of data items no greater than said node capacity; and
  
  (h) configuring a leaf node of said hierarchical index for data items in a first cluster.
- Dependent claims (6, 7, 8, 9, 10, 11, 12, 13):
  - 6. The method of claim 5, wherein said multi-dimensional data items consist of data items having values for multiple attributes.
  - 7. The method of claim 5, in which said identifying comprises selecting a dimension, from said one or more dimensions, having the greatest variance of values.
  - 8. The method of claim 5, in which said dividing comprises:
    - calculating an approximate median value in said first dimension; and
    - selecting a first data item corresponding to said approximate median value.
  - 9. The method of claim 5, in which said configuring comprises:
  - 10. The method of claim 5, wherein the dimensions of said multi-dimensional data are inherently related.
  - 11. The method of claim 5, wherein the dimensions of the multi-dimensional data are independent attributes.
  - 12. The method of claim 5, further comprising identifying a query pattern for retrieving one or more data items from said set of data items, wherein said query pattern comprises a hierarchy of two or more dimensions of said multi-dimensional data items.
  - 13. The method of claim 12, in which said identifying comprises selecting a dimension in said hierarchy of two or more dimensions.
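
Steps (a) through (g) of claim 5, combined with the greatest-variance selection of claim 7, can be sketched as a short recursive function. This is a minimal sketch under stated assumptions: the function name `cluster` is hypothetical, data items are plain tuples of numbers, each division produces exactly two subsets, and the split point is the exact positional median of the sorted items rather than the approximate median of claim 8.

```python
from statistics import pvariance

def cluster(items, capacity):
    """Divide `items` into clusters holding at most `capacity` items each."""
    assert capacity >= 1
    # (a)-(b): count the members and stop once they fit in one node.
    if len(items) <= capacity:
        return [items]
    dims = range(len(items[0]))
    # (c): variance of the values in each dimension.
    variances = [pvariance([p[d] for p in items]) for d in dims]
    # (d): divide along the dimension with the greatest variance.
    split_dim = max(dims, key=lambda d: variances[d])
    # (e): sort the items in that dimension.
    ordered = sorted(items, key=lambda p: p[split_dim])
    # (f): divide the sorted items into two subsets at the median position.
    mid = len(ordered) // 2
    # (g): repeat for each subset.
    return cluster(ordered[:mid], capacity) + cluster(ordered[mid:], capacity)

points = [(0, 0), (10, 1), (1, 0), (11, 1), (2, 1), (12, 0)]
clusters = cluster(points, capacity=3)
```

With these sample points the first dimension has by far the larger variance, so the single split separates the low-x items from the high-x items. Each resulting cluster would then back one leaf node, as in step (h).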

14. A method of updating an electronic index of multi-dimensional data items, comprising:
- receiving a new data item to be added to a set of hierarchically indexed multi-dimensional data items;
  
  inserting said new data item in a leaf node of said hierarchical index, wherein each node of said index comprises one or more of;
  
  a node identifier of said node;
  
  a row identifier of a storage location of said node in a database;
  
  a parent_node identifier of a parent of said node;
  
  a parent_row identifier of a storage location of said parent of said node in said database;
  
  a sibling identifier of a sibling of said node; and
  
  a set of child entries corresponding to children of said node;
  
  determining whether a first node in said index splits due to said inserting;
  
  if said first node splits due to said inserting;
  
  creating a second node;
  
  assigning said second node said node identifier of said first node;
  
  assigning a new node identifier to said first node; and
  
  transferring one or more child entries of said set of child entries of said first node to said second node;
  
  if said first node is a root node of said index;
  
  creating a new root node of said index;
  
  setting said parent_node identifier of one of said first node and said second node to said node identifier of said new root node; and
  
  setting said parent_row identifier of said one of said first node and said second node to said row identifier of said new root node; and
  
  updating metadata regarding said index, said metadata comprising;
  
  a dimensionality of said multi-dimensional data items;
  
  an identifier of said root node of said index; and
  
  a node capacity of said index.
- Dependent claims (15, 16):
  - 15. The method of claim 14, in which said inserting a new data item comprises:
  - 16. The method of claim 15, in which said determining comprises, after said adding:
    - removing said first node identifiers of said one or more nodes from said data structure in reverse order of said storing; and
    - determining whether said first node identifier of a node removed from said data structure is different from a node identifier of said node at said removing.
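
The identifier swap in claim 14 (the newly created second node takes over the first node's identifier, and the first node receives a fresh one) is what lets the check in claim 16 detect a split by comparing a remembered identifier with the node's current one. The sketch below illustrates this for the simplest case, a single leaf that is also the root. The names `Node`, `split` and `insert`, the even division of entries, the dictionary used for metadata, and the omission of row identifiers and bounding areas are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from itertools import count
from typing import Optional

@dataclass
class Node:
    node_id: int
    parent_id: Optional[int] = None
    entries: list = field(default_factory=list)

def split(first, new_ids):
    """Split `first`; the new node inherits the identifier of `first`."""
    second = Node(node_id=first.node_id, parent_id=first.parent_id)
    first.node_id = next(new_ids)          # first node gets a new identifier
    half = len(first.entries) // 2
    second.entries = first.entries[half:]  # transfer some child entries
    first.entries = first.entries[:half]
    return second

def insert(leaf, entry, capacity, metadata, new_ids):
    """Insert into `leaf`; returns (second node, new root), each may be None."""
    remembered_id = leaf.node_id           # identifier seen before inserting
    leaf.entries.append(entry)
    second = split(leaf, new_ids) if len(leaf.entries) > capacity else None
    new_root = None
    # A changed identifier signals that the node split.
    if leaf.node_id != remembered_id and metadata["root_id"] == remembered_id:
        new_root = Node(node_id=next(new_ids),
                        entries=[leaf.node_id, second.node_id])
        leaf.parent_id = second.parent_id = new_root.node_id
        metadata["root_id"] = new_root.node_id  # update the index metadata
    return second, new_root

metadata = {"root_id": 1, "node_capacity": 2, "dimensionality": 2}
leaf = Node(node_id=1, entries=["a", "b"])
second, new_root = insert(leaf, "c", capacity=2, metadata=metadata,
                          new_ids=count(2))
```

In a multi-level index the same comparison would be repeated for each ancestor on the insertion path; that traversal is left out of this sketch.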

17. A computer readable storage medium storing instructions that, when executed by a computer, cause the computer to perform a method of storing a hierarchical index of multi-dimensional data in a database on a computer system, the method comprising:
- constructing a first module in a first relational database;
  
  inserting a row in said first module for a hierarchical index of multi-dimensional data, wherein said row comprises;
  
  an identifier of a root node of said index;
  
  a node capacity of said index; and
  
  a measure of the dimensionality of said multi-dimensional data;
  
  constructing a second module in a second relational database;
  
  inserting a row in said second module for a first node of said index, wherein said row comprises one or more of;
  
  a first identifier of said first node;
  
  a location identifier configured to identify a storage location of said first node;
  
  a parent_node identifier of a parent node of said first node;
  
  a parent_location identifier configured to identify a storage location of said parent node;
  
  a sibling identifier of a sibling node of said first node;
  
  one or more entries, wherein each entry comprises;
  
  an identifier of a child of said first node, wherein said child is either a child data item or a child node; and
  
  a bounding area encompassing either said child data item or a set of data items accessible through said child node; and
  
  a count of the number of said one or more entries; and
  
  storing the multi-dimensional data in a third module in a third relational database.

18. A computer readable storage medium containing a data structure configured for hierarchically indexing multi-dimensional data items, said data structure comprising:
- a first module configured to store a set of records, wherein each record consists of one or more of;
  
  an identifier of a node of a hierarchical index of multi-dimensional data items;
  
  an identifier of said record;
  
  an identifier of a parent node of said node;
  
  an identifier of a sibling node of said node; and
  
  one or more child entries, each said child entry comprising;
  
  an identifier of a child item of said node; and
  
  a bounding area encompassing one or more of said multi-dimensional data items; and
  
  a second module configured to store one or more of;
  
  a dimensionality of said multi-dimensional data items;
  
  an identifier of a root node of said index; and
  
  a measure of a node capacity of said index.
- Dependent claims (19, 20, 21, 22, 23):
  - 19. The computer readable storage medium of claim 18, in which the data structure further comprises a third module configured to store one or more of said multi-dimensional data items.
  - 20. The computer readable storage medium of claim 18, wherein
  - 21. The computer readable storage medium of claim 18, wherein each of said multi-dimensional data items comprises a geographical position.
  - 22. The computer readable storage medium of claim 21, wherein said bounding area comprises a geographical region surrounding said geographic positions corresponding to said one or more multi-dimensional data items.
  - 23. The computer readable storage medium of claim 18, wherein said node capacity comprises a fanout of said index.

24. An apparatus for indexing a set of multi-dimensional data items, comprising:
- a storage device configured to store a set of multi-dimensional data items;
  
  a processor configured to manipulate said set of multi-dimensional data items; and
  
  a database configured to store a hierarchical index of said set of multi-dimensional data items, wherein said index comprises one or more nodes, the database comprising;
  
  a first module configured to store one or more of;
  
  an identifier of a root node of said index;
  
  a node capacity of said index; and
  
  a dimensionality of said multi-dimensional data items; and
  
  a second module comprising a record for each of said one or more nodes, a first record for a first node comprising;
  
  an identifier of said first node; and
  
  an entry for each child of said first node, wherein a child is one of a data item and a child node and wherein a first entry comprises;
  
  an identifier of a first child of said first node; and
  
  a bounding region encompassing said first child if said first child is a data item or all data items of descendants of said first child if said first child is a child node.
- Dependent claims (25, 26):
  - 25. The apparatus of claim 24, further comprising a generator for generating unique identifiers for each node of said index.
  - 26. The apparatus of claim 24, wherein said database further comprises:

27. A computer readable storage medium storing instructions that, when executed by a computer, cause the computer to perform a method of constructing a hierarchical index from a set of multi-dimensional data, the method comprising:
- (a) calculating a number of members of a set of multi-dimensional data items;
  
  (b) determining whether said number of members exceeds a node capacity of a hierarchical index configured to index said data items;
  
  (c) determining a variance of values in one or more dimensions of said data items;
  
  (d) identifying a first dimension of said one or more multi dimensions in which to divide said set of multi-dimensional data items;
  
  (e) sorting said data items in said first dimension;
  
  (f) dividing said sorted data items in said first dimension into two or more subsets;
  
  (g) repeating (a)-(f) for each said subset in order to divide said set of data items into a plurality of data item clusters, wherein each cluster comprises a number of data items no greater than said node capacity; and
  
  (h) configuring a leaf node of said hierarchical index for data items in a first cluster.

28. A computer readable storage medium storing instructions that, when executed by a computer, cause the computer to perform a method of updating an electronic index of multi-dimensional data items, the method comprising:
- receiving a new data item to be added to a set of hierarchically indexed multi-dimensional data items;
  
  inserting said new data item in a leaf node of said hierarchical index, wherein each node of said index comprises one or more of;
  
  a node identifier of said node;
  
  a row identifier of a storage location of said node in a database;
  
  a parent_node identifier of a parent of said node;
  
  a parent_row identifier of a storage location of said parent of said node in said database;
  
  a sibling identifier of a sibling of said node; and
  
  a set of child entries corresponding to children of said node;
  
  determining whether a first node in said index splits due to said inserting;
  
  if said first node splits due to said inserting;
  
  creating a second node;
  
  assigning said second node said node identifier of said first node;
  
  assigning a new node identifier to said first node; and
  
  transferring one or more child entries of said set of child entries of said first node to said second node; and
  
  if said first node is a root node of said index;
  
  creating a new root node of said index;
  
  setting said parent_node identifier of one of said first node and said second node to said node identifier of said new root node; and
  
  setting said parent_row identifier of said one of said first node and said second node to said row identifier of said new root node.

Current Assignee: Oracle International Corporation (Oracle Corporation)
Original Assignee: Oracle Corporation
Inventors: Ravada, Siva; Kothuri, Ravi; Sharma, Jayant; Banerjee, Jayanta
Primary Examiner(s): CHANNAVAJJALA, SRIRAMA T
Application Number: US10/037,923
Time in Patent Office: 369 Days
Field of Search: 707/1-8, 707/10, 707/100-104, 707/200-206, 707/501.1, 707/514, 345/841, 345/853
US Class Current: 1/1

CPC Class Codes
G06F 16/2246   Trees, e.g. B+trees
G06F 16/2264   Multidimensional index stru...
Y10S 707/99931   Database or file accessing
Y10S 707/99932   Access augmentation or opti...
Y10S 707/99934   Query formulation, input pr...
Y10S 707/99937   Sorting
Y10S 707/99942   Manipulating data structure...
Y10S 707/99943   Generating database or data...
Y10S 707/99945   Object-oriented database st...
