×

Vector index preparing method, similar vector searching method, and apparatuses for the methods

  • US 7,007,019 B2
  • Filed: 12/21/2000
  • Issued: 02/28/2006
  • Est. Priority Date: 12/21/1999
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of preparing an index, which is searchable by a computer, with respect to a vector database in which a finite number of ordered lists each including at least N-dimensional real vector and an identification number of the vector are registered as vector data, said index being used for data retrieval using a computer, said method comprising:

  • a first step of vector index preparation of dividing N components into m ordered list in a predetermined method with respect to the N-dimensional real vector V of each vector data in said vector database, preparing m partial vectors v1 to vm, subsequently tabulating a distribution of a norm of the partial vector vk (k=1 to m), preparing a norm partition table which contains a predetermined number of norm ranges, calculating a region number d to which said partial vector vk belongs in accordance with predetermined D region center vectors p1 to pD, tabulating a distribution of a cosine (vk·

    pd)/(|Vk|*|pd|) of an angle formed by said partial vector vk and the region center vector pd as a declination distribution, and preparing a declination partition table which contains a predetermined number of declination ranges;

    a second step of the vector index preparation of dividing N components into m ordered lists in the same method as said first step with respect to the N-dimensional real vector V of each vector data in said vector database, preparing m partial vectors v1 to vm, referring to said norm partition table to calculate a number r of the norm partition to which the norm of said partial vector vb belongs with respect to the partial vector vb (b=1 to m) for the partial space number b, calculating the region number d to which said partial vector vb belongs in accordance with the predetermined D region center vectors p1 to pD in the same method as said first step, calculating a declination (vb·

    pd)/(|vb|*|pd|) as a cosine of an angle formed by said partial vector vb and the region center vector pd indicating a center direction of the region of said region number d, referring to said declination partition table, calculating a number c of the belonging declination partition, and calculating index registration data to be registered in a vector index from said partial space number b, said region number d, said declination partition number c, said norm partition number r, the component of said partial vector vb, and the identification number i; and

    a third step of the vector index preparation of constituting the vector index such that the identification number and the component of each partial vector can be searched using a ordered list of the partial space number b, the region number d, the declination partition number c and a norm partition number range (r1, r2) as a key from said norm partition table, said declination partition table, and said index registration data, and such that the vector component of each vector data can be searched with the identification number of the vector component.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×