Code string processing system and method using intervals
First Claim
1. A data processing system comprising:
- a binary tree generating means for generating a binary tree representing a range of each interval based on the intervals in a code string including at least one interval,said binary tree having nodes each of which corresponds to an interval,the end side of each node being connected to a preceding node corresponding to an interval preceding the interval of the node or to a following node corresponding to an interval following the interval of the node,each node being assigned a partial interval length which is a sum of the interval length of the node itself and the interval length of at least one node in the end side of the node; and
an interval retrieving means for identifying an interval where a specified position indicating a position within said code string is included,wherein said interval retrieving means sequentially moves a current node, pointed to by a specified pointer, from said root to said end side, calculates the range of the current node based on the partial interval length of at least one of the current node and said preceding node and said following node directly connected to the end side of the current node, compares the calculated interval with said specified position, moves the current node to the preceding node when the specified position precedes the calculated interval;
moves the current node to the following node when the specified position follows the calculated interval, and identifies that the specified position is included in the current node when the specified position is included in the calculated interval.
1 Assignment
0 Petitions
Accused Products
Abstract
A data retrieval system which updates data quickly. A divider determines the first substring and second substring based on the code string from which a key string is retrieved. For each substring, a generator generates the dictionary data showing the correspondence between a trailing string, which is a trailing part of data in the substring, and the start position of the trailing string within the code string. A retriever a trailing string whose leading string is a key string or a part of the key string, based on the dictionary data. A remover removes duplicate trailing strings. When a changer changes the code string, an updater updates dictionary data associated with the substring based on the contents of the change. A first maintaining device maintains the boundary interval at a maximum key length or longer, and a second maintaining device maintains the boundary interval at a specified length or less.
-
Citations
25 Claims
-
1. A data processing system comprising:
-
a binary tree generating means for generating a binary tree representing a range of each interval based on the intervals in a code string including at least one interval, said binary tree having nodes each of which corresponds to an interval, the end side of each node being connected to a preceding node corresponding to an interval preceding the interval of the node or to a following node corresponding to an interval following the interval of the node, each node being assigned a partial interval length which is a sum of the interval length of the node itself and the interval length of at least one node in the end side of the node; and an interval retrieving means for identifying an interval where a specified position indicating a position within said code string is included, wherein said interval retrieving means sequentially moves a current node, pointed to by a specified pointer, from said root to said end side, calculates the range of the current node based on the partial interval length of at least one of the current node and said preceding node and said following node directly connected to the end side of the current node, compares the calculated interval with said specified position, moves the current node to the preceding node when the specified position precedes the calculated interval;
moves the current node to the following node when the specified position follows the calculated interval, and identifies that the specified position is included in the current node when the specified position is included in the calculated interval. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A data processing method comprising:
-
a binary tree generating process for generating a binary tree representing a range of each interval based on the intervals in a code string including at least one interval, said binary tree having nodes of which corresponds to an interval, the end side of each node being connected to a preceding node corresponding to an interval preceding the interval of the node or to a following node corresponding to an interval following the interval of the node, each node being assigned a partial interval length which is the sum of the interval length of the node itself and the interval length of at least one node on the end side of the node, and an interval retrieving process for identifying an interval where a specified position indicating a position within said code string is included, wherein said interval retrieving process sequentially moves a current node, pointed to by a specified pointer, from said root to said end side, calculates the range of the current node based on the partial interval length of at least one of the current node and said preceding node and said following node directly connected to the end side of the current node, compares the calculated interval with said specified position, moves the current node to the preceding node when the specified position precedes the calculated interval, moves the current node to the following node when the specified position follows the calculated interval, and identifies that the specified position is included in the current node when the specified position is included in the calculated interval. - View Dependent Claims (20, 21, 22, 23)
-
-
24. A data processing system comprising:
-
a binary tree generating means for generating a binary tree representing a range of each interval based on the intervals in a code string including at least one interval, said binary tree having nodes each of which corresponds to an interval, each node being assigned a partial interval length which is the sum of the interval lengths of all the nodes included in the subtree whose root is the node including the node itself, and an interval retrieving means for identifying an interval where a specified position indicating a position within said code string is included, wherein said interval retrieving means sequentially moves a current node, pointed to by a specified pointer, from said root to said end side, calculates the range of the current node based on the partial interval length of at least one of the current node and said preceding node and said following node directly connected to the end side of the current node, compares the calculated interval with said specified position, moves the current node to the preceding node when the specified position precedes the calculated interval, moves the current node to the following node when the specified position follows the calculated interval, and identifies that the specified position is included in the current node when the specified position is included in the calculated interval.
-
-
25. A data processing method comprising:
-
a binary tree generating process for generating a binary tree representing a range of each interval based on the intervals in a code string including at least one interval, said binary tree having nodes each of which corresponds to an interval, each node being assigned a partial interval length which is the sum of the interval lengths of all the nodes included in the subtree whose root is the node including the node itself, and an interval retrieving process for identifying an interval where a specified position indicating a position within said code string is included, wherein said interval retrieval process sequentially moves a current node, pointed to by a specified pointer, from said root to said end side, calculates the range of the current node based on the partial interval length of at least one of the current node and said preceding node and said following node directly connected to the end side of the current node, compares the calculated interval with said specified position, moves the current node to the preceding node when the specified position precedes the calculated interval, moves the current node to the following node when the specified position follows the calculated interval, and identifies that the specified position is included in the current node when the specified position is included in the calculated interval.
-
Specification