Database origami
First Claim
1. A database tool for hosting on a computer with data memory storage for databases and an execution unit for software programs, comprising:
a first database that includes connection logs and records of individual communication-network subscriber with a corresponding communication-network-subscriber address;
a second database of data elements representing communication-network-subscriber addresses under a user's scrutiny;
first means for identifying each unique communication-network-subscriber address in the first database;
second means for determining each of a plurality of communication-network-subscriber addresses in the second database of communication-network-subscriber addresses that are also included in the first database;
third means for parsing said communication-network-subscriber addresses into connected component clusters;
fourth means for computing a valence value for each communication-network-subscriber address;
fifth means for computing a "distance" between any two communication-network-subscriber addresses that have evidently communicated with each other;
sixth means for geometrically mapping said communication-network-subscriber addresses to points in a mapping space plane or volume relative to said distance computed by the fifth means; and
seventh means for plotting and displaying said mapping plane or space to a user.
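The third and fourth means (connected-component clustering and valence) can be illustrated with a short sketch. This is only one plausible reading of the claim, not the patented implementation: the function names, the `(caller, callee)` tuple format, and the sample numbers are all hypothetical, and self-calls are ignored as an assumption.

```python
from collections import defaultdict

def build_adjacency(calls):
    """Map each address to the set of distinct other addresses it connected with."""
    neighbors = defaultdict(set)
    for caller, callee in calls:
        if caller == callee:
            continue  # assumption: a self-call links no two distinct addresses
        neighbors[caller].add(callee)
        neighbors[callee].add(caller)
    return neighbors

def connected_clusters(neighbors):
    """Group addresses so every member can be chained to every other by calls."""
    seen = set()
    clusters = []
    for start in sorted(neighbors):
        if start in seen:
            continue
        seen.add(start)
        stack = [start]
        cluster = set()
        while stack:  # iterative depth-first walk over one component
            node = stack.pop()
            cluster.add(node)
            for other in neighbors[node]:
                if other not in seen:
                    seen.add(other)
                    stack.append(other)
        clusters.append(cluster)
    return clusters

def valence(neighbors):
    """Valence = number of distinct other addresses an address communicated with."""
    return {address: len(others) for address, others in neighbors.items()}

# Hypothetical sample records: (caller, callee)
calls = [
    ("555-0101", "555-0102"),
    ("555-0102", "555-0103"),
    ("555-0101", "555-0102"),  # a repeat call does not raise the valence
    ("555-0199", "555-0198"),
]
neighbors = build_adjacency(calls)
clusters = connected_clusters(neighbors)
valences = valence(neighbors)
```

Direction is dropped when building the adjacency sets because the claims treat an address as connected whether it appears as caller or callee.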
Abstract
A database tool comprises a computer-implemented method for extracting systematic information from one or more databases that apparently only comprise data noise or seemingly unrelated data items. Criminal and community relationships that exist amongst telephone and internet subscribers are extracted from large telephone databases derived from wire taps and/or long distance telephone records. A telephone records file is used that comprises a caller's telephone number, dialed telephone numbers, and the time. A second database comprises a list of telephone numbers which are suspicious for some reason, and a descriptor as to why each such telephone number is suspicious. A third database includes biographical data about the telephone subscribers, such as name, address, and other facts. The unique telephone numbers in the database are identified. Matches between the first and second databases are made. Related components are grouped into clusters. The valence for each telephone number is computed. The relational distances between each pair of telephone numbers in a cluster are determined. The telephone numbers are represented as points in the x,y-plane of a display, with the distance between the points representing the strength of the relationship based on call frequency and other criteria. An interactive interface is provided for the user to click on items to see the background information associated with each point.
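The abstract does not give a formula for the relational distance or a procedure for placing points in the x,y-plane. The sketch below is therefore an illustration under stated assumptions: distance is taken as the reciprocal of the call count between two numbers (more calls, shorter distance), and points are placed by a simple iterative relaxation that nudges each communicating pair toward its target distance. The function names, parameters, and sample numbers are hypothetical.

```python
import math
import random
from collections import Counter

def pair_distances(calls):
    """Assumed rule: distance = 1 / (number of calls between the two numbers)."""
    counts = Counter(frozenset(pair) for pair in calls if pair[0] != pair[1])
    return {tuple(sorted(pair)): 1.0 / n for pair, n in counts.items()}

def layout(distances, steps=2000, rate=0.05, seed=0):
    """Place each number in the plane so communicating pairs approach their target distance."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    points = {a: [rng.random(), rng.random()]
              for pair in sorted(distances) for a in pair}
    for _ in range(steps):
        for (a, b), target in distances.items():
            dx = points[b][0] - points[a][0]
            dy = points[b][1] - points[a][1]
            current = math.hypot(dx, dy) or 1e-9
            # Move both endpoints along their connecting line to shrink the error.
            shift = rate * (current - target) / current
            points[a][0] += shift * dx
            points[a][1] += shift * dy
            points[b][0] -= shift * dx
            points[b][1] -= shift * dy
    return points

def separation(points, a, b):
    """Euclidean distance between two plotted numbers."""
    return math.hypot(points[a][0] - points[b][0], points[a][1] - points[b][1])

# Hypothetical sample: A and B spoke five times, B and C once.
calls = [("555-0101", "555-0102")] * 5 + [("555-0102", "555-0103")]
distances = pair_distances(calls)
points = layout(distances)
```

With this data the frequently connected pair ends up plotted closer together than the pair with a single call, which is the visual effect the abstract describes. Only pairs that actually communicated constrain the layout, matching the claim language about addresses that "have evidently communicated with each other".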
11 Claims
1. A database tool for hosting on a computer with data memory storage for databases and an execution unit for software programs, comprising:
a first database that includes connection logs and records of individual communication-network subscriber with a corresponding communication-network-subscriber address;
a second database of data elements representing communication-network-subscriber addresses under a user's scrutiny;
first means for identifying each unique communication-network-subscriber address in the first database;
second means for determining each of a plurality of communication-network-subscriber addresses in the second database of communication-network-subscriber addresses that are also included in the first database;
third means for parsing said communication-network-subscriber addresses into connected component clusters;
fourth means for computing a valence value for each communication-network-subscriber address;
fifth means for computing a "distance" between any two communication-network-subscriber addresses that have evidently communicated with each other;
sixth means for geometrically mapping said communication-network-subscriber addresses to points in a mapping space plane or volume relative to said distance computed by the fifth means; and
seventh means for plotting and displaying said mapping plane or space to a user.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A database tool, comprising:
a host computer with data memory storage for databases and an execution unit for software programs;
a first database that includes connection logs and records of individual communication-network-subscriber addresses that includes at least one of telephone company toll data and dialed-number-recorder records, sales register transaction records, credit card transaction records, internet packet routing data, e-mail routing information, caller-ID data captures, and cellular telephone cell-switching and call-routing information;
a second database of data elements representing communication-network-subscriber addresses under a user's scrutiny and that includes at least one of telephone company toll data and dialed-number-recorder records, sales register transaction records, credit card transaction records, internet packet routing data, e-mail routing information, caller-ID data captures, and cellular telephone cell-switching and call-routing information;
first means for identifying each unique communication-network-subscriber address in the first database comprises a computer-implemented software program method hosted on a computer that generates two files of unique sets of telephone numbers contained in the first database, a first such file includes said unique telephone numbers sorted in numerical order with a frequency count of the number of calls in which that phone number was either the caller or callee, and a second such file with the same information but the data is sorted by the frequency count;
second means for determining each of a plurality of communication-network-subscriber addresses in the second database of communication-network-subscriber addresses that are also included in the first database comprises a computer-implemented software program method hosted on a computer that compares telephone numbers in the first database with those in the second database, and that outputs a file of the telephone numbers common to both the first and second databases;
third means for parsing said communication-network-subscriber addresses into connected-component clusters comprises a computer-implemented software program method hosted on a computer that finds communication connection associations that exist amongst a plurality of telephone numbers recorded in the first database, wherein each telephone number that was connected to another telephone number at least once is isolated into a single connected-component cluster in which every member of the connected-component cluster can be chained to all of the others by calling-telephone number, called-telephone number, or both;
fourth means for computing a valence value for each communication-network-subscriber address comprises a computer-implemented software program method hosted on a computer that determines the total number of other connection addresses with which a particular connection address communicates, either as a caller or a callee, and that represents the outcome of such determination with a valence value;
fifth means for computing a "distance" between any two communication-network-subscriber addresses that have evidently communicated with each other comprises a computer-implemented software program method hosted on a computer that assigns imaginary relatively scaled distances between points, which represent individual communication addresses in an imaginary plane or space, that are related to the number of times each such communication-network-subscriber addresses have had a communication connection recorded with another represented communication-network-subscriber address according to communication connection information included in the first database;
sixth means for geometrically mapping said communication-network-subscriber addresses includes a host computer with a display monitor or printer for representing a plurality of points, which each represent a single communication-network-subscriber address, in a mapping space plane or volume relative to said distance computed by the fifth means;
seventh means for plotting and displaying said mapping plane or space to a user;
a third database that includes biographical information about persons or products associated with particular connection addresses in the second database and connected to provide such biographical information to the host computer and for selectively informing said user of previously determined data about each said person or product associated with said particular connection address; and
an interactive interface included in the host computer providing for a user to manipulate each of the first and second databases, and the first through seventh means, and to extract useful information about relationships that exist between communication-network-subscriber addresses.
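The first and second means recited in claim 11 (two frequency-count listings of unique telephone numbers, and the set of numbers common to both databases) can be sketched as follows. This is a hedged illustration, not the claimed program: the function names, the in-memory lists standing in for the output files, the descending order of the frequency listing, and the sample numbers are all assumptions.

```python
from collections import Counter

def unique_number_reports(calls):
    """Return (numbers in numeric order, numbers ordered by call frequency).

    Each entry is (number, count), where count is the number of calls in
    which that number was either the caller or the callee.
    """
    counts = Counter()
    for caller, callee in calls:
        # A set ensures a self-call still counts as one call for that number.
        for number in {caller, callee}:
            counts[number] += 1
    # Assumption: numbers share one fixed-width format, so string order
    # matches numerical order.
    by_number = sorted(counts.items())
    # Assumption: most frequent first; ties are broken by number.
    by_frequency = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return by_number, by_frequency

def numbers_under_scrutiny(calls, watch_list):
    """Numbers that appear both in the call records and in the second database."""
    seen = {number for pair in calls for number in pair}
    return sorted(seen & set(watch_list))

# Hypothetical sample records and watch list
calls = [
    ("555-0101", "555-0102"),
    ("555-0102", "555-0103"),
    ("555-0101", "555-0102"),
]
by_number, by_frequency = unique_number_reports(calls)
matches = numbers_under_scrutiny(calls, ["555-0103", "555-0777"])
```

In a file-based setting each of the three results would be written out as its own file, as the claim describes; returning lists keeps the sketch self-contained.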
Specification