Method for probabilistic analysis of most frequently occurring electronic message addresses within personal store (.PST) files to determine owner with confidence factor based on relative weight and set of user-specified factors
First Claim
1. A method of determining a probable owner of a personal store file in a computer, the method comprising:
- receiving a set of user-specified factors including a first number of most frequently occurring addresses and a second number of sent-then-received electronic messages to be analyzed for said first number of most frequently occurring addresses;
determining the first number of most frequently occurring addresses among the second number of sent-then-received electronic messages within said personal store file;
applying a weight to each of said first number of determined most frequently occurring addresses based on a number of occurrences of each of said first number of determined most frequently occurring addresses and one or more user-specified factors of the set of user-specified factors;
determining a relative weight of each of said first number of determined most frequently occurring addresses;
determining if any of said first number of determined most frequently occurring addresses are duplicate addresses;
summing the relative weights of duplicate addresses to create a combined relative weight and setting the relative weight of one of the duplicate address to the combined relative weight;
returning a top weighted address of said first number of determined most frequently occurring addresses as said probable owner of said personal store file with a confidence factor derived from the relative weight and the set of user-specified factors.
2 Assignments
0 Petitions
Accused Products
Abstract
A probabilistic process to determine the owner of an electronic file, such as a Personal Store (.pst). A weighted analysis of multiple factors is performed including the operating system file owner, a user running the process, a “top Y most frequently occurring addresses” when analyzing “X number of sent then received messages,” and a number of occurrences of each “top Y most frequently occurring address.” Other factors, such as the ability to resolve against a directory service may be used. Each of the “top Y most frequently occurring addresses” is analyzed to calculate its weight according to a predetermined relationship and the address is compared to the operating system file owner and a logged-on user value. If there is a match, that value is returned as the probable owner of the file.
22 Citations
16 Claims
-
1. A method of determining a probable owner of a personal store file in a computer, the method comprising:
-
receiving a set of user-specified factors including a first number of most frequently occurring addresses and a second number of sent-then-received electronic messages to be analyzed for said first number of most frequently occurring addresses; determining the first number of most frequently occurring addresses among the second number of sent-then-received electronic messages within said personal store file; applying a weight to each of said first number of determined most frequently occurring addresses based on a number of occurrences of each of said first number of determined most frequently occurring addresses and one or more user-specified factors of the set of user-specified factors; determining a relative weight of each of said first number of determined most frequently occurring addresses; determining if any of said first number of determined most frequently occurring addresses are duplicate addresses; summing the relative weights of duplicate addresses to create a combined relative weight and setting the relative weight of one of the duplicate address to the combined relative weight; returning a top weighted address of said first number of determined most frequently occurring addresses as said probable owner of said personal store file with a confidence factor derived from the relative weight and the set of user-specified factors. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of determining an owner of an electronic file in a computer, the method comprising:
-
receiving a set of one or more user-specified factors associated with said electronic file, wherein said user-specified factors include a first number of most frequently occurring addresses among a second number of sent-then-received electronic messages within said electronic file to be analyzed for said first number of most frequently occurring addresses; determining the first number of most frequently occurring addresses among the second number of sent-then-received electronic messages within said personal store file; applying weights to said user-specified factors; defining a first factor that is to be compared to others of said user-specified factors, wherein the first factor is defined as a number of occurrences of each of said first number of determined most frequently occurring addresses; determining a relative weight for each of said first number of determined most frequently occurring addresses; determining if any of said first number of determined most frequently occurring addresses are duplicate addresses; summing the relative weights of duplicate addresses to create a combined relative weight and setting the relative weight of one of the duplicate address to the combined relative weight; and returning a top weighted address of said first number of determined most frequently occurring addresses as said owner of said electronic file with a confidence factor derived from the relative weight and the set of user-specified factors. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method of determining ownership of an electronic file containing e-mail messages in a computer, the method comprising:
-
receiving a set of user-specified factors including a first number of most frequently occurring addresses and a second number of sent-then-received e-mail messages to be analyzed for said first number of most frequently occurring addresses; determining a the first number of most frequently occurring addresses among the-second number of sent-then-received e-mail messages within said electronic file, applying a weight to each of said first number of determined most frequently occurring addresses based on a number of occurrences of each of said first number of determined most frequently occurring addresses and one or more user-specified factors of the set of user-specified factors; determining a relative weight of each of said first number of determined most frequently occurring addresses; determining if any of said first number of determined most frequently occurring addresses are duplicate addresses; summing the relative weights of duplicate addresses to create a combined relative weight and setting the relative weight of one of the duplicate address to the combined relative weight; and determining said ownership of said electronic file in accordance with the relative weight of each of said first number of determined most frequently occurring addresses with a confidence factor derived from the relative weight and the set of user-specified factors. - View Dependent Claims (14, 15, 16)
-
Specification