Distance measure for probability distribution function of mixture type
First Claim
Patent Images
1. A method executed in a computer of computing a distance measure between first mixture type probability distribution functions,
-
( x ) = ∑ i = 1 N μ i g i ( x ) , pertaining to a set data collected from a first source, and a second mixture type probability distribution function pertaining to another set of collected data, the improvement characterized by;
said distance measure being where d(gi, hk) is a function of the distance between component gi of the first probability distribution function and component hk of the second probability distribution function where ω
ik≧
0 for 1≦
i≦
N, and for 1 ≦
k≦
K, and andmaking a determination, based on said computed overall distance as to whether said another set of collected data pertains to said source.
1 Assignment
0 Petitions
Accused Products
Abstract
In accordance with our invention, for two mixture-type probability distribution functions (PDF'"'"'s), G, H,
where G is a mixture of N component PDF'"'"'s gi (x), H is a mixture of K component PDF'"'"'s hk (x), μi and γk are corresponding weights that satisfy
we define their distance, DM(G, H), as
where d(gI, hk is the element distance between component PDF'"'"'s gi and hk and w satisfie
ωik≧0, 1≦i≦N, 1≦k≦K;
and
The application of this definition of distance to various sets of real world data is demonstrated.
20 Citations
18 Claims
-
1. A method executed in a computer of computing a distance measure between first mixture type probability distribution functions,
-
( x ) = ∑ i = 1 N μ i g i ( x ) , pertaining to a set data collected from a first source, and a second mixture type probability distribution function pertaining to another set of collected data, the improvement characterized by; said distance measure being where d(gi, hk) is a function of the distance between component gi of the first probability distribution function and component hk of the second probability distribution function where ω
ik≧
0 for 1≦
i≦
N, and for 1 ≦
k≦
K, andand making a determination, based on said computed overall distance as to whether said another set of collected data pertains to said source. - View Dependent Claims (2, 3, 4)
-
-
5. A computer program embedded in a storage medium for computing a distance measure between first and second mixture type probability distribution functions,
-
( x ) = ∑ i = 1 N μ i g i ( x ) , pertaining to a set data collected from a first source, and pertaining to another set of collected data, the improvement comprising a software module of said computer program that evaluates said distance measure in accordance with equation;
where d(gi, hk) is a function of distance between a component, gi, of the first probability distribution function and a component, hk, of the second probability distribution function where ω
ik≧
0, 1≦
i≦
N, 1≦
k≦
K,there exists some value of i for which ω
ik>
0 for at least two values of k, andand making a determination, based on said computed overall distance as to whether said another set of collected data pertains to said source. - View Dependent Claims (6, 7, 8)
-
-
9. A computer system for computing a distance measure between first and second mixture type probability distribution functions,
-
( x ) = ∑ i = 1 N μ i g i ( x ) , and H ( x ) = ∑ k = 1 K γ k h k ( x ) , pertaining to audio data comprising; memory for storing said audio data; a processing module for deriving one of said mixture type probability distribution functions from said audio data; and a processing module for evaluating said distance measure in accordance with where d(gi, hk) is a function of the distance between a component, gi, of the first probability distribution function and a component, hk, of the second probability distribution function, where and ω
ik≧
0, 1≦
i≦
N, 1≦
k≦
K,and there exists some value of i for which ω
ik>
0 for at least two values of k, and- View Dependent Claims (10, 11, 12)
-
-
13. A method executed in a computer for computing a distance measure between a mixture type probability distribution function
-
( x ) = ∑ i = 1 N μ i g i ( x ) , pertaining to a set data collected from a first source, where μ
, is a weight imposed on component gi(x), and a mixture type probability distribution functionpertaining to another set of collected data, where γ
k is a weight imposed on component hk comprising the steps of;computing an element distance, d(gi, hk), between each gi and each hk where 1≦
i≦
N,1≦
k≦
K,computing an overall distance, denoted by DM(G, H), between the mixture probability distribution function G, and the mixture probability distribution function H, based on a weighted sum of the all element distances, wherein weights ω
i,k imposed on the element distances d(gi, hk), are chosen so that the overall distance DM(G, H) is minimized, subject toω
ik>
0 for at least two values of k for each value of i,and making a determination, based on said computed overall distance as to whether said another set of collected data pertains to said source. - View Dependent Claims (14, 15, 16)
-
-
17. A method executed in a computer for content-based searching of stored data that pertains to a physical attribute of a system comprising the steps of:
-
acquiring a collection of physical attributes data; and transforming said collection of physical attributes data into a signal tat is outputted by said computer;
where said transforming is effected by;identifying collections in said stored data; developing a probability distribution function for each of said identified collections; developing a probability distribution function for the acquired collection; developing a distance measure between the developed probability distribution function of said acquired collection and developed probability distribution functions for said identified collections; applying a threshold to the developed distance measure to discover those of said identified segments with a distance measure below said preselected threshold value, where said distance is directly computed according to a measure that guarantees to satisfy the non-negativeness, symmetry, and triangular inequality properties of a distance measure; and developing said output signal based on step of applying where said distance measure between a first probability function, and a second probability function, is where d(gi, hk) is a function of the distance between a component, gi, of the first probability distribution function and a component hk, of the second probability distribution function, ω
ik≧
0, 1≦
i≦
N, 1≦
k≦
K,and there exists some value of i for which ω
ik>
0 for at least two values of k.
-
-
18. A method executed in a computer comprising the steps of:
-
identifying speaker segments in provided audio-visual data based on speech contained in said data; developing a probability distribution function for each of said segments from data points within each of said segments; and developing distance measures among said probability distribution functions, where each of said measures is obtained through a one-pass evaluation of a function that guarantees to satisfy the non-negativeness, symmetry, and triangular inequality properties of a distance measure where said distance measure between a first probability function,
and a second probability function,
is
whered(gi, hk) is a function of the distance between a component, gi, of the first probability distribution function and a component, hk, of the second probability distribution function, ω
ik≧
0, 1≦
i≦
N, 1≦
k≦
K,and there exists some value of i for which ω
ik>
0 for at least two values of k.
-
Specification