Consensus sequence identification
First Claim
1. A method comprising:
- receiving historical information of episodes, each episode including at least one sequence of events taken over a period of time;
constructing event sets from the historical information, each of the event sets including at least one sequence of events;
categorizing each event from the historical information with general event category labels and synthetic event category labels, a subset of the synthetic event category labels being broader than two or more of the general event category labels, at least a subset of the events having at least one general event category label and at least one synthetic event category label;
learning an event metric on the events by using the general event category labels and the synthetic event category labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors, each general event category label and each synthetic event category label being assigned a separate dimension;
determining an event set metric using distances between each pair of event sets using the event metric;
deriving a sequence metric on the episodes to compute the distances between episodes, wherein the sequence metric obtaining a preferred match between two episodes with respect to a cost function describing a weighting for the event set metric;
deriving a subsequence metric on the episodes to compute the distances between episodes, wherein the subsequence metric is a function of the event set metric on subsequences of each episode;
grouping episodes into subgroups based on the distances obtained using the sequence metric and the subsequence metric;
for at least one subgroup, generating a consensus sequence by finding a preferred sequence of events with respect to a function of the sequence metric and the subsequence metric between the preferred sequence and the episodes of the at least one subgroup; and
generating a report indicating the consensus sequence.
6 Assignments
0 Petitions
Accused Products
Abstract
An example method comprises receiving historical information of episodes, constructing event sets from the historical information, categorizing each event with general labels and synthetic labels, learning an event metric on the events by using the general and synthetic labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors, determining an event set metric using distances between each pair of event sets, deriving a sequence metric on the episodes, the sequence metric obtaining a preferred match between two episodes, deriving a subsequence metric on the episodes, the subsequence metric is a function of the event set metric on subsequences of each episode, grouping episodes into subgroups based on distances, for at least one subgroup, generating a consensus sequence by finding a preferred sequence of events, and the episodes of the subgroup, and generating a report indicating the consensus sequence.
17 Citations
23 Claims
-
1. A method comprising:
-
receiving historical information of episodes, each episode including at least one sequence of events taken over a period of time; constructing event sets from the historical information, each of the event sets including at least one sequence of events; categorizing each event from the historical information with general event category labels and synthetic event category labels, a subset of the synthetic event category labels being broader than two or more of the general event category labels, at least a subset of the events having at least one general event category label and at least one synthetic event category label; learning an event metric on the events by using the general event category labels and the synthetic event category labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors, each general event category label and each synthetic event category label being assigned a separate dimension; determining an event set metric using distances between each pair of event sets using the event metric; deriving a sequence metric on the episodes to compute the distances between episodes, wherein the sequence metric obtaining a preferred match between two episodes with respect to a cost function describing a weighting for the event set metric; deriving a subsequence metric on the episodes to compute the distances between episodes, wherein the subsequence metric is a function of the event set metric on subsequences of each episode; grouping episodes into subgroups based on the distances obtained using the sequence metric and the subsequence metric; for at least one subgroup, generating a consensus sequence by finding a preferred sequence of events with respect to a function of the sequence metric and the subsequence metric between the preferred sequence and the episodes of the at least one subgroup; and generating a report indicating the consensus sequence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system comprising:
-
memory; at least one processor; an event set construction module configured to control the at least one processor to receive historical information of episodes, each episode including at least one sequence of events taken over a period of time, and to construct event sets from the historical information, each of the event sets including at least one sequence of events; a categorization module configured to control the at least one processor to categorize each event from the historical information with general event category labels and synthetic event category labels, a subset of the synthetic event category labels being broader than two or more of the general event category labels, at least a subset of the events having at least one general event category label and at least one synthetic event category label; a metric construction module configured to control the at least one processor to learn an event metric on the events by using the general event category labels and the synthetic event category labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors, each general event category label and each synthetic event category label being assigned a separate dimension; a distance module configured to control the at least one processor to determine an event set metric using distances between each pair of event sets using the event metric; an episode metric assembly module configured to control the at least one processor to derive a sequence metric on the episodes to compute distances between episodes, the sequence metric obtaining a preferred match between two episodes with respect to a cost function describing a weighting for the event set metric, and to derive a subsequence metric on the episodes to compute the distances between episodes, wherein the subsequence metric is a function of the event set metric on subsequences of each episode; an autogroup module configured to control the at least one processor to group episodes into subgroups based on the distances obtained using the sequence metric and the subsequence metric; and a consensus module configured to, for at least one subgroup, control the at least one processor to generate a consensus sequence by finding a preferred sequence of events with respect to a function of the sequence metric and the subsequence metric between the preferred sequence and the episodes of the at least one subgroup and to generate a report indicating the consensus sequence. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A non-transitory computer readable medium comprising executable instructions that are executable by a processor to perform a method, the method comprising:
-
receiving historical information of episodes, each episode including at least one sequence of events taken over a period of time; constructing event sets from the historical information, each of the event sets including at least one sequence of events; categorizing each event from the historical information with general event category labels and synthetic event category labels, a subset of the synthetic event category labels being broader than two or more of the general event category labels, at least a subset of the events having at least one general event category label and at least one synthetic event category label; learning an event metric on the events by using the general event category labels and the synthetic event category labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors, each general event category label and each synthetic event category label being assigned a separate dimension; determining an event set metric using distances between each pair of event sets using the event metric; deriving a sequence metric on the episodes to compute distances between episodes, the sequence metric obtaining a preferred match between two episodes with respect to a cost function describing a weighting for the event set metric; deriving a subsequence metric on the episodes to compute the distances between episodes, wherein the subsequence metric is a function of the event set metric on subsequences of each episode; grouping episodes into subgroups based on the distances obtained using the sequence metric and the subsequence metric; for at least one subgroup, generating a consensus sequence by finding a preferred sequence of events with respect to a function of the sequence metric and the subsequence metric between the preferred sequence and the episodes of the at least one subgroup; and generating a report indicating the consensus sequence.
-
Specification