Consensus sequence identification
First Claim
1. A method comprising:
receiving historical information of past episodes, each past episode including at least one sequence of events taken over a period of time;
constructing event sets from the historical information, each of the event sets including at least one sequence of events;
categorizing each event from the historical information with general event category labels and synthetic event category labels, at least one of the synthetic event category labels being broader than the at least one general event category label, the at least one of the synthetic event category labels categorizing events which are also categorized by the at least one general event category label and categorizing at least one event not categorized by the at least one general event category label;
learning an event metric on the events by using the general event category labels and the synthetic event category labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors, each general event category label and each synthetic event category label being assigned a separate dimension;
determining an event set metric using distances between each pair of event sets using the event metric;
deriving a sequence metric on the episodes to compute distances between episodes, the sequence metric obtaining a preferred match between two episodes with respect to a cost function describing a weighting for the event set metric;
deriving a subsequence metric on the episodes to compute the distances between episodes, wherein the subsequence metric is a function of the event set metric on subsequences of each episode;
grouping episodes into subgroups based on distances obtained using the sequence metric and the subsequence metric;
for at least one subgroup, generating a consensus sequence by finding a preferred sequence of events with respect to a function of the sequence metric and the subsequence metric between the preferred sequence and the episodes of the at least one subgroup; and
generating a report indicating the consensus sequence.
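The categorizing and event-metric steps of the claim can be illustrated with a minimal sketch. It assigns each label its own dimension, builds an indicator vector per event, and measures the angle between two vectors. The event names, the label names (`nsaid`, `imaging`, `analgesic`), and the helper functions are hypothetical and chosen only for illustration. The dimensionality reduction step recited in the claim is deliberately left out here, so this is a simplified stand-in rather than the claimed method.

```python
import math

# Hypothetical labels, invented for illustration; each label gets its own dimension.
GENERAL = {"nsaid": {"aspirin", "ibuprofen"}, "imaging": {"xray"}}
# The synthetic label is broader than "nsaid": it covers the same events
# plus one event ("acetaminophen") that "nsaid" does not categorize.
SYNTHETIC = {"analgesic": {"aspirin", "ibuprofen", "acetaminophen"}}
MEMBERS = {**GENERAL, **SYNTHETIC}
LABELS = sorted(MEMBERS)  # fixed dimension order: analgesic, imaging, nsaid


def event_vector(event):
    """Indicator vector with one coordinate per label (no reduction applied)."""
    return [1.0 if event in MEMBERS[label] else 0.0 for label in LABELS]


def event_angle(a, b):
    """Angle in radians between the label vectors of two events."""
    va, vb = event_vector(a), event_vector(b)
    dot = sum(x * y for x, y in zip(va, vb))
    norm = math.sqrt(sum(x * x for x in va) * sum(y * y for y in vb))
    if norm == 0.0:
        # Assumption: an unlabeled event is treated as orthogonal to everything.
        return math.pi / 2
    cosine = max(-1.0, min(1.0, dot / norm))
    return math.acos(cosine)
```

In this sketch, two events sharing every label sit at angle zero, while the broader synthetic label keeps `acetaminophen` closer to `aspirin` than an unrelated event such as `xray` would be.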
Abstract
An example method comprises: receiving historical information of episodes; constructing event sets from the historical information; categorizing each event with general labels and synthetic labels; learning an event metric on the events by using the general and synthetic labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors; determining an event set metric using distances between each pair of event sets; deriving a sequence metric on the episodes, the sequence metric obtaining a preferred match between two episodes; deriving a subsequence metric on the episodes, the subsequence metric being a function of the event set metric on subsequences of each episode; grouping episodes into subgroups based on distances; for at least one subgroup, generating a consensus sequence by finding a preferred sequence of events with respect to a function of the sequence metric and the subsequence metric between the preferred sequence and the episodes of the subgroup; and generating a report indicating the consensus sequence.
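The three distance layers named in the abstract (event set metric, sequence metric, subsequence metric) can be sketched as below. Everything here is an assumption made for illustration: the 0/1 `event_distance` stands in for the learned event metric, the averaged nearest-neighbour rule is one possible event set metric, the sequence metric is written as an edit-distance-style alignment whose `match_weight` and `gap_cost` play the role of the cost function, and the subsequence metric compares fixed-length windows. The source does not specify any of these choices.

```python
def event_distance(a, b):
    # Stand-in for the learned event metric (assumption: 0 if equal, else 1).
    return 0.0 if a == b else 1.0


def event_set_distance(A, B):
    """Symmetric average of nearest-event distances between two event sets."""
    if not A and not B:
        return 0.0
    if not A or not B:
        return 1.0
    ab = sum(min(event_distance(a, b) for b in B) for a in A) / len(A)
    ba = sum(min(event_distance(a, b) for a in A) for b in B) / len(B)
    return (ab + ba) / 2


def sequence_distance(x, y, match_weight=1.0, gap_cost=1.0):
    """Cheapest alignment of two episodes (lists of event sets)."""
    n, m = len(x), len(y)
    D = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        D[i][0] = i * gap_cost
    for j in range(1, m + 1):
        D[0][j] = j * gap_cost
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            D[i][j] = min(
                D[i - 1][j - 1] + match_weight * event_set_distance(x[i - 1], y[j - 1]),
                D[i - 1][j] + gap_cost,   # skip an event set of x
                D[i][j - 1] + gap_cost,   # skip an event set of y
            )
    return D[n][m]


def subsequence_distance(x, y, k=2):
    """Average best-match cost between length-k windows of two episodes."""
    k = min(k, len(x), len(y))
    if k == 0:
        return 0.0 if len(x) == len(y) else 1.0

    def windows(ep):
        return [ep[i:i + k] for i in range(len(ep) - k + 1)]

    def window_cost(u, v):
        return sum(event_set_distance(a, b) for a, b in zip(u, v)) / k

    wx, wy = windows(x), windows(y)
    xy = sum(min(window_cost(u, v) for v in wy) for u in wx) / len(wx)
    yx = sum(min(window_cost(u, v) for u in wx) for v in wy) / len(wy)
    return (xy + yx) / 2


# Usage: episodes are lists of event sets.
x = [{"a"}, {"b"}, {"c"}]
y = [{"a"}, {"c"}]
print(sequence_distance(x, y), subsequence_distance(x, y))
```

The alignment form was chosen because it makes the "preferred match" explicit: the returned value is the cost of the cheapest way to pair up or skip event sets under the given weights.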
22 Claims
1. A method comprising:
receiving historical information of past episodes, each past episode including at least one sequence of events taken over a period of time;
constructing event sets from the historical information, each of the event sets including at least one sequence of events;
categorizing each event from the historical information with general event category labels and synthetic event category labels, at least one of the synthetic event category labels being broader than the at least one general event category label, the at least one of the synthetic event category labels categorizing events which are also categorized by the at least one general event category label and categorizing at least one event not categorized by the at least one general event category label;
learning an event metric on the events by using the general event category labels and the synthetic event category labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors, each general event category label and each synthetic event category label being assigned a separate dimension;
determining an event set metric using distances between each pair of event sets using the event metric;
deriving a sequence metric on the episodes to compute distances between episodes, the sequence metric obtaining a preferred match between two episodes with respect to a cost function describing a weighting for the event set metric;
deriving a subsequence metric on the episodes to compute the distances between episodes, wherein the subsequence metric is a function of the event set metric on subsequences of each episode;
grouping episodes into subgroups based on distances obtained using the sequence metric and the subsequence metric;
for at least one subgroup, generating a consensus sequence by finding a preferred sequence of events with respect to a function of the sequence metric and the subsequence metric between the preferred sequence and the episodes of the at least one subgroup; and
generating a report indicating the consensus sequence.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A non-transitory computer readable medium comprising executable instructions that are executable by a processor to perform a method, the method comprising:
receiving historical information of past episodes, each past episode including at least one sequence of events taken over a period of time;
constructing event sets from the historical information, each of the event sets including at least one sequence of events;
categorizing each event from the historical information with general event category labels and synthetic event category labels, at least one of the synthetic event category labels being broader than the at least one general event category label, the at least one of the synthetic event category labels categorizing events which are also categorized by the at least one general event category label and categorizing at least one event not categorized by the at least one general event category label;
learning an event metric on the events by using the general event category labels and the synthetic event category labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors, each general event category label and each synthetic event category label being assigned a separate dimension;
determining an event set metric using distances between each pair of event sets using the event metric;
deriving a sequence metric on the episodes to compute distances between episodes, the sequence metric obtaining a preferred match between two episodes with respect to a cost function describing a weighting for the event set metric;
deriving a subsequence metric on the episodes to compute the distances between episodes, wherein the subsequence metric is a function of the event set metric on subsequences of each episode;
grouping episodes into subgroups based on distances obtained using the sequence metric and the subsequence metric;
for at least one subgroup, generating a consensus sequence by finding a preferred sequence of events with respect to a function of the sequence metric and the subsequence metric between the preferred sequence and the episodes of the at least one subgroup; and
generating a report indicating the consensus sequence.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21
22. A system comprising:
at least one processor;
memory comprising instructions to configure at least one of the at least one processor to:
receive historical information of past episodes, each past episode including at least one sequence of events taken over a period of time;
construct event sets from the historical information, each of the event sets including at least one sequence of events;
categorize each event from the historical information with general event category labels and synthetic event category labels, at least one of the synthetic event category labels being broader than the at least one general event category label, the at least one of the synthetic event category labels categorizing events which are also categorized by the at least one general event category label and categorizing at least one event not categorized by the at least one general event category label;
learn an event metric on the events by using the general event category labels and the synthetic event category labels to perform dimensionality reduction to associate a vector with each event and to determine an angle between every two vectors, each general event category label and each synthetic event category label being assigned a separate dimension;
determine an event set metric using distances between each pair of event sets using the event metric;
derive a sequence metric on the episodes to compute distances between episodes, the sequence metric obtaining a preferred match between two episodes with respect to a cost function describing a weighting for the event set metric;
derive a subsequence metric on the episodes to compute the distances between episodes, wherein the subsequence metric is a function of the event set metric on subsequences of each episode;
group episodes into subgroups based on distances obtained using the sequence metric and the subsequence metric;
for at least one subgroup, generate a consensus sequence by finding a preferred sequence of events with respect to a function of the sequence metric and the subsequence metric between the preferred sequence and the episodes of the at least one subgroup; and
generate a report indicating the consensus sequence.
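The grouping and consensus steps recited in the independent claims can be sketched under some loud assumptions. Grouping is done here by linking any two episodes whose distance is at most a hypothetical `threshold`, and the consensus is taken to be the medoid, that is, the member episode with the smallest total distance to the rest of its subgroup. The claims do not say the preferred sequence must be one of the episodes, so the medoid is a simplification. The `edit_distance` helper is a plain stand-in; in practice `dist` would be some function combining the sequence metric and the subsequence metric.

```python
def edit_distance(x, y):
    """Stand-in episode distance: unit-cost edit distance over event tuples."""
    prev = list(range(len(y) + 1))
    for i, a in enumerate(x, 1):
        cur = [i]
        for j, b in enumerate(y, 1):
            cur.append(min(prev[j - 1] + (a != b), prev[j] + 1, cur[j - 1] + 1))
        prev = cur
    return prev[-1]


def group_episodes(episodes, dist, threshold):
    """Link episodes within `threshold` of each other; return index groups."""
    groups = []
    unassigned = list(range(len(episodes)))
    while unassigned:
        frontier = [unassigned.pop(0)]
        group = []
        while frontier:
            i = frontier.pop()
            group.append(i)
            near = [j for j in unassigned
                    if dist(episodes[i], episodes[j]) <= threshold]
            for j in near:
                unassigned.remove(j)
            frontier.extend(near)
        groups.append(sorted(group))
    return groups


def consensus(episodes, group, dist):
    """Medoid of a subgroup: the member minimizing total distance to the others."""
    return min(
        (episodes[i] for i in group),
        key=lambda c: sum(dist(c, episodes[j]) for j in group),
    )


# Usage with hypothetical episodes (tuples of event names).
episodes = [
    ("a", "b", "c"),
    ("a", "c"),
    ("a", "b", "c", "d"),
    ("x", "y", "z"),
    ("x", "y"),
]
groups = group_episodes(episodes, edit_distance, threshold=1)
report = {tuple(g): consensus(episodes, g, edit_distance) for g in groups}
print(report)
```

A report in this sketch is simply a mapping from each subgroup to its consensus episode; any richer report format would be a further design choice outside what the claims specify.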
Specification