SEMI-SUPERVISED RANDOM DECISION FORESTS FOR MACHINE LEARNING

US 20130346346A1
Filed: 06/21/2012
Published: 12/26/2013
Est. Priority Date: 06/21/2012
Status: Active Grant

First Claim

Patent Images

1. A machine learning process comprising:

accessing, using a processor, a plurality of labeled observations each labeled observation having a label indicating one of a plurality of classes that the labeled observation is a member of;

accessing a plurality of unlabeled observations which are unlabeled in that, for each unlabeled observation, it is not known to which one of the plurality of classes the unlabeled observation belongs;

training a plurality of random decision trees to form a semi-supervised random decision forest using both the labeled observations and the unlabeled observations such that each random decision tree partitions the labeled and the unlabeled observations into clusters according to similarity of the observations and according to the labels.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Semi-supervised random decision forests for machine learning are described, for example, for interactive image segmentation, medical image analysis, and many other applications. In examples, a random decision forest comprising a plurality of hierarchical data structures is trained using both unlabeled and labeled observations. In examples, a training objective is used which seeks to cluster the observations based on the labels and similarity of the observations. In an example, a transducer assigns labels to the unlabeled observations on the basis of the clusters and certainty information. In an example, an inducer forms a generic clustering function by counting examples of class labels at leaves of the trees in the forest. In an example, an active learning module identifies regions in a feature space from which the observations are drawn using the clusters and certainty information; new observations from the identified regions are used to train the random decision forest.

Citations

20 Claims

1. A machine learning process comprising:
- accessing, using a processor, a plurality of labeled observations each labeled observation having a label indicating one of a plurality of classes that the labeled observation is a member of;
  
  accessing a plurality of unlabeled observations which are unlabeled in that, for each unlabeled observation, it is not known to which one of the plurality of classes the unlabeled observation belongs;
  
  training a plurality of random decision trees to form a semi-supervised random decision forest using both the labeled observations and the unlabeled observations such that each random decision tree partitions the labeled and the unlabeled observations into clusters according to similarity of the observations and according to the labels.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
- - 2. A method as claimed in claim 1 where each random decision tree is a hierarchical tree data structure comprising split nodes and leaf nodes and having a test associated with each split node.
  - 3. A method as claimed in claim 1 wherein training the random decision trees comprises using a training objective which applies to unlabeled observations as well as to labeled observations.
  - 4. A method as claimed in claim 1 wherein training the random decision trees comprises optimizing an information gain which comprises an unsupervised term and a supervised term.
  - 5. A method as claimed in claim 1 wherein training the random decision trees comprises optimizing an information gain which comprises a supervised term and an unsupervised term, the supervised term comprising an entropy of the labeled observations that reach a specified split node minus the sum of an entropy of the labeled observations that reach each child node of the specified split node weighted by a ratio of the number of labeled observations that reach each of the child nodes of the specified split node to the number of labeled observations that reach the specified split node.
  - 6. A method as claimed in claim 5 comprising calculating the entropy as the negative of the sum over possible ground truth label values of an empirical probability distribution extracted from the training points times the log of the empirical probability distribution.
  - 7. A method as claimed in claim 1 comprising assigning labels and certainties of those labels to the unlabeled observations by giving unlabeled observations the labels of close labeled observations where close labeled observations are selected on the basis of the clusters.
  - 8. A method as claimed in claim 7 comprising selecting the close labeled observations in a manner which optimizes geodesic distances between each unlabeled point and the labeled point it takes its label from.
  - 9. A method as claimed in claim 8 wherein the geodesic distances are calculated on the basis of the clusters and using a distance measure based on correlations between variables which gauges similarity of an unknown sample set to a known one.
  - 10. A method as claimed in claim 8 wherein the geodesic distances are calculated on the basis of the clusters and using a distance measure which discourages geodesic paths from cutting across regions of low data density.
  - 11. A method as claimed in claim 1 comprising assigning labels and certainties of those labels to the unlabeled observations by giving unlabeled observations the labels of close labeled observations where close labeled observations are selected on the basis of the clusters;
    - generating a clustering function by counting the labels for each class in each cluster.
  - 12. A method as claimed in claim 1 comprising:
    - assigning labels and certainties of those labels to the unlabeled observations by giving unlabeled observations the labels of close labeled observations where close labeled observations are selected on the basis of the clusters;
      
      receiving an unseen, unlabeled observation and passing the observation through each of the trees in the forest to select a leaf of each tree;
      
      aggregating the clusters of each of the selected leaves;
      
      obtaining a label for the observation and a certainty of the label from the aggregated clusters.
  - 13. A method as claimed in claim 1 comprising assigning labels and certainties of those labels to the unlabeled observations by giving unlabeled observations the labels of close labeled observations where close labeled observations are selected on the basis of the clusters;
    - and using the assigned labels and certainties to identify regions in a feature space from which the observations are drawn.
  - 14. A method as claimed in claim 13 comprising obtaining new observations in the identified regions and training the random decision forest using the new observations.

15. A machine learning process comprising:
- accessing, using a processor, a plurality of labeled observations each labeled observation having a label indicating one of a plurality of classes that the labeled observation is a member of;
  
  accessing a plurality of unlabeled observations which are unlabeled in that, for each unlabeled observation, it is not known to which one of the plurality of classes the unlabeled observation belongs;
  
  training a plurality of random decision trees to form a semi-supervised random decision forest using both the labeled observations and the unlabeled observations and according to a training objective which optimizes an information gain comprising an unsupervised term and a supervised term.

16. A machine learning system comprising:
- an input arranged to receive a plurality of labeled observations each labeled observation having a label indicating one of a plurality of classes that the labeled observation is a member of;
  
  the input also arranged to access a plurality of unlabeled observations which are unlabeled in that, for each unlabeled observation, it is not known to which one of the plurality of classes the unlabeled observation belongs;
  
  a training engine arranged to train a plurality of random decision trees to form a semi-supervised random decision forest using both the labeled observations and the unlabeled observations such that each random decision tree partitions the labeled and the unlabeled observations into clusters according to similarity of the observations and according to the labels.
- View Dependent Claims (17, 18, 19, 20)
- - 17. A machine learning system as claimed in claim 16 comprising a transducer arranged to assign labels and certainties of those labels to the unlabeled observations by giving unlabeled observations the labels of close labeled observations where close labeled observations are selected on the basis of the clusters.
  - 18. A machine learning system as claimed in claim 16 wherein the training engine is at least partially implemented in hardware logic.
  - 19. A machine learning system as claimed in claim 17 comprising an inducer arranged to generate a clustering function by counting the labels for each class in each cluster.
  - 20. A machine learning system as claimed in claim 17 comprising an active learning module arranged to use the assigned labels and certainties to identify regions in a feature space from which the observations are drawn and to train the semi-supervised random decision forest using new observations drawn from the identified regions.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee
Microsoft Corporation
Inventors
Criminisi, Antonio, Shotton, Jamie Daniel Joseph

Granted Patent

US 9,519,868 B2
Time in Patent Office

Days
Field of Search
US Class Current

706/12
CPC Class Codes

G06N 20/00   Machine learning

G06N 20/20   Ensemble learning

G06N 5/02   Knowledge representation; S...

G06N 7/01   Probabilistic graphical mod...

SEMI-SUPERVISED RANDOM DECISION FORESTS FOR MACHINE LEARNING

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

20 Claims

Specification

Solutions

Use Cases

Quick Links

SEMI-SUPERVISED RANDOM DECISION FORESTS FOR MACHINE LEARNING

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

20 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links