System and method for multiple instance learning for computer aided detection

US 7,986,827 B2
Filed: 02/06/2007
Issued: 07/26/2011
Est. Priority Date: 02/07/2006
Status: Active Grant

Abstract

A method of training a classifier for computer aided detection of digitized medical images includes providing a plurality of bags, each bag containing a plurality of feature samples of a single region-of-interest in a medical image, where each region-of-interest has been labeled as either malignant or healthy. The training uses candidates that are spatially adjacent to each other, modeled by a “bag”, rather than each candidate by itself. A classifier is trained on the plurality of bags of feature samples, subject to the constraint that at least one point in a convex hull of each bag, corresponding to a feature sample, is correctly classified according to the label of the associated region-of-interest. This constraint replaces a large set of discrete constraints under which at least one instance in each bag has to be correctly classified.
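
The convex-hull constraint in the abstract can be illustrated with a small sketch. It is not taken from the specification: the function names, the toy bag, and the decision rule `label * (x'ω − η) > 0` are assumptions chosen to match the sign convention of the claimed constraint ξ = d − (λBω − eη).

```python
def representative_point(bag, lam):
    """Convex combination of the rows of `bag` (one feature sample per row)."""
    if any(w < 0 for w in lam) or abs(sum(lam) - 1.0) > 1e-9:
        raise ValueError("lam must be non-negative and sum to 1")
    n_features = len(bag[0])
    return [sum(w * row[k] for w, row in zip(lam, bag)) for k in range(n_features)]


def bag_is_correct(bag, lam, omega, eta, label):
    """True if the representative point lies on the side of the hyperplane given by `label`.

    Assumed decision rule: sign(x'omega - eta), with label in {+1, -1}.
    """
    point = representative_point(bag, lam)
    score = sum(w * x for w, x in zip(omega, point)) - eta
    return label * score > 0


# Toy bag with three feature samples in two dimensions (hypothetical data).
bag = [[0.0, 0.0], [2.0, 2.0], [4.0, 0.0]]
lam = [0.25, 0.5, 0.25]          # convex-hull coefficients for this bag
omega, eta = [1.0, 1.0], 1.0     # hypothetical hyperplane

print(representative_point(bag, lam))
print(bag_is_correct(bag, lam, omega, eta, +1))
```

Only one point per bag, the one selected by `lam`, has to be classified correctly, which is what lets a continuous set of coefficients stand in for a per-instance combinatorial choice.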

19 Claims

1. A computer-implemented method of training a classifier for computer aided detection of digitized medical images, comprising the steps of:
- providing a plurality of bags, each bag containing a plurality of feature samples of a single region-of-interest in said medical image, wherein said feature samples include texture, shape, intensity, and contrast of said region-of-interest, wherein each region-of-interest has been labeled as either malignant or healthy; and
- training said classifier on said plurality of bags of feature samples, subject to the constraint that at least one point in a convex hull of each bag, corresponding to said feature sample, is correctly classified according to the label of the associated region-of-interest, wherein said classifier is trained on a computer, and wherein said classifier is trained by minimizing the expression $\nu E(\xi) + \Phi(\omega, \eta) + \Psi(\lambda)$ over arguments $(\xi, \omega, \eta, \lambda) \in \mathbb{R}^{r+n+1+\gamma}$ subject to the conditions

  $$\xi^i = d^i - (\lambda_j^i B_j^i \omega - e\eta),$$
  $$\xi \in \Omega,$$
  $$e'\lambda_j^i = 1,$$
  $$0 \leq \lambda_j^i,$$

  wherein $\xi = \{\xi_1, \ldots, \xi_r\}$ are slack terms, $E: \mathbb{R}^r \to \mathbb{R}$ represents a loss function, $\omega$ is a hyperplane coefficient, $\eta$ is the bias term, $\lambda$ is a vector containing the coefficients of the convex combination that defines the representative point of bag $i$ in class $j$ wherein $0 \leq \lambda_j^i$, $e'\lambda_j^i = 1$, $\gamma$ is the total number of convex hull coefficients corresponding to the representative points in class $j$, $\Phi: \mathbb{R}^{(n+1)} \to \mathbb{R}$ is a regularization function on the hyperplane coefficients, $\Psi$ is a regularization function on the convex combination coefficients $\lambda_j^i$, $\Omega$ represents a feasible set for $\xi$, matrix $B_j^i \in \mathbb{R}^{m_j^i \times n}$, $i = 1, \ldots, r_j$, $j \in \{\pm 1\}$ is the $i$-th bag of class label $j$, $r$ is the total number of representative points, $n$ is the number of features, $m_j^i$ is the number of rows in $B$, vector $d \in \{\pm 1\}^{r_j}$ represents binary bag-labels for the malignant and healthy sets, respectively, and the vector $e$ represents a vector with all its elements equal to one.
2. The method of claim 1, wherein $E(\xi) = \|(\xi)_+\|_2^2$, $\Phi(\omega, \eta) = \|(\omega, \eta)\|_2^2$ and $\Omega = \mathbb{R}^{r_+}$, wherein $\xi_+$ and $r_+$ are respectively slack variables and points labeled by $+1$.
3. The method of claim 1, wherein $E(\xi) = \|(\xi)\|_2^2$, $\Phi(\omega, \eta) = \|(\omega, \eta)\|_2^2$ and $\Omega = \mathbb{R}^r$.
4. The method of claim 1, wherein $\nu = 1$, $E(\xi) = \|\xi\|_2^2$ and $\Omega = \{\xi : e'\xi_j = 0,\ j \in \{\pm 1\}\}$.
5. The method of claim 4, further comprising replacing $\xi^i$ by $d^i - (\lambda_j^i B_j^i \omega - e\eta)$ in the objective function, replacing equality constraints $e'\xi_j = 0$ by $\omega'(\mu_+ - \mu_-) = 2$, wherein said classifier is trained by minimizing the expression $\omega^T S_W \omega + \Phi(\omega) + \Psi(\lambda)$ with respect to the arguments $(\omega, \lambda) \in \mathbb{R}^{n+\gamma}$ subject to the conditions

  $$\omega^T(\mu_+ - \mu_-) = b,$$
  $$e'\lambda_j^i = 1,$$
  $$0 \leq \lambda_j^i,$$
6. The method of claim 5, further comprising:
- initializing
7. The method of claim 6, further comprising setting convex-hull coefficients of negative bags to be 1.
8. The method of claim 6, further comprising transforming said feature samples into a higher dimensional space using a kernel transformation $K(X_+, X)$ for the positive class and $K(X_-, X)$ for the negative class, wherein $X_+$, $X_-$, and $X$ are data matrices for positive, negative and all samples respectively, wherein each row is a sample vector in these matrices, wherein if the size of $X$ is too large, subsampling a random subset from said original feature samples.
9. The method of claim 5, wherein $\Phi(\omega) = \epsilon\|\omega\|_2^2$ and $\Psi(\lambda) = \epsilon\|\lambda\|_2^2$, wherein $\epsilon$ is a positive regularization parameter.
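
The step from claim 4 to claim 5, where the constraints $e'\xi_j = 0$ are replaced by $\omega'(\mu_+ - \mu_-) = 2$, can be checked with a short derivation. This is a sketch under stated assumptions rather than the specification's own argument: $x_j^i$ is shorthand introduced here for the representative point $(\lambda_j^i B_j^i)'$ of bag $i$ in class $j$, every bag of class $j$ is taken to have label $d^i = j$, and $\mu_j$ is assumed to be the mean of the $r_j$ representative points of class $j$.

```latex
% Assumed shorthand (not in the claims): x_j^i := (\lambda_j^i B_j^i)', d^i = j,
% and \mu_j := (1/r_j) \sum_i x_j^i is the class mean of the representative points.
\begin{align*}
\xi_j^i &= j - \left( (x_j^i)'\omega - \eta \right), \\
0 = e'\xi_j &= \sum_{i=1}^{r_j} \xi_j^i
   = r_j\,(j + \eta) - \omega' \sum_{i=1}^{r_j} x_j^i
   \;\Longrightarrow\; \omega'\mu_j - \eta = j, \\
\omega'(\mu_+ - \mu_-) &= (+1 + \eta) - (-1 + \eta) = 2, \\
\xi_j^i &= -\,\omega'\,(x_j^i - \mu_j)
   \;\Longrightarrow\;
   \|\xi\|_2^2 = \omega' \Big[ \sum_{j \in \{\pm 1\}} \sum_{i=1}^{r_j}
   (x_j^i - \mu_j)(x_j^i - \mu_j)' \Big] \omega .
\end{align*}
```

Under these assumptions the squared slack norm becomes a quadratic form in a within-class scatter of the representative points, which is consistent with the $\omega^T S_W \omega$ term of claim 5 up to whatever per-class normalization the specification adopts for $S_W$.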

10. A method of training a classifier for computer aided detection of digitized medical images, comprising the steps of:
- providing a plurality of bags, each bag containing a plurality of feature samples of a single region-of-interest in said medical image, wherein each region-of-interest has been labeled as either malignant or healthy, wherein each bag is represented by a matrix $B_j^i \in \mathbb{R}^{m_j^i \times n}$, $i = 1, \ldots, r_j$, $j \in \{\pm 1\}$ is the $i$-th bag of class label $j$, $r$ is the total number of representative points, $n$ is the number of features, $m_j^i$ is the number of rows in $B$; and
- training said classifier by minimizing the expression $\|\xi\|_2^2 + \Phi(\omega, \eta) + \Psi(\lambda)$ over arguments $(\xi, \omega, \eta, \lambda) \in \mathbb{R}^{r+n+1+\gamma}$ subject to the conditions

  $$\xi^i = d^i - (\lambda_j^i B_j^i \omega - e\eta),$$
  $$e'\xi_j = 0,$$
  $$e'\lambda_j^i = 1,$$
  $$0 \leq \lambda_j^i,$$

  wherein $\xi = \{\xi_1, \ldots, \xi_r\}$ are slack terms, $\omega$ is a hyperplane coefficient, $\eta$ is the bias offset from the origin term, $\lambda$ is a vector containing the coefficients of the convex combination that defines the representative point of bag $i$ in class $j$ wherein $0 \leq \lambda_j^i$, $e'\lambda_j^i = 1$, $\gamma$ is the total number of convex hull coefficients corresponding to the representative points in class $j$, $\Phi: \mathbb{R}^{(n+1)} \to \mathbb{R}$ is a regularization function on the hyperplane coefficients, $\Psi$ is a regularization function on the convex combination coefficients $\lambda_j^i$, matrix $B_j^i \in \mathbb{R}^{m_j^i \times n}$, $i = 1, \ldots, r_j$, $j \in \{\pm 1\}$ is the $i$-th bag of class label $j$, $r$ is the total number of representative points, $n$ is the number of features, $m_j^i$ is the number of rows in $B$, vector $d \in \{\pm 1\}^{r_j}$ represents binary bag-labels for the malignant and healthy sets, respectively, and the vector $e$ represents a vector with all its elements equal to one.
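
To make the quantities in claim 10 concrete, the following sketch evaluates the slack terms and the objective for fixed variables. It does not solve the optimization problem. Claim 10 leaves Φ and Ψ unspecified, so the squared-norm forms of claims 3 and 9 are assumed here, and the names `slack`, `objective`, and `eps`, along with the toy data, are hypothetical.

```python
def slack(bag, lam, omega, eta, label):
    """xi^i = d^i - (lambda^i B^i omega - eta) for a single bag."""
    point = [sum(w * row[k] for w, row in zip(lam, bag)) for k in range(len(omega))]
    return label - (sum(p * w for p, w in zip(point, omega)) - eta)


def objective(bags, lams, labels, omega, eta, eps=0.1):
    """||xi||^2 + Phi(omega, eta) + Psi(lambda), with assumed Phi and Psi."""
    xi = [slack(B, l, omega, eta, d) for B, l, d in zip(bags, lams, labels)]
    loss = sum(s * s for s in xi)
    phi = sum(w * w for w in omega) + eta * eta      # assumed: ||(omega, eta)||^2
    psi = eps * sum(c * c for l in lams for c in l)  # assumed: eps * ||lambda||^2
    return loss + phi + psi, xi


# One positive bag with two samples, one negative bag with one sample.
bags = [[[2.0, 0.0], [0.0, 2.0]], [[-1.0, -1.0]]]
lams = [[0.5, 0.5], [1.0]]
labels = [+1, -1]
total, xi = objective(bags, lams, labels, omega=[0.5, 0.5], eta=0.0)
print(total, xi)
```

In this toy setting each class has a single bag, so the constraint e′ξ_j = 0 holds exactly when both slacks are zero, which is the case for the hyperplane chosen above.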

11. A program storage device readable by a computer, tangibly embodying a non-transitory program of instructions executable by the computer to perform the method steps for training a classifier for computer aided detection of digitized medical images, said method comprising the steps of:
- providing a plurality of bags, each bag containing a plurality of feature samples of a single region-of-interest in said medical image, wherein said feature samples include texture, shape, intensity, and contrast of said region-of-interest, wherein each region-of-interest has been labeled as either malignant or healthy; and
- training said classifier on said plurality of bags of feature samples, subject to the constraint that at least one point in a convex hull of each bag, corresponding to said feature sample, is correctly classified according to the label of the associated region-of-interest, wherein said classifier is trained by minimizing the expression $\nu E(\xi) + \Phi(\omega, \eta) + \Psi(\lambda)$ over arguments $(\xi, \omega, \eta, \lambda) \in \mathbb{R}^{r+n+1+\gamma}$ subject to the conditions

  $$\xi^i = d^i - (\lambda_j^i B_j^i \omega - e\eta),$$
  $$\xi \in \Omega,$$
  $$e'\lambda_j^i = 1,$$
  $$0 \leq \lambda_j^i,$$

  wherein $\xi = \{\xi_1, \ldots, \xi_r\}$ are slack terms, $E: \mathbb{R}^r \to \mathbb{R}$ represents a loss function, $\omega$ is a hyperplane coefficient, $\eta$ is the bias term, $\lambda$ is a vector containing the coefficients of the convex combination that defines the representative point of bag $i$ in class $j$ wherein $0 \leq \lambda_j^i$, $e'\lambda_j^i = 1$, $\gamma$ is the total number of convex hull coefficients corresponding to the representative points in class $j$, $\Phi: \mathbb{R}^{(n+1)} \to \mathbb{R}$ is a regularization function on the hyperplane coefficients, $\Psi$ is a regularization function on the convex combination coefficients $\lambda_j^i$, $\Omega$ represents a feasible set for $\xi$, matrix $B_j^i \in \mathbb{R}^{m_j^i \times n}$, $i = 1, \ldots, r_j$, $j \in \{\pm 1\}$ is the $i$-th bag of class label $j$, $r$ is the total number of representative points, $n$ is the number of features, $m_j^i$ is the number of rows in $B$, vector $d \in \{\pm 1\}^{r_j}$ represents binary bag-labels for the malignant and healthy sets, respectively, and the vector $e$ represents a vector with all its elements equal to one.
12. The computer readable program storage device of claim 11, wherein $E(\xi) = \|(\xi)_+\|_2^2$, $\Phi(\omega, \eta) = \|(\omega, \eta)\|_2^2$ and $\Omega = \mathbb{R}^{r_+}$, wherein $\xi_+$ and $r_+$ are respectively slack variables and points labeled by $+1$.
13. The computer readable program storage device of claim 11, wherein $E(\xi) = \|(\xi)\|_2^2$, $\Phi(\omega, \eta) = \|(\omega, \eta)\|_2^2$ and $\Omega = \mathbb{R}^r$.
14. The computer readable program storage device of claim 11, wherein $\nu = 1$, $E(\xi) = \|\xi\|_2^2$ and $\Omega = \{\xi : e'\xi_j = 0,\ j \in \{\pm 1\}\}$.
15. The computer readable program storage device of claim 14, the method further comprising replacing $\xi^i$ by $d^i - (\lambda_j^i B_j^i \omega - e\eta)$ in the objective function, replacing equality constraints $e'\xi_j = 0$ by $\omega'(\mu_+ - \mu_-) = 2$, wherein said classifier is trained by minimizing the expression $\omega^T S_W \omega + \Phi(\omega) + \Psi(\lambda)$ with respect to the arguments $(\omega, \lambda) \in \mathbb{R}^{n+\gamma}$
16. The computer readable program storage device of claim 15, the method further comprising:
- initializing
17. The computer readable program storage device of claim 16, the method further comprising setting convex-hull coefficients of negative bags to be 1.
18. The computer readable program storage device of claim 16, the method further comprising transforming said feature samples into a higher dimensional space using a kernel transformation $K(X_+, X)$ for the positive class and $K(X_-, X)$ for the negative class, wherein $X_+$, $X_-$, and $X$ are data matrices for positive, negative and all samples respectively, wherein each row is a sample vector in these matrices, wherein if the size of $X$ is too large, subsampling a random subset from said original feature samples.
19. The computer readable program storage device of claim 15, wherein $\Phi(\omega) = \epsilon\|\omega\|_2^2$ and $\Psi(\lambda) = \epsilon\|\lambda\|_2^2$, wherein $\epsilon$ is a positive regularization parameter.
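
Claims 8 and 18 describe mapping the feature samples through kernel blocks K(X₊, X) and K(X₋, X), with random subsampling when X is too large. A minimal sketch of that step follows. The claims do not name a kernel or a size limit, so the Gaussian RBF kernel, the `width` parameter, the `max_rows` threshold, and the fixed seed are all assumptions made for illustration.

```python
import math
import random


def rbf_kernel(A, X, width=0.5):
    """K[i][j] = exp(-width * ||A[i] - X[j]||^2). The RBF form is an assumption."""
    return [[math.exp(-width * sum((a - x) ** 2 for a, x in zip(row_a, row_x)))
             for row_x in X] for row_a in A]


def kernel_features(X_pos, X_neg, max_rows=1000, seed=0):
    """Return (K(X_pos, X), K(X_neg, X)), where X stacks all samples row-wise.

    If X has more than `max_rows` rows (hypothetical threshold), a random
    subset of rows is used as the reference set instead.
    """
    X = X_pos + X_neg
    if len(X) > max_rows:
        X = random.Random(seed).sample(X, max_rows)
    return rbf_kernel(X_pos, X), rbf_kernel(X_neg, X)


X_pos = [[0.0, 0.0], [1.0, 0.0]]   # hypothetical positive samples
X_neg = [[0.0, 1.0]]               # hypothetical negative sample
K_pos, K_neg = kernel_features(X_pos, X_neg)
print(len(K_pos), len(K_pos[0]), len(K_neg), len(K_neg[0]))
```

Each row of `K_pos` or `K_neg` then plays the role of a transformed feature sample, so the number of columns, and hence the problem size, is controlled by the size of the reference set X.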

Current Assignee: Siemens Healthcare GMBH (Siemens AG)
Original Assignee: Siemens Medical Solutions USA Incorporated (Siemens AG)
Inventors: Krishnapuram, Balaji; Fung, Glenn; Rao, R. Bharat; Dundar, Murat
Primary Examiner(s): Mehta, Bhavesh M
Assistant Examiner(s): Thomas, Mia M
Application Number: US11/671,777
Publication Number: US 20070189602A1
Time in Patent Office: 1,631 Days
Field of Search: 382/128-131, 382/155-161, 382/224-229, 706/20
US Class Current: 382/159
CPC Class Codes:
G06F 18/2415   based on parametric or prob...
G06T 2207/30004   Biomedical image processing
G06T 7/0012   Biomedical image inspection
G06V 10/764   using classification, e.g. ...
