3-brain architecture for an intelligent decision and control system

US 6,169,981 B1
Filed: 06/04/1997
Issued: 01/02/2001
Est. Priority Date: 06/04/1996
Status: Expired due to Fees

Abstract

A method and system for intelligent control of external devices using a mammalian brain-like structure having three parts. The method and system include a computer-implemented neural network system which is an extension of the model-based adaptive critic design and is applicable to real-time control (e.g., robotic control) and real-time distributed control. Additional uses include data visualization, data mining, and other tasks requiring complex analysis of inter-relationships between data.

18 Claims

1. A computer program product, comprising:
- a computer storage medium and a computer program code mechanism embedded in the computer storage medium for causing a computer to control an external device, the computer program code mechanism comprising:
  
  a first computer code device configured to initially establish a neural network including nodes, connections and connection weights;
  
  a second computer code device configured to train the neural network; and
  
  a third computer code device configured to use multi-level adaptive, dynamic programming to receive inputs and produce outputs based on the trained neural network, wherein each level, i, in the multi-level adaptive, dynamic programming, updates (1) a value function J⁽ⁱ⁾, and (2) a transition matrix, M_j, applicable to said level, wherein J⁽ⁱ⁾ depends upon a corresponding value function J⁽ⁱ⁾ and transition matrix M_j of an adjacent level, wherein the transition matrix M_i depends on the corresponding transition matrix M_j, wherein increasing levels correspond to increasing periods of time, and wherein at least two levels are used, said outputs being calculated to maximize said value function J⁽ⁱ⁾ and control said external device.
  - 2. The computer program product as claimed in claim 1, wherein the third computer code device comprises a fourth computer code device configured to use clock-based multi-level adaptive, dynamic programming.
  - 3. The computer program product as claimed in claim 2, wherein the fourth computer code device comprises a fifth computer code device configured to use a one-step update for each level, i.
  - 4. The computer program product as claimed in claim 3, wherein the fifth computer code device comprises a sixth computer code device configured to calculate the value function J⁽ⁱ⁾ at each level, i, according to J⁽ⁱ⁾ = J⁽ⁱ⁻¹⁾ + M_{i−1} J⁽ⁱ⁻¹⁾.
  - 5. The computer program product as claimed in claim 4, wherein at each increasing level M_i corresponds to an increasing period of 2ⁱ.
  - 6. The computer program product as claimed in claim 3, wherein at each increasing level M_i corresponds to an increasing period of 2ⁱ.
  - 7. The computer program product as claimed in claim 3, wherein the fifth computer code device comprises a sixth computer code device configured to calculate an adjacent level j according to j = i − 1.
  - 8. The computer program product as claimed in claim 3, wherein the fifth computer code device comprises a sixth computer code device configured to calculate an adjacent level j according to j = i + 1.
  - 9. The computer program product as claimed in claim 2, wherein the fourth computer code device comprises a fifth computer code device configured to use a k-step update for each level, i.
  - 10. The computer program product as claimed in claim 9, wherein the fifth computer code device comprises a sixth computer code device configured to calculate the value function J_n,k according to:
  - 11. The computer program product as claimed in claim 2, wherein the fourth computer code device comprises a fifth computer code device configured to calculate a true value function J^π according to:
  - 12. The computer program product as claimed in claim 1, wherein the third computer code device comprises a fourth computer code device configured to use event-based multi-level adaptive, dynamic programming.
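
The recurrence in claim 4 can be illustrated numerically. The following is a minimal Python sketch, not the patent's implementation. It assumes a fixed policy, a discounted transition matrix M that acts on value vectors as M·J, a base level J⁽⁰⁾ = U for a utility vector U, and the squaring rule M_i = M_{i−1}·M_{i−1}. The squaring rule is an assumption chosen so that level i spans 2ⁱ time steps, as claims 5 and 6 describe. The helper names `multilevel_value` and `direct_sum` are hypothetical.

```python
def mat_vec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def mat_mul(a, b):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def multilevel_value(m, u, levels):
    """Apply J(i) = J(i-1) + M_(i-1) J(i-1) for the given number of levels.

    Assumptions (not stated in the claims shown): J(0) = u, M_0 = m,
    and M_i = M_(i-1) squared, so level i covers 2**i time steps.
    """
    j = list(u)
    m_i = [row[:] for row in m]
    for _ in range(levels):
        j = [x + y for x, y in zip(j, mat_vec(m_i, j))]  # one-step level update
        m_i = mat_mul(m_i, m_i)                          # double the period
    return j


def direct_sum(m, u, horizon):
    """Reference computation: sum of M**t u for t = 0 .. horizon - 1."""
    total = [0.0] * len(u)
    term = list(u)
    for _ in range(horizon):
        total = [a + b for a, b in zip(total, term)]
        term = mat_vec(m, term)
    return total


if __name__ == "__main__":
    # Hypothetical two-state chain with a discount factor of 0.9 folded into M.
    M = [[0.45, 0.45], [0.0, 0.9]]
    U = [1.0, 0.5]
    print(multilevel_value(M, U, 3))
    print(direct_sum(M, U, 8))
```

Under these assumptions, three level updates reproduce the first 2³ = 8 terms of the series U + MU + M²U + ..., which a step-by-step evaluation would need eight sequential passes to accumulate.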

13. A computer program product, comprising:
- a computer storage medium and a computer program code mechanism embedded in the computer storage medium for causing a computer to control an external device under neural-network control, the computer program code mechanism comprising:
  
  a first computer code device configured to initially establish an interconnection including nodes, connections and connection weights;
  
  a second computer code device configured to train the interconnection; and
  
  a third computer code device configured to use two-level adaptive, event-based, dynamic programming to receive inputs and produce outputs based on the trained interconnection, wherein the third computer code device comprises a fourth computer code device configured to use a first transition matrix, M^A, representing transitions from within a block A of states of the external device, and a second transition matrix M^AB, representing transitions from the block A to a different block B of states of the external device, and wherein the fourth computer code device comprises a fifth computer code device configured to use a policy, π, to calculate a value function J^π|_A for the block A from a utility function, U^π, according to:
  - 14. The computer program product as claimed in claim 13, wherein the fifth computer code device comprises a sixth computer code device configured to use recurrence relations for J^A and J^AB according to:
  - 15. The computer program product as claimed in claim 14, further comprising a seventh computer code device configured to perform policy updates locally.
  - 16. The computer program product as claimed in claim 14, further comprising a seventh computer code device configured to perform policy updates globally.
  - 17. The computer program product according to claim 13, wherein the third computer code device comprises a fourth computer code device configured to use fuzzy partitions with weight arrays β_ij.
  - 18. The computer program product according to claim 17, wherein the fourth computer code device comprises a fifth computer code device configured to set the weight arrays β_ij equal to μ(j)/μ(i), where μ(j) and μ(i) represent degrees of membership of given states i and j, respectively, in a given decision block.
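
The weight rule in claim 18, β_ij = μ(j)/μ(i), can be sketched as a small Python function. This is an illustrative sketch only: the function name `fuzzy_weights`, the list-based representation of the membership degrees, and the requirement that every μ(i) be strictly positive (to avoid division by zero) are assumptions, and the claims shown here do not say how the resulting weights are applied.

```python
def fuzzy_weights(mu):
    """Return the weight array beta with beta[i][j] = mu[j] / mu[i].

    mu is a list of membership degrees of states 0..n-1 in one decision
    block. Requiring mu[i] > 0 is an assumption made for this sketch.
    """
    if any(m <= 0 for m in mu):
        raise ValueError("membership degrees must be strictly positive")
    n = len(mu)
    return [[mu[j] / mu[i] for j in range(n)] for i in range(n)]


if __name__ == "__main__":
    # Hypothetical membership degrees for three states in one block.
    beta = fuzzy_weights([1.0, 0.5, 0.25])
    for row in beta:
        print(row)
```

With this definition the diagonal entries β_ii equal 1, and β_ij and β_ji are reciprocals of each other.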

Current Assignee: IPU Power Management, LLC
Original Assignee: Paul J. Werbos
Inventors: Werbos, Paul J.
Primary Examiner(s): Hafiz, Tariq R.
Assistant Examiner(s): Rhodes, Jason W.
Application Number: US08/867,871
Time in Patent Office: 1,308 Days
Field of Search: 395/21, 395/22, 395/23, 706/15-17, 706/19, 706/23, 706/25
US Class Current: 706/23
CPC Class Codes:
G05B 13/0265   the criterion being a learn...
G05B 13/027   using neural networks only
G05B 13/0285   using neural networks and f...
G06N 3/105   Shells for specifying net l...
