GENERATING APPARATUS, GENERATION METHOD, INFORMATION PROCESSING METHOD AND PROGRAM
1 Assignment
0 Petitions
Accused Products
Abstract
A generating apparatus generates a set of gain vectors with respect to a transition model having observable visible states and unobservable hidden states and expressing a transition from a present visible state to a subsequent visible state according to an action, the set of gain vectors being generated for each visible state and used for calculation of a cumulative expected gain at and after a reference point in time, the apparatus including a setting section for setting, with respect to each hidden state, a probability distribution over the hidden states for selection used to select vectors to be included in the set of gain vectors from the gain vectors including a component for a cumulative gain, and a selection section for including, in the set of gain vectors, with priority, the gain vector giving the maximum of the cumulative expected gain with respect to the probability distribution for selection.
-
Citations
16 Claims
-
1-14. -14. (canceled)
-
15. An apparatus arranged to generate a set of gain vectors with respect to a transition model having observable visible states and unobservable hidden states and expressing a transition from a present visible state to a subsequent visible state according to an action, the set of gain vectors being generated for each visible state and used for calculation of a cumulative expected gain at and after a reference point in time, the apparatus comprising:
-
a setting section for setting, with respect to each hidden state, a probability distribution over the hidden states for selection used to select vectors to be included in the set of gain vectors from the gain vectors including a component for a cumulative gain; and a selection section for including, in the set of gain vectors, with priority, the gain vector giving the maximum of the cumulative expected gain with respect to the probability distribution for selection.
-
-
16. A program product for causing a computer to function as a generating apparatus arranged to generate a set of gain vectors with respect to a transition model having observable visible states and unobservable hidden states and expressing a transition from a present visible state to a subsequent visible state according to an action, the set of gain vectors being generated for each visible state and used for calculation of a cumulative expected gain at and after a reference point in time, the program product being executed to cause the computer to function as:
-
a setting section for setting, with respect to each hidden state, a probability distribution over the hidden states for selection used to select vectors to be included in the set of gain vectors from the gain vectors including a component for a cumulative gain; and a selection section for including, in the set of gain vectors, with priority, the gain vector giving the maximum of the cumulative expected gain with respect to the probability distribution for selection.
-
Specification