×

Information providing device and non-transitory computer readable medium storing information providing program

  • US 9,939,791 B2
  • Filed: 03/07/2017
  • Issued: 04/10/2018
  • Est. Priority Date: 03/11/2016
  • Status: Active Grant
First Claim
Patent Images

1. An information providing device comprising:

  • an agent electronic control unit includinga state space construction unit that is configured to define a state of a vehicle by associating a plurality of types of vehicle data with one another, and construct a state space as a set of a plurality of states,an action space construction unit that is configured to define, as an action, data indicating contents of an operation of an in-vehicle component that is performed through a response, from a driver, to an operation proposal for the in-vehicle component, and construct an action space as a set of a plurality of actions,a reinforced learning unit that is configured to accumulate a history of the response, from the driver, to the operation proposal for the in-vehicle component, set a reward function as an index representing an appropriateness degree of the operation proposal for the in-vehicle component while using the accumulated history, and calculate a probability distribution of performance of each of the actions constructing the action space in each of the states constructing the state space, through reinforced learning based on the reward function,a dispersion degree computation unit that is configured to compute a dispersion degree of the probability distribution that is calculated by the reinforced learning unit, andan information providing unit that is configured to make a definitive operation proposal to fix a target action as a target of the operation proposal and output the target action when the dispersion degree of the probability distribution that is computed by the dispersion degree computation unit is smaller than a threshold, and make a trial-and-error operation proposal to select the target action as the target of the operation proposal from a plurality of candidates and output the target action when the dispersion degree of the probability distribution that is computed by the dispersion degree computation unit is equal to or larger than the threshold.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×