Author Question: Which is not a method to determine an optimal stationary policy? a.value iteration b.IP c.LP ... (Read 52 times)

moongchi

  • Hero Member
  • *****
  • Posts: 516
Which is not a method to determine an optimal stationary policy?
  a.value iteration
  b.IP
  c.LP
  d.policy iteration

Question 2

In an MDP the next period's state depends on
  a.all previous states and only the current decision chosen.
  b.only the current period's state and the current decision chosen.
  c.all previous states and all previous decisions.
  d.only the current period's state and all previous decisions chosen.



kjohnson

  • Sr. Member
  • ****
  • Posts: 330
Answer to Question 1

correct: b

Answer to Question 2

correct: b



Related Topics

Need homework help now?

Ask unlimited questions for free

Ask a Question
 

Did you know?

During pregnancy, a woman is more likely to experience bleeding gums and nosebleeds caused by hormonal changes that increase blood flow to the mouth and nose.

Did you know?

To prove that stomach ulcers were caused by bacteria and not by stress, a researcher consumed an entire laboratory beaker full of bacterial culture. After this, he did indeed develop stomach ulcers, and won the Nobel Prize for his discovery.

Did you know?

Only 12 hours after an egg cell is fertilized by a sperm cell, the egg cell starts to divide. As it continues to divide, it moves along the fallopian tube toward the uterus at about 1 inch per day.

Did you know?

The average older adult in the United States takes five prescription drugs per day. Half of these drugs contain a sedative. Alcohol should therefore be avoided by most senior citizens because of the dangerous interactions between alcohol and sedatives.

Did you know?

Serum cholesterol testing in adults is recommended every 1 to 5 years. People with diabetes and a family history of high cholesterol should be tested even more frequently.

For a complete list of videos, visit our video library