Question: Which is not a method to determine an optimal stationary policy? a.value iteration b.IP c.LP ...

moongchi

Question 1

Which of the following is not a method for determining an optimal stationary policy?
  a. value iteration
  b. integer programming (IP)
  c. linear programming (LP)
  d. policy iteration

Question 2

In an MDP, the next period's state depends on
  a. all previous states and only the current decision chosen.
  b. only the current period's state and the current decision chosen.
  c. all previous states and all previous decisions.
  d. only the current period's state and all previous decisions chosen.



kjohnson

Answer to Question 1

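Correct: b. Value iteration, policy iteration, and linear programming (LP) are the standard methods for finding an optimal stationary policy; integer programming (IP) is not one of them.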
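
For anyone who wants to see two of the listed methods side by side, here is a minimal sketch of value iteration and policy iteration on a tiny discounted MDP. The two-state, two-action problem, its probabilities and rewards, the discount factor of 0.9, and all function names are made up for illustration; they are not from the textbook question. Policy evaluation is done by repeated substitution rather than by solving the linear system, purely to keep the sketch dependency-free.

```python
# Hypothetical 2-state, 2-action MDP; every number here is an assumption.
GAMMA = 0.9          # discount factor (assumed)
STATES = [0, 1]
ACTIONS = [0, 1]
# P[s][a][s2] = probability of moving from state s to s2 under action a
P = {0: {0: [0.9, 0.1], 1: [0.2, 0.8]},
     1: {0: [0.7, 0.3], 1: [0.1, 0.9]}}
# R[s][a] = expected one-period reward for taking action a in state s
R = {0: {0: 1.0, 1: 0.0},
     1: {0: 0.0, 1: 2.0}}


def q_value(V, s, a):
    """One-step lookahead: immediate reward plus discounted expected value."""
    return R[s][a] + GAMMA * sum(p * V[s2] for s2, p in enumerate(P[s][a]))


def greedy_policy(V):
    """Stationary policy that picks the best action in each state under V."""
    return [max(ACTIONS, key=lambda a: q_value(V, s, a)) for s in STATES]


def value_iteration(tol=1e-10):
    """Apply the Bellman optimality update until the values stop changing."""
    V = [0.0 for _ in STATES]
    while True:
        new_V = [max(q_value(V, s, a) for a in ACTIONS) for s in STATES]
        delta = max(abs(x - y) for x, y in zip(new_V, V))
        V = new_V
        if delta < tol:
            return greedy_policy(V), V


def evaluate_policy(policy, tol=1e-10):
    """Iteratively compute the value of following a fixed stationary policy."""
    V = [0.0 for _ in STATES]
    while True:
        new_V = [q_value(V, s, policy[s]) for s in STATES]
        delta = max(abs(x - y) for x, y in zip(new_V, V))
        V = new_V
        if delta < tol:
            return V


def policy_iteration():
    """Alternate evaluation and greedy improvement until the policy is stable."""
    policy = [0 for _ in STATES]
    while True:
        V = evaluate_policy(policy)
        improved = greedy_policy(V)
        if improved == policy:
            return policy, V
        policy = improved


if __name__ == "__main__":
    print("value iteration: ", value_iteration())
    print("policy iteration:", policy_iteration())
```

On this toy problem both routines should settle on the same stationary policy, which is the point of the comparison: they are different routes to the same answer.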

Answer to Question 2

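Correct: b. By the Markov property, the next period's state depends only on the current period's state and the decision chosen in that period, not on earlier states or decisions.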
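
The Markov property can be illustrated with a short simulation sketch. The transition table, the policy, and the function names below are hypothetical. The thing to notice is that `next_state` receives only the current state and the current action; the history list is kept for reporting and is never consulted when sampling.

```python
import random

# Hypothetical transition table: P[(state, action)] = next-state probabilities
P = {(0, 0): [0.9, 0.1], (0, 1): [0.2, 0.8],
     (1, 0): [0.7, 0.3], (1, 1): [0.1, 0.9]}


def next_state(state, action, rng):
    """Sample the next state from the current state and action only."""
    return rng.choices([0, 1], weights=P[(state, action)])[0]


def simulate(policy, start, periods, rng):
    """Follow a stationary policy; earlier states never enter the sampling."""
    history = [start]
    for _ in range(periods):
        s = history[-1]                      # only the current state is used
        history.append(next_state(s, policy[s], rng))
    return history


if __name__ == "__main__":
    rng = random.Random(0)                   # fixed seed for repeatability
    print(simulate(policy=[1, 1], start=0, periods=10, rng=rng))
```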


