Author Question: Which is not a method to determine an optimal stationary policy? a.value iteration b.IP c.LP ... (Read 50 times)

moongchi

  • Hero Member
  • *****
  • Posts: 516
Which is not a method to determine an optimal stationary policy?
  a.value iteration
  b.IP
  c.LP
  d.policy iteration

Question 2

In an MDP the next period's state depends on
  a.all previous states and only the current decision chosen.
  b.only the current period's state and the current decision chosen.
  c.all previous states and all previous decisions.
  d.only the current period's state and all previous decisions chosen.



kjohnson

  • Sr. Member
  • ****
  • Posts: 330
Answer to Question 1

correct: b

Answer to Question 2

correct: b



Related Topics

Need homework help now?

Ask unlimited questions for free

Ask a Question
 

Did you know?

When intravenous medications are involved in adverse drug events, their harmful effects may occur more rapidly, and be more severe than errors with oral medications. This is due to the direct administration into the bloodstream.

Did you know?

The average human gut is home to perhaps 500 to 1,000 different species of bacteria.

Did you know?

Parkinson's disease is both chronic and progressive. This means that it persists over a long period of time and that its symptoms grow worse over time.

Did you know?

Signs and symptoms of a drug overdose include losing consciousness, fever or sweating, breathing problems, abnormal pulse, and changes in skin color.

Did you know?

The calories found in one piece of cherry cheesecake could light a 60-watt light bulb for 1.5 hours.

For a complete list of videos, visit our video library