Author Question: Which is not a method to determine an optimal stationary policy? a.value iteration b.IP c.LP ... (Read 10 times)

moongchi

  • Hero Member
  • *****
  • Posts: 516
Which is not a method to determine an optimal stationary policy?
  a.value iteration
  b.IP
  c.LP
  d.policy iteration

Question 2

In an MDP the next period's state depends on
  a.all previous states and only the current decision chosen.
  b.only the current period's state and the current decision chosen.
  c.all previous states and all previous decisions.
  d.only the current period's state and all previous decisions chosen.



kjohnson

  • Sr. Member
  • ****
  • Posts: 330
Answer to Question 1

correct: b

Answer to Question 2

correct: b



Related Topics

Need homework help now?

Ask unlimited questions for free

Ask a Question
 

Did you know?

Green tea is able to stop the scent of garlic or onion from causing bad breath.

Did you know?

More than 30% of American adults, and about 12% of children utilize health care approaches that were developed outside of conventional medicine.

Did you know?

Disorders that may affect pharmacodynamics include genetic mutations, malnutrition, thyrotoxicosis, myasthenia gravis, Parkinson's disease, and certain forms of insulin-resistant diabetes mellitus.

Did you know?

Certain chemicals, after ingestion, can be converted by the body into cyanide. Most of these chemicals have been removed from the market, but some old nail polish remover, solvents, and plastics manufacturing solutions can contain these substances.

Did you know?

An identified risk factor for osteoporosis is the intake of excessive amounts of vitamin A. Dietary intake of approximately double the recommended daily amount of vitamin A, by women, has been shown to reduce bone mineral density and increase the chances for hip fractures compared with women who consumed the recommended daily amount (or less) of vitamin A.

For a complete list of videos, visit our video library