Question: Which is not a method to determine an optimal stationary policy?

moongchi

Question 1

Which is not a method to determine an optimal stationary policy?
  a. value iteration
  b. IP (integer programming)
  c. LP (linear programming)
  d. policy iteration

Question 2

In an MDP the next period's state depends on
  a. all previous states and only the current decision chosen.
  b. only the current period's state and the current decision chosen.
  c. all previous states and all previous decisions.
  d. only the current period's state and all previous decisions chosen.



kjohnson

Answer to Question 1

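Correct: b (IP). Value iteration, policy iteration, and linear programming (LP) are the standard methods for computing an optimal stationary policy of an MDP; integer programming (IP) is not one of them.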
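
To make the answer concrete, here is a minimal sketch of two of the listed methods, value iteration and policy iteration, run on the same small problem. The two-state MDP, its rewards, the discount factor GAMMA, and all function names are made up for illustration and are not from the original question. The LP formulation is not shown.

```python
# Hypothetical 2-state, 2-action MDP, invented purely for illustration.
# P[s][a] is a list of (probability, next_state, reward) triples.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {0: [(1.0, 1, 2.0)], 1: [(1.0, 0, 0.0)]},
}
GAMMA = 0.9  # assumed discount factor


def q_value(V, s, a):
    """Expected discounted return of taking action a in state s, then following V."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def greedy_policy(V):
    """Stationary policy that picks the best action in each state under V."""
    return {s: max(P[s], key=lambda a: q_value(V, s, a)) for s in P}


def value_iteration(tol=1e-10):
    """Apply the Bellman optimality update until the values stop changing."""
    V = {s: 0.0 for s in P}
    while True:
        V_new = {s: max(q_value(V, s, a) for a in P[s]) for s in P}
        if max(abs(V_new[s] - V[s]) for s in P) < tol:
            return greedy_policy(V_new), V_new
        V = V_new


def policy_iteration(tol=1e-10):
    """Alternate policy evaluation and greedy improvement until the policy is stable."""
    policy = {s: 0 for s in P}  # arbitrary starting policy
    while True:
        # Policy evaluation by repeated sweeps (a linear solve would also work).
        V = {s: 0.0 for s in P}
        while True:
            V_new = {s: q_value(V, s, policy[s]) for s in P}
            delta = max(abs(V_new[s] - V[s]) for s in P)
            V = V_new
            if delta < tol:
                break
        improved = greedy_policy(V)
        if improved == policy:
            return policy, V
        policy = improved


if __name__ == "__main__":
    print(value_iteration())
    print(policy_iteration())
```

On this toy problem both methods should arrive at the same stationary policy, which is the point of the question: they are alternative routes to the same optimum.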

Answer to Question 2

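Correct: b. By the Markov property, the next period's state depends only on the current period's state and the current decision chosen, not on earlier states or decisions.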
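
Written as a formula, the Markov property behind this answer looks as follows. The symbols s_t and a_t (state and decision in period t) are assumed notation, not taken from the original question.

```latex
P(s_{t+1} \mid s_t, a_t, s_{t-1}, a_{t-1}, \dots, s_0, a_0) = P(s_{t+1} \mid s_t, a_t)
```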


