enow.com Web Search

Search results

  1. Markov decision process - Wikipedia

    en.wikipedia.org/wiki/Markov_decision_process

    The "Markov" in "Markov decision process" refers to the underlying structure of state transitions that still follow the Markov property. The process is called a "decision process" because it involves making decisions that influence these state transitions, extending the concept of a Markov chain into the realm of decision-making under uncertainty.

  2. Category:Markov processes - Wikipedia

    en.wikipedia.org/wiki/Category:Markov_processes

    Markov chain approximation method; Markov chain central limit theorem; Markov chain mixing time; Markov chain tree theorem; Markov Chains and Mixing Times; Markov chains on a measurable state space; Markov decision process; Markov information source; Markov kernel; Markov chain; Markov property; Markov renewal process; Markov reward model ...

  3. Markov model - Wikipedia

    en.wikipedia.org/wiki/Markov_model

    A Markov decision process is a Markov chain in which state transitions depend on the current state and an action vector that is applied to the system. Typically, a Markov decision process is used to compute a policy of actions that will maximize some utility with respect to expected rewards.
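
    (A minimal value-iteration sketch of this policy computation appears after this results list.)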

  4. Gauss–Markov process - Wikipedia

    en.wikipedia.org/wiki/Gauss–Markov_process

    Gauss–Markov stochastic processes (named after Carl Friedrich Gauss and Andrey Markov) are stochastic processes that satisfy the requirements for both Gaussian processes and Markov processes. [1] [2] A stationary Gauss–Markov process is unique up to rescaling; such a process is also known as an Ornstein–Uhlenbeck process.
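
    (A short Ornstein–Uhlenbeck simulation sketch appears after this results list.)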

  5. Markov chain - Wikipedia

    en.wikipedia.org/wiki/Markov_chain

    Note that there is no definitive agreement in the literature on the use of some of the terms that signify special cases of Markov processes. Usually the term "Markov chain" is reserved for a process with a discrete set of times, that is, a discrete-time Markov chain (DTMC), [11] but a few authors use the term "Markov process" to refer to a ...
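
    (A small simulation sketch for a DTMC appears after this results list.)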

  6. Partially observable Markov decision process - Wikipedia

    en.wikipedia.org/wiki/Partially_observable...

    A partially observable Markov decision process (POMDP) is a generalization of a Markov decision process (MDP). A POMDP models an agent decision process in which it is assumed that the system dynamics are determined by an MDP, but the agent cannot directly observe the underlying state.
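
    (A minimal belief-update sketch for this setting appears after this results list.)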

  7. Detailed balance - Wikipedia

    en.wikipedia.org/wiki/Detailed_balance

    A Markov process is called a reversible Markov process or reversible Markov chain if there exists a positive stationary distribution π that satisfies the detailed balance equations [12] π_i P_ij = π_j P_ji, where P_ij is the Markov transition probability from state i to state j, i.e. P_ij = P(X_t = j | X_{t−1} = i), and π_i and π_j are the equilibrium probabilities of being in states i and j, respectively ...
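
    (A numerical check of the detailed balance equations appears after this results list.)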

  8. Markov reward model - Wikipedia

    en.wikipedia.org/wiki/Markov_reward_model

    In probability theory, a Markov reward model or Markov reward process is a stochastic process which extends either a Markov chain or continuous-time Markov chain by adding a reward rate to each state. An additional variable records the reward accumulated up to the current time. [1]
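
    (A reward-accumulation sketch for this model appears below.)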
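
Illustrative sketches

The Markov model entry above describes using an MDP to compute a policy that maximizes expected reward. A standard way to do this is value iteration; the following is a minimal sketch for a hypothetical two-state, two-action MDP, where all transition probabilities, rewards, and the discount factor are made-up illustration values, not anything taken from the entries above.

```python
# Value iteration for a tiny, made-up MDP (illustration only).
# States: 0, 1; actions: 0, 1.
# P[s][a] is a list of (next_state, probability); R[s][a] is the reward.
P = {
    0: {0: [(0, 0.9), (1, 0.1)], 1: [(0, 0.2), (1, 0.8)]},
    1: {0: [(0, 0.5), (1, 0.5)], 1: [(1, 1.0)]},
}
R = {0: {0: 0.0, 1: 1.0}, 1: {0: 2.0, 1: 0.5}}
gamma = 0.95  # discount factor

V = {0: 0.0, 1: 0.0}
for _ in range(1000):
    # Bellman optimality update: best one-step lookahead value per state.
    V_new = {
        s: max(
            R[s][a] + gamma * sum(p * V[s2] for s2, p in P[s][a])
            for a in P[s]
        )
        for s in P
    }
    if max(abs(V_new[s] - V[s]) for s in P) < 1e-8:
        V = V_new
        break
    V = V_new

# Greedy policy with respect to the converged value function.
policy = {
    s: max(P[s], key=lambda a: R[s][a] + gamma * sum(p * V[s2] for s2, p in P[s][a]))
    for s in P
}
print(V, policy)
```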
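
The Gauss–Markov process entry identifies the stationary case with the Ornstein–Uhlenbeck process. The sketch below simulates one path of the OU dynamics dX_t = −θ X_t dt + σ dW_t with the Euler–Maruyama scheme; θ, σ, the step size, and the initial value are arbitrary choices for illustration.

```python
import random

# Euler–Maruyama simulation of an Ornstein–Uhlenbeck process
#   dX_t = -theta * X_t dt + sigma * dW_t
# (theta, sigma, dt, and the initial value are arbitrary illustration values).
theta, sigma = 1.0, 0.3
dt, n_steps = 0.01, 1000
x = 2.0  # initial value

path = [x]
for _ in range(n_steps):
    dw = random.gauss(0.0, dt ** 0.5)  # Brownian increment ~ N(0, dt)
    x += -theta * x * dt + sigma * dw
    path.append(x)

# The path decays toward the zero mean and then fluctuates around it.
print(path[-1])
```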
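
The Markov chain entry distinguishes discrete-time Markov chains (DTMCs) from other Markov processes. A DTMC is fully specified by a row-stochastic transition matrix, as in this small sketch with a made-up two-state matrix; the long-run visit frequencies it prints approximate the chain's stationary distribution.

```python
import random

# Simulate a discrete-time Markov chain (DTMC) from a row-stochastic
# transition matrix, where P[i][j] = P(X_t = j | X_{t-1} = i).
P = [[0.9, 0.1],
     [0.4, 0.6]]  # made-up two-state example

def step(state):
    # Sample the next state from the distribution in row `state`.
    return random.choices(range(len(P)), weights=P[state])[0]

state, counts = 0, [0, 0]
for _ in range(100_000):
    state = step(state)
    counts[state] += 1

# Long-run visit frequencies approximate the stationary distribution.
print([c / sum(counts) for c in counts])
```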
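
Because a POMDP agent cannot observe the underlying state, as the entry above notes, it typically maintains a belief (a probability distribution over states) and updates it by Bayes' rule after each action and observation. The sketch below implements that standard update for hypothetical transition and observation tables; all the numbers are made up.

```python
# Bayesian belief update for a POMDP:
#   b'(s2) is proportional to O(o | s2, a) * sum_s T(s2 | s, a) * b(s)
# T[a][s][s2] = P(s2 | s, a); O[a][s2][o] = P(o | s2, a).
# Two states, one action, two observations -- hypothetical values.
T = {0: [[0.7, 0.3],
         [0.2, 0.8]]}
O = {0: [[0.9, 0.1],
         [0.3, 0.7]]}

def belief_update(b, a, o):
    # Predict: push the belief through the transition model.
    predicted = [sum(T[a][s][s2] * b[s] for s in range(len(b)))
                 for s2 in range(len(b))]
    # Correct: weight by the observation likelihood, then normalize.
    unnorm = [O[a][s2][o] * predicted[s2] for s2 in range(len(b))]
    z = sum(unnorm)
    return [u / z for u in unnorm]

b = [0.5, 0.5]              # uniform prior over the hidden state
b = belief_update(b, 0, 0)  # take action 0, then observe observation 0
print(b)
```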
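
The detailed balance condition from the entry above can be checked numerically: compute the stationary distribution π and test π_i P_ij = π_j P_ji for every pair of states. The transition matrix below is a made-up symmetric example, which is always reversible with the uniform stationary distribution.

```python
# Check detailed balance (pi_i * P_ij == pi_j * P_ji) for a small chain.
P = [[0.5, 0.3, 0.2],
     [0.3, 0.4, 0.3],
     [0.2, 0.3, 0.5]]  # made-up symmetric (hence reversible) example

# Power iteration for the stationary distribution pi = pi * P.
pi = [1 / 3] * 3
for _ in range(10_000):
    pi = [sum(pi[i] * P[i][j] for i in range(3)) for j in range(3)]

reversible = all(
    abs(pi[i] * P[i][j] - pi[j] * P[j][i]) < 1e-10
    for i in range(3) for j in range(3)
)
print(pi, reversible)  # symmetric P: uniform pi, reversible == True
```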
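
Finally, the Markov reward model entry describes attaching a reward to each state and recording the accumulated reward in an extra variable. The sketch below does this for a discrete-time chain; the transition matrix and per-state rewards are made-up values.

```python
import random

# A Markov reward process: a Markov chain plus a per-state reward, with an
# extra accumulator variable tracking the reward earned up to the current time.
P = [[0.8, 0.2],
     [0.5, 0.5]]      # made-up transition matrix
reward = [1.0, 5.0]   # made-up reward earned per step spent in each state

state, total = 0, 0.0
n_steps = 1000
for _ in range(n_steps):
    total += reward[state]  # accrue this state's reward
    state = random.choices([0, 1], weights=P[state])[0]

print(total / n_steps)  # long-run average reward per step
```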