Markov chain probabilistic processes

An example of a circuit with two states

The chain of Markov is a sequence of random events with a finite or countable number of outcomes, characterized by the property that, speaking loosely, with a fixed present, the future is independent of the past. Named in honor of A. A. Markov (senior).

Markov chain with discrete time

Definition

Sequence of discrete random variables called a simple Markov chain (with discrete time) if

Markov chain .

Thus, in the simplest case, the conditional distribution of the subsequent state of the Markov chain depends only on the current state and does not depend on all previous states (unlike the Markov chains of higher orders).

Range of values of random variables Markov chain called the chain space , and the number - step number.

Transition Matrix and Homogeneous Circuits [edit]

Matrix Markov chain where

Markov chain

is called the matrix of transition probabilities on Markov chain m step and vector where

Markov chain

- the initial distribution of the Markov chain.

Obviously, the transition probability matrix is stochastic, that is,

Markov chain .

A Markov chain is said to be single -pitch if the transition probability matrix does not depend on the step number, that is,

Markov chain .

Otherwise, the Markov chain is called inhomogeneous. In the following, we will assume that we are dealing with homogeneous Markov chains.

Finite-dimensional distributions and transition matrix in n steps

From the properties of conditional probability and the definition of a homogeneous Markov chain we get:

Markov chain ,

whence the special case of the Kolmogorov – Chapman equation follows:

Markov chain ,

that is, the transition probability matrix for steps of a homogeneous Markov chain -th degree of the matrix of transition probabilities for 1 step. Finally,

Markov chain .

Classification of Markov Chain States

Returnable condition;
Markov return chain;
Achievable condition;
The indecomposable Markov chain;
Periodic state;
Periodic Markov chain;
Absorbing state;
Ergodic condition.

Examples

Branching process;
Random walk;
In the series, 4 numbers (Numb3rs), using the Markov chain as an example, try to uncover the escape of two prisoners. Season 1, Episode 13

Markov chain with continuous time

Definition

Family of discrete random variables called a Markov chain (with continuous time) if

Markov chain .

A chain of Markov with continuous time is called homogeneous if

Markov chain .

The matrix of transition functions and the Kolmogorov – Chapman equation

Similar to the discrete time case, the finite-dimensional distributions of a homogeneous Markov chain with continuous time are completely determined by the initial distribution

Markov chain

and the matrix of transition functions ( transition probabilities )

Markov chain .

The matrix of transition probabilities satisfies the Kolmogorov – Chapman equation: Markov chain or

Markov chain

Intensity matrix and Kolmogorov differential equations

By definition, the intensity matrix Markov chain or equivalently

Markov chain .

From the Kolmogorov-Chapman equation, two equations follow:

Kolmogorov direct equation
Kolmogorov inverse equation

For both equations, the initial condition is chosen Markov chain . Appropriate solution

Properties of matrices P and Q [edit]

For anyone Markov chain matrix has the following properties:

Matrix Elements non-negative: (nonnegative probability).
The sum of the elements in each row equals 1: (total probability), that is, the matrix is stochastic on the right (or in rows).
All proper numbers matrices do not exceed 1 in absolute value: . If a then .
Proper number matrices corresponds to at least one non-negative left eigenvector line (equilibrium): .
For own number matrices all root vectors are proper, that is, the corresponding Jordan cells are trivial.

Matrix Markov chain has the following properties:

Off-diagonal matrix elements non-negative: .
Diagonal Matrix Elements non-positive: .
The sum of the elements in each row equals 0:
Real part of all proper numbers matrices non-positive: . If a then
Proper number matrices corresponds to at least one non-negative left eigenvector line (equilibrium):
For own number matrices all root vectors are proper, that is, the corresponding Jordan cells are trivial.

Graph of transitions, connectivity and ergodic Markov chains

For a Markov chain with continuous time, an oriented transition graph (briefly, a transition graph) is constructed according to the following rules:

The set of vertices of the graph coincides with the set of chain states.
Vertices connected by oriented edge , if a (i.e. the flow rate from th state in is positive.

Topological properties of the transition graph associated with the spectral properties of the matrix . In particular, the following theorems are true for finite Markov chains:

The following three properties of A, B, and B of a finite Markov chain are equivalent (the chains possessing them are sometimes called weakly ergodic ):

A. For any two different vertices of the transition graph Markov chain there is such a vertex a graph (“common drain”) that there are oriented paths from the top to the top and from the top to the top . Note : possible case or ; in this case, the trivial (empty) path from to or from to also considered an oriented way.

B. Zero eigenvalue of the matrix Markov chain nondegenerate.

B. When Markov chain matrix tends to the matrix, in which all the rows coincide (and coincide, obviously, with the equilibrium distribution).

The following five properties A, B, C, D, D of a finite Markov chain are equivalent (the chains possessing them are called ergodic ):

A. The transition graph of a chain is oriented.

B. Zero eigenvalue of the matrix Markov chain is non-degenerate and corresponds to a strictly positive left eigenvector (equilibrium distribution).

B. For some Markov chain matrix strictly positive (i.e. for all ).

G. For all Markov chain matrix strictly positive.

D. When Markov chain matrix tends to a strictly positive matrix, in which all the rows coincide (and obviously coincide with the equilibrium distribution).

Examples

Fig. Examples of transition graphs for Markov chains: a) the chain is not weakly ergodic (there is no common flow for states

); b) weakly ergodic, but not ergodic chain (transition graph is not oriented connected); c) ergodic chain (transition graph orientedly connected).

Let us consider Markov chains with three states and with continuous time, corresponding to the transition graphs shown in Fig. In case (a), only the following nondiagonal elements of the intensity matrix are nonzero: in case (b) are non-zero only , and in case (c) - . The remaining elements are determined by the properties of the matrix. Markov chain (the sum of the elements in each row is 0). As a result, for graphs (a), (b), (c), the intensity matrices are:

Basic kinetic equation

Main article: Basic kinetic equation

The basic kinetic equation describes the evolution of the probability distribution in a Markov chain with continuous time. The “basic equation” here is not an epithet, but a translation of the term English. Master equation . For a probability distribution vector string the basic kinetic equation is:

Markov chain

and coincides, essentially, with the direct Kolmogorov equation. In the physical literature, probability column vectors are used more often and the basic kinetic equation is written in the form that explicitly uses the law of conservation of total probability:

Markov chain

Where Markov chain

If for the basic kinetic equation there is a positive equilibrium Markov chain then it can be written in the form

Markov chain

Lyapunov functions for the main kinetic equation

For the basic kinetic equation, there exists a rich family of convex Lyapunov functions — monotonously varying with time distribution probability functions. Let be Markov chain - convex function of one variable. For any positive probability distribution ( ) we define the function Morimoto :

Markov chain .

Derivative Markov chain on time if satisfies the basic kinetic equation, there is

Markov chain .

The last inequality holds because of the bulge Markov chain .

Examples of Morimoto Functions [edit]

this function is the distance from the current probability distribution to the equilibrium in Markov chain -norm The time shift is a contraction of the space of probability distributions in this norm. (For compression properties, see the Banach Fixed Point Theorem article.)

this function is (minus) Kullback entropy (see Kullback – Leibler Distance). In physics, it corresponds to the free energy divided by Markov chain (Where —Permanent Boltzmann, - absolute temperature):

if a Markov chain (Boltzmann distribution), then

Markov chain .

This function is an analogue of the free energy for Burg entropy, widely used in signal processing:

Markov chain

this is a quadratic approximation for the (minus) Kullback entropy near the equilibrium point. Up to a time-constant term, this function coincides with the (minus) Fisher entropy, which is given by the following choice,

this is (minus) Fisher entropy.

This is one of the analogues of free energy for the entropy of Tsallis. Tsallis entropy

Markov chain

serves as the basis for the statistical physics of nonextensive quantities. With Markov chain it tends to the classical Boltzmann – Gibbs – Shannon entropy, and the corresponding Morimoto function to the (minus) Kullback entropy.

Markov chain

Markov chain with discrete time

Definition

Transition Matrix and Homogeneous Circuits [edit]

Finite-dimensional distributions and transition matrix in n steps

Classification of Markov Chain States

Examples

Markov chain with continuous time

Definition

The matrix of transition functions and the Kolmogorov – Chapman equation

Intensity matrix and Kolmogorov differential equations

Properties of matrices P and Q [edit]

Graph of transitions, connectivity and ergodic Markov chains

Examples

Basic kinetic equation

Lyapunov functions for the main kinetic equation

Examples of Morimoto Functions [edit]

Comments

To leave a comment

probabilistic processes

Terms: probabilistic processes