Next: Some other matrices
Up: Semiseparable matrices as covariance
Previous: Semiseparable matrices as covariance
  Contents
  Index
The multinomial distribution
Before defining the multinomial distribution, the binomial
distribution is discussed. These comments are based on the book
[180] and on the web pages:
http://www.stat.yale.edu/Courses/1997-98/
101/binom.htm. The use
of the distribution is explained with an example. Briefly the binomial
distribution describes the number of ``success'' outcomes when a
Bernoulli experiment is repeated
times, independently.
More precisely: the binomial distribution describes the behavior of a
count variable
, which counts the number of successes in
observations, if the following conditions are satisfied:
- The number of observations
is fixed.
- Each observation is independent.
- Each observation represents one of two outcomes (``success'' or
``failure'').
- The
probability of ``success''
is the same for each outcome.
The binomial distribution is denoted as
, with
denoting
the number of observations and
the chance of success. This
distribution is used in several examples, for example in:
Example 23
Suppose we have a group of individuals with a certain gene. These
people have a 0.60 probability of getting a certain disease.
If a study is performed on 300 individuals, with this specific gene,
then the distribution of the variable
describing the number of people who will get the disease has
the following binomial distribution:

.
Example 24
The
number of sixes rolled by a single die in

rolls has a binomial
distribution

.
The distribution function for the binomial distribution
satisfies
the following equation:
where
The mean and the variance of this distribution can be calculated,
using the formulas above and are: (denote the mean as
and the
variance as
)
Before giving the distribution function we will try to explain what is
meant with a multinomial distribution. A multinomial trials process
is a sequence of independent, identically distributed random variables
, where each random variable can take now
values. For the Bernoulli process, this corresponds to
,
(success and failure). Therefore this is a generalization of a
Bernoulli trials process. We denote the outcomes by the integers
, for simplicity. This means that for a trial variable
we can write the distribution function in the following way.
Of course
for each
and
.
As with the binomial distribution, we are interested
in the variables counting the number of times each outcome has
occurred, where in the binomial case one variable for counting was
enough, here we need
. Thus, let (with
we denote the cardinality of the set)
where
is the number of observations.
Note that
So if we know the values of
of the counting variables, we can
find the value of the remaining counting variable as already mentioned
before. Generalizing the binomial distribution we get the following
function:
(More information about these distributions can be
found at http://www.math.uah.edu/statold/bernoulli/)
with
and
. Before we
start calculating the covariance matrix, we will give an
example:
Example 25
It is very easy to see that the dice experiment also fits in here. For
example if one rolls 10 dice, we can calculate the
probability that 1 and 2 occur once, and the other occur all two
times. To calculate this, one needs the multinomial distribution.
For this distribution:
,
and
.
As an example we calculate the mean using the following binonium, and multinonium
formulas:
For this distribution we calculate the mean of the variable
, all the other variances and covariances can
be calculated in a complete analogous way.
where
has a binomial distribution
, which gives us the
last equality.
The covariance matrix of this distribution looks like
This matrix can be rewritten as a semiseparable plus diagonal matrix in the following
way. Denote:
then the matrix can be written, in a more compact form as
This is a semiseparable plus diagonal matrix.
The main reason of writing the covariance matrices in this form,
is the simple expression of the inverses of this type of
matrices. The book [95]
states several theorems about the inverses of tridiagonal and semiseparable
matrices, resulting in an inversion formula for this type of matrices, namely:
with
,
and
with
as the diagonal elements of
.
This leads to the following explicit structure of the matrix
,
This matrix is clearly again semiseparable plus diagonal, as we
expected according to the theorems of Chapter 1.
Next: Some other matrices
Up: Semiseparable matrices as covariance
Previous: Semiseparable matrices as covariance
  Contents
  Index
Raf Vandebril
2004-05-03