Derivation of Shannon's Result next up previous
Next: Some Useful Theorems Up: Information Theory Previous: Information Theory

Derivation of Shannon's Result

Our derivation will follow directly from our earlier definition of information. Let $H(1/n, 1/n, \cdots, 1/n) = A(n)$ denote the information content of a source consisting of n equiprobable symbols.

Now we know $\forall s \ge t, \exists n: s^m \le t^n < s^{m+1}$

Then, $\forall s > t$,

\begin{displaymath}\frac{m}{n} \le \frac{\log t}{\log s} < \frac{m+1}{n}
\end{displaymath}


 \begin{displaymath}
\Rightarrow \left\vert \frac{\log t}{\log s} - \frac{m}{n}\right\vert < \frac{1}{n}
\end{displaymath} (7)

Now consider three sources with sm, tn and sm+1 equiprobable symbols respectively. Using property (3), A(sm) = mA(s), A(tn) = nA(t) and A(sm+1) = (m+1)A(s). (Why? Discuss this.)

Since A must increase monotonically with the number of equiprobable choices (property 2),

\begin{displaymath}A(s^m) \le A(t^n) < A(s^{m+1})
\end{displaymath}

or equivalently using property (3) and dividing by nA(s),


\begin{displaymath}\frac{m}{n} \le \frac{A(t)}{A(s)} < \frac{m+1}{n}
\end{displaymath}


 \begin{displaymath}
\Rightarrow \left\vert\frac{A(t)}{A(s)} - \frac{m}{n}\right\vert < \frac{1}{n}
\end{displaymath} (8)

(8) - (9) gives:

\begin{displaymath}\left\vert\frac{\log t}{\log s} - \frac{A(t)}{A(s)}\right\vert < \frac{2}{n}
\end{displaymath}

Since n is aribtrarily large,

 \begin{displaymath}
A(s) = K\log s
\end{displaymath} (9)

Let pi = ni/N where the ni are integers and $N =
{\sum_{i=1}^m n_i}$.

Now consider a choice over N symbols. We can break this down into a choice over m symbols each with probability pi as defined above and then within each choice i make a second choice over ni equally likely symbols. Let $H = H(s_1, \cdots, s_m)$. Using (10), we can now write:

\begin{displaymath}K\log N = H + K \sum_{i=1}^m p_i\log n_i
\end{displaymath}

Hence,

\begin{displaymath}H = K\left[ \log N - \sum_{i=1}^m p_i\log n_i \right]
\end{displaymath}

Since $\sum_{i=1}^m p_i = 1$,

\begin{eqnarray*}H &=& K\left[ \sum_{i=1}^m p_i \log N - \sum_{i=1}^m p_i\log n_...
...i \log \frac{n_i}{N} \right]\\
&=& -K\sum_{i=1}^m p_i \log p_i
\end{eqnarray*}


If the pi are real numbers and not rational as per our above assumption, they can be approximated by rationals as closely as we please. And since H is a continuous function by property (1), the expression is valid for $p_i \in [0,1]$.

By analogy with thermodynamics, we call H the Entropy of the source. Note that we choose to interpret the entropy of the source as the amount of information contained in it.


next up previous
Next: Some Useful Theorems Up: Information Theory Previous: Information Theory
Anand Venkataraman
1999-09-16