Various theorems and their proofs regarding entropies and
relative entropies. If
,
then
1.
2.
3.
with equality iff some
4.
with equality iff
Prefix codes
Kraft inequality
Read the two papers given: Shannon's seminal BSTJ report (1949) and
the Village paper (1999) by myself and Ray Kemp.
What is information?
Pay particular attention to how Shannon characterizes information
Section 6 (p.10) of the report. This is crucial in proving his
result later on.
Information is quantified by the measure of how much
``choice'' is involved in the event selection. This measure,
,
where the pi are the probabilities of
occurrences of the events whose info-content H is trying to
quantify, must have the following desirable properties:
1.
H should be continuous in the pi
2.
If all the pi are equal, pi = 1/n, then H should be a
monotonic increasing function of n. With equally likely events there
is more choice, or uncertainty, when there are more possible events.
3.
If a choice be broken down into two successive choices, the
original H should be the weighted sum of the individual values of
H.
Ponder:
Is the above characterization of information sensible?
Can you think of any other way of characterizing information?