The Shannon index has been a popular diversity index in the ecological literature, where it is also known as Shannon's diversity index, the Shannon–Wiener index, the Shannon–Weaver (Warren Weaver) index and the Shannon entropy.
The measure was originally proposed by Claude Shannon to quantify the entropy (Entropy (information theory)) (uncertainty or information content) in strings of text.
The idea is that the more different letters there are, and the more equal their proportional abundances in the string of interest, the more difficult it is to correctly predict which letter will be the next one in the string.
The Shannon entropy quantifies the uncertainty (entropy or degree of surprise) associated with this prediction.
It has been shown that the Shannon index is based on the weighted geometric mean of the proportional abundances of the types, and that it equals the logarithm of true diversity.
When all types in the dataset of interest are equally common, all values equal , and the Shannon index hence takes the value . The more unequal the abundances of the types, the larger the weighted geometric mean of the values, and the smaller the corresponding Shannon entropy.
If practically all abundance is concentrated to one type, and the other types are very rare (even if there are many of them), Shannon entropy approaches zero. When there is only one type in the dataset, Shannon entropy exactly equals zero (there is no uncertainty in predicting the type of the next randomly chosen entity).
# Rényi entropy
The Rényi entropy is a generalization of the Shannon entropy to other values of than unity.
# See also * Diversity index * Simpson index * Berger–Parker index