David MacKay
.






Search :

.

English and German letter frequencies
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z -
ENGLISH (e) 0.07 0.01 0.03 0.03 0.1 0.02 0.02 0.05 0.06 0.001 0.006 0.03 0.02 0.06 0.06 0.02 0.0009 0.05 0.05 0.08 0.02 0.008 0.02 0.002 0.01 0.0008 0.17
GERMAN (g) 0.06 0.02 0.03 0.04 0.15 0.01 0.03 0.04 0.07 0.002 0.01 0.03 0.02 0.08 0.02 0.007 0.0002 0.06 0.06 0.05 0.04 0.006 0.02 0.0003 0.0003 0.01 0.14

The entropies of these two distributions are H(e) = 4.1 bits; H(g) = 4.1 bits; and the relative entropies between them are D_KL( e || g) = 0.16 bits and D_KL( g || e) = 0.12 bits. The relative entropies between the uniform distribution u and the English distribution e are D_KL( e || u) ~= 0.6 bits and D_KL( u || e) ~= 1 bits.


Site last modified Thu Sep 30 20:34:34 BST 2004