Another important modification of this corpus concerns its size: whereas the ICE guidelines require only 200 texts (i.e. 400,000 words) in the written component of each subcorpus, we decided to compile parallel written corpora for Kenya and Tanzania, thus producing 800,000 words of written data for East African English. One of the reasons for doing this was that the linguistic situations in Kenya and Tanzania differ to such an extent that we considered it necessary to represent both varieties of written English fully. English is clearly a second-language (ESL) variety in Kenya, but the status of the language in Tanzania is less clear-cut. In both countries, English is prestigious, being the language of secondary and tertiary education and of the High Court, but in Tanzania Kiswahili is the language used in Parliament and government institutions. In Kenya, English serves as a lingua franca between people of various ethnic groups, particularly in the bigger towns and in the capital, Nairobi, whereas in Tanzania Kiswahili fulfils this function.
The accompanying manual provides a comprehensive description of all text categories with special reference to ICE-East Africa, together with lists of all the texts used and background information on the speakers and authors where this could be obtained. The manual is published with the corpus on the new ICAME CD-ROM.