A survey of some existing corpora and a classification

Some important existing English corpora

Modern corpora differ from those named above in two main respects. First, thanks to technological advances, in particular faster and more powerful computers, modern corpora are vastly larger. The British National Corpus, for example, consists of around 100 million words, which makes it about a hundred times the size of the Brown corpus! Second, corpus designers today usually try to include as much spoken material as is financially and technically feasible. (Remember that transcribing conversations is a time-consuming and expensive process!) Three examples of modern corpora are the British National Corpus, which I have just mentioned, the International Corpus of English, and the Bank of English, based at Birmingham University.

More links

Online corpora

This list of corpora is only a rather subjective selection. For a more exhaustive list of corpora and other online resources, go to "English language corpora and corpus resources".

A possible classification