Search results
Results from the WOW.Com Content Network
The Corpus of Contemporary American English (COCA) is composed of one billion words as of November 2021. [1] [2] [4] The corpus is constantly growing: In 2009 it contained more than 385 million words; [5] in 2010 the corpus grew in size to 400 million words; [6] by March 2019, [7] the corpus had grown to 560 million words.
Corpus of Political Speeches contains four collections of political speeches in English and Chinese from The Corpus of U.S. Presidential Speeches (1789–2015), The Corpus of Policy Address by Hong Kong Governors (1984–1996) and Hong Kong Chief Executives (1997–2014), The Corpus of Speeches given on New Year's days and Double Tenth days by ...
The table also includes frequencies from other corpora. As well as usage differences, lemmatisation may differ from corpus to corpus – for example splitting the prepositional use of "to" from the use as a particle. Also, the Corpus of Contemporary American English (COCA) list includes dispersion as well as frequency to calculate rank.
Retrieved from "https://en.wikipedia.org/w/index.php?title=COCA:_Corpus_of_Contemporary_American_English&oldid=227128251"
Main page; Contents; Current events; Random article; About Wikipedia; Contact us
Justice Lee looked at 500 randomized sample sentences from the Corpus of Contemporary American English (COCA) and found that the most common sense of "custody" was in the context of divorce rather than adoption. [4]: 724 Further, he found that "custody" is ten times more likely to co-occur (or collocate) with "divorce" than with "adoption".
Mark E. Davies (born 1963) is an American linguist. He specializes in corpus linguistics and language variation and change.He is the creator of most of the text corpora from English-Corpora.org (including the Corpus of Contemporary American English/ COCA) as well as the Corpus del español and the Corpus do português.
The Academic Vocabulary List, based on the Academic Word List, drawing from the Corpus of Contemporary American English (COCA), was developed by Gardner and Davies in 2013. Rather than relying on word families, like the AWL, the AVL is composed of 3000 English lemmas, and provides a broader coverage of Academic English. [5]