enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. Corpus of Contemporary American English - Wikipedia

    en.wikipedia.org/wiki/Corpus_of_Contemporary...

    The Corpus of Contemporary American English (COCA) is composed of one billion words as of November 2021. [1] [2] [4] The corpus is constantly growing: In 2009 it contained more than 385 million words; [5] in 2010 the corpus grew in size to 400 million words; [6] by March 2019, [7] the corpus had grown to 560 million words.

  3. American National Corpus - Wikipedia

    en.wikipedia.org/wiki/American_National_Corpus

    American National Corpus. The American National Corpus (ANC) is a text corpus of American English containing 22 million words of written and spoken data produced since 1990. Currently, the ANC includes a range of genres, including emerging genres such as email, tweets, and web data that are not included in earlier corpora such as the British ...

  4. Most common words in English - Wikipedia

    en.wikipedia.org/wiki/Most_common_words_in_English

    The number of distinct senses that are listed in Wiktionary is shown in the polysemy column. For example, "out" can refer to an escape, a removal from play in baseball, or any of 36 other concepts. On average, each word in the list has 15.38 senses. The sense count does not include the use of terms in phrasal verbs such as "put out" (as in ...

  5. Brown Corpus - Wikipedia

    en.wikipedia.org/wiki/Brown_Corpus

    The Brown Corpus was a carefully compiled selection of current American English, totalling about a million words drawn from a wide variety of sources. Kučera and Francis subjected it to a variety of computational analyses, from which they compiled a rich and variegated opus, combining elements of linguistics, psychology, statistics, and sociology.

  6. List of text corpora - Wikipedia

    en.wikipedia.org/wiki/List_of_text_corpora

    Corpus of Contemporary American English (COCA) 425 million words, 1990–2011. Freely searchable online; Corpus Resource Database (CoRD), more than 80 English language corpora. [2] Coruña Corpus, a corpus of late Modern English scientific writing covering the period 1700–1900, developed by the Muste research group at the University of A Coruña

  7. Cambridge English Corpus - Wikipedia

    en.wikipedia.org/wiki/Cambridge_English_Corpus

    The Cambridge International Corpus (CIC) is a collection of over 800 million words of real spoken and written English . The texts are stored in a database that can be searched to see how English is used. The CIC also contains the Cambridge Learner Corpus, a unique collection of over 60,000 exam papers from Cambridge ESOL.

  8. TenTen Corpus Family - Wikipedia

    en.wikipedia.org/wiki/TenTen_Corpus_Family

    The TenTen Corpus Family (also called TenTen corpora) is a set of comparable web text corpora, i.e. collections of texts that have been crawled from the World Wide Web and processed to match the same standards. These corpora are made available through the Sketch Engine corpus manager. There are TenTen corpora for more than 35 languages.

  9. COCA: Corpus of Contemporary American English - Wikipedia

    en.wikipedia.org/?title=COCA:_Corpus_of...

    Retrieved from "https://en.wikipedia.org/w/index.php?title=COCA:_Corpus_of_Contemporary_American_English&oldid=227128251"