enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. British National Corpus - Wikipedia

    en.wikipedia.org/wiki/British_National_Corpus

    The British National Corpus (BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. [1] The corpus covers British English of the late 20th century from a wide variety of genres, with the intention that it be a representative sample of spoken and written British English of that time.

  3. List of text corpora - Wikipedia

    en.wikipedia.org/wiki/List_of_text_corpora

    Text corpora (singular: text corpus) are large and structured sets of texts, which have been systematically collected.Text corpora are used by corpus linguists and within other branches of linguistics for statistical analysis, hypothesis testing, finding patterns of language use, investigating language change and variation, and teaching language proficiency.

  4. Category:English corpora - Wikipedia

    en.wikipedia.org/wiki/Category:English_corpora

    Main page; Contents; Current events; Random article; About Wikipedia; Contact us; Pages for logged out editors learn more

  5. List of children's speech corpora - Wikipedia

    en.wikipedia.org/wiki/List_of_children's_speech...

    For example, an age range of "K - G10" refers to speakers ranging from kindergarten age to grade 10. This table is based on a paper from the Interspeech conference, 2016. [ 4 ] This online article is intended to provide an interactive table for readers and a place where information about children speech corpora that can be updated continuously ...

  6. CLAWS (linguistics) - Wikipedia

    en.wikipedia.org/wiki/CLAWS_(linguistics)

    In tagging the BNC, the many rounds of work that went into CLAWS4 focused on making the CLAWS program independent from the tagsets. For example, the BNC project used two tagset versions: "a main tagset (C5) with 62 tags with which the whole of the corpus has been tagged, and a larger (C7) tagset with 152 tags, which has been used to make a ...

  7. Category:Corpora - Wikipedia

    en.wikipedia.org/wiki/Category:Corpora

    Main page; Contents; Current events; Random article; About Wikipedia; Contact us; Pages for logged out editors learn more

  8. Bank of English - Wikipedia

    en.wikipedia.org/wiki/Bank_of_English

    The Bank of English (BoE) is a representative subset of the 4.5 billion words COBUILD corpus, a collection of English texts.These are mainly British in origin, but content from North America, Australia, New Zealand, South Africa and other Commonwealth countries is also being included.

  9. Google Books Ngram Viewer - Wikipedia

    en.wikipedia.org/wiki/Google_Books_Ngram_Viewer

    Commas delimit user-entered search terms, where each comma-separated term is searched in the database as an n-gram (for example, "nursery school" is a 2-gram or bigram). [6] The Ngram Viewer then returns a plotted line chart. Note that due to limitations on the size of the Ngram database, only matches found in at least 40 books are indexed. [6]