enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. British National Corpus - Wikipedia

    en.wikipedia.org/wiki/British_National_Corpus

    The British National Corpus (BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. [1] The corpus covers British English of the late 20th century from a wide variety of genres, with the intention that it be a representative sample of spoken and written British English of that time.

  3. List of text corpora - Wikipedia

    en.wikipedia.org/wiki/List_of_text_corpora

    British National Corpus; Bergen Corpus of London Teenage Language (COLT) Brown Corpus, forming part of the "Brown Family" of corpora, together with LOB, Frown and F-LOB; Corpus of Contemporary American English (COCA) 425 million words, 1990–2011. Freely searchable online; Corpus Resource Database (CoRD), more than 80 English language corpora. [2]

  4. Most common words in English - Wikipedia

    en.wikipedia.org/wiki/Most_common_words_in_English

    Some lists of common words distinguish between word forms, while others rank all forms of a word as a single lexeme (the form of the word as it would appear in a dictionary). For example, the lexeme be (as in to be) comprises all its conjugations (is, was, am, are, were, etc.), and contractions of those conjugations. [5]

  5. CLAWS (linguistics) - Wikipedia

    en.wikipedia.org/wiki/CLAWS_(linguistics)

    For example, the BNC project used two tagset versions: "a main tagset (C5) with 62 tags with which the whole of the corpus has been tagged, and a larger (C7) tagset with 152 tags, which has been used to make a selected 'core' sample corpus of two million words."

  6. Bank of English - Wikipedia

    en.wikipedia.org/wiki/Bank_of_English

    The Bank of English totals 650 million running words. [1] Copies of the corpus are held both at HarperCollins Publishers and the University of Birmingham. The version at Birmingham can be accessed for academic research. The Bank of English forms part of the Collins Word Web together with the French, German and Spanish corpora.

  7. Text corpus - Wikipedia

    en.wikipedia.org/wiki/Text_corpus

    An example of annotating a corpus is part-of-speech tagging, or POS-tagging, in which information about each word's part of speech (verb, noun, adjective, etc.) is added to the corpus in the form of tags. Another example is indicating the lemma (base) form of each word. When the language of the corpus is not a working language of the ...

  8. Google Books Ngram Viewer - Wikipedia

    en.wikipedia.org/wiki/Google_Books_Ngram_Viewer

    The program can search for a word or a phrase, including misspellings or gibberish. [5] The n-grams are matched with the text within the selected corpus, and if found in 40 or more books, are then displayed as a graph. [6] The Google Books Ngram Viewer supports searches for parts of speech and wildcards. [6] It is routinely used in research. [7 ...

  9. Word sketch - Wikipedia

    en.wikipedia.org/wiki/Word_sketch

    Word sketch of verb "read" in the British National Corpus in Sketch Engine A word sketch is a one-page, automatic, corpus-derived summary of a word’s grammatical and collocational behaviour. Word sketches were first introduced by the British corpus linguist Adam Kilgarriff [ 1 ] and exploited within the Sketch Engine [ 2 ] corpus management ...