enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. British National Corpus - Wikipedia

    en.wikipedia.org/wiki/British_National_Corpus

    The British National Corpus (BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. [1] The corpus covers British English of the late 20th century from a wide variety of genres, with the intention that it be a representative sample of spoken and written British English of that time.

  3. CLAWS (linguistics) - Wikipedia

    en.wikipedia.org/wiki/CLAWS_(linguistics)

    In tagging the BNC, the many rounds of work that went into CLAWS4 focused on making the CLAWS program independent from the tagsets. For example, the BNC project used two tagset versions: "a main tagset (C5) with 62 tags with which the whole of the corpus has been tagged, and a larger (C7) tagset with 152 tags, which has been used to make a ...

  4. Wikipedia:Database download - Wikipedia

    en.wikipedia.org/wiki/Wikipedia:Database_download

    Before starting a download of a large file, check the storage device to ensure its file system can support files of such a large size, check the amount of free space to ensure that it can hold the downloaded file, and make sure the device(s) you'll use the storage with are able to read your chosen file system.

  5. Bank of English - Wikipedia

    en.wikipedia.org/wiki/Bank_of_English

    The Bank of English (BoE) is a representative subset of the 4.5 billion words COBUILD corpus, a collection of English texts.These are mainly British in origin, but content from North America, Australia, New Zealand, South Africa and other Commonwealth countries is also being included.

  6. Lancaster-Oslo-Bergen Corpus - Wikipedia

    en.wikipedia.org/wiki/Lancaster-Oslo-Bergen_Corpus

    The Lancaster-Oslo/Bergen (LOB) Corpus is a one-million-word collection of British English texts which was compiled in the 1970s in collaboration between the University of Lancaster, the University of Oslo, and the Norwegian Computing Centre for the Humanities, Bergen, to provide a British counterpart to the Brown Corpus compiled by Henry Kučera and W. Nelson Francis for American English in ...

  7. WordSmith (software) - Wikipedia

    en.wikipedia.org/wiki/WordSmith_(software)

    WordSmith Tools is - along with several other software products similar in nature - an internationally popular program for the work based on corpus-linguistic methodology. It is used by investigators in assorted fields as can be seen in the list below of works using the software.

  8. Corpus manager - Wikipedia

    en.wikipedia.org/wiki/Corpus_manager

    BYU-BNC [5] – a website that allows searches of the British National Corpora and others created at Brigham Young University; Coma [6] – a tool extension of the system EXMARaLDA for working with oral corpora on a computer; NoSketch Engine [7] – a free open-source corpus management system combining Manatee (back-end) and Bonito (web interface)

  9. Sketch Engine - Wikipedia

    en.wikipedia.org/wiki/Sketch_Engine

    Sketch Engine is a product of Lexical Computing, a company founded in 2003 by the lexicographer and research scientist Adam Kilgarriff. [4] He started a collaboration with Pavel Rychlý, a computer scientist working at the Natural Language Processing Centre, Masaryk University, [5] and the developer of Manatee and Bonito (two major parts of the software suite).