Search results
Results from the WOW.Com Content Network
The British National Corpus (BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. [1] The corpus covers British English of the late 20th century from a wide variety of genres, with the intention that it be a representative sample of spoken and written British English of that time.
The CLAWS4 was used for the 100-million-word British National Corpus (BNC). A general-purpose grammatical tagger, it is a successor of the CLAWS1 tagger. [ 11 ] In tagging the BNC, the many rounds of work that went into CLAWS4 focused on making the CLAWS program independent from the tagsets.
To download a subset of the database in XML format, such as a specific category or a list of articles see: Special:Export, usage of which is described at Help:Export. Wiki front-end software: MediaWiki. Database backend software: MySQL. Image dumps: See below.
Sketch Engine is a corpus manager and text analysis software developed by Lexical Computing since 2003. Its purpose is to enable people studying language behaviour (lexicographers, researchers in corpus linguistics, translators or language learners) to search large text collections according to complex and linguistically motivated queries.
Versions 1.0 through 4.0 were sold exclusively by Oxford University Press, the current version 8.0 and previous versions are now also distributed by Lexical Analysis Software Limited. The software runs under Windows. WordSmith is a download-only product which is registered by entering a code costing 50 pounds sterling for a single user license.
A corpus manager (corpus browser or corpus query system) is a tool for multilingual corpus analysis, which allows effective searching in corpora. [ 1 ] A corpus manager usually represents a complex tool that allows one to perform searches for language forms or sequences.
The Bank of English (BoE) is a representative subset of the 4.5 billion words COBUILD corpus, a collection of English texts.These are mainly British in origin, but content from North America, Australia, New Zealand, South Africa and other Commonwealth countries is also being included.
CLAN provides all the basic tools of corpus analysis such as key-word and line, concordance, frequency counting, partial regular expression search, and so on. CLAN provides additional analysis programs for between-speak contingency patterns, utterance and word length, cooccurrence clusters, and so on.