Search results
Results from the WOW.Com Content Network
Corpus of Contemporary American English (COCA) 425 million words, 1990–2011. Freely searchable online; Corpus Resource Database (CoRD), more than 80 English language corpora. [2] Coruña Corpus, a corpus of late Modern English scientific writing covering the period 1700–1900, developed by the Muste research group at the University of A Coruña
His Corpus, Concordance, Collocation formulated the "idiom principle". [4] Though he had written many books, at his valedictory lecture in 2000 he stated that none of his many published articles passed successfully through peer-review, and that even an article he had been invited to write for a journal was peer-reviewed by mistake and rejected.
Key Word In Context (KWIC) is the most common format for concordance lines. The term KWIC was coined by Hans Peter Luhn . [ 1 ] The system was based on a concept called keyword in titles , which was first proposed for Manchester libraries in 1864 by Andrea Crestadoro .
Corpus linguists specify a key word in context and identify the words immediately surrounding them, to illustrate the way words are used in practice. The processing of collocations involves a number of parameters, the most important of which is the measure of association , which evaluates whether the co-occurrence is purely by chance or ...
[1] [2] [4] The corpus is constantly growing: In 2009 it contained more than 385 million words; [5] in 2010 the corpus grew in size to 400 million words; [6] by March 2019, [7] the corpus had grown to 560 million words. [7] As of November 2021, the Corpus of Contemporary American English is composed of 485,202 texts. [4] According to the corpus ...
In corpus linguistics a key word is a word which occurs in a text more often than we would expect to occur by chance alone. [1] Key words are calculated by carrying out a statistical test (e.g., loglinear or chi-squared) which compares the word frequencies in a text against their expected frequencies derived in a much larger corpus, which acts as a reference for general language use.
A concordance is an alphabetical list of the principal words used in a book or body of work, listing every instance of each word with its immediate context.Historically, concordances have been compiled only for works of special importance, such as the Vedas, [1] Bible, Qur'an or the works of Shakespeare, James Joyce or classical Latin and Greek authors, [2] because of the time, difficulty, and ...
In recent years, linguists have used corpus linguistics and concordancing software to find such hidden associations. Specialised software is used to arrange key words in context from a corpus of several million words of naturally occurring text. The collocates can then be arranged alphabetically according to first or second word to the right or ...