If so, the collocation is considered strong, and is worth paying closer attention to. In this example, "no stranger to" is a very frequent collocation; so are words such as "mysterious", "handsome", and "dark". This comes as no surprise. More interesting, however, is "no stranger to controversy".
Collocation extraction is the task of using a computer to extract collocations automatically from a corpus. The traditional method of performing collocation extraction is to find a formula, based on the statistical quantities of the words involved, that assigns a score to every word pair.
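As a rough illustration of the score-per-word-pair idea, the sketch below ranks adjacent word pairs by pointwise mutual information (PMI). PMI is only one of several possible formulas and is chosen here as an assumption; the snippet above does not name a specific one. The function name `pmi_scores` and the `min_count` cutoff are hypothetical.

```python
import math
from collections import Counter


def pmi_scores(tokens, min_count=2):
    """Score each adjacent word pair by pointwise mutual information.

    PMI is an assumed choice of formula; min_count is a hypothetical
    cutoff that drops pairs seen too rarely to score reliably.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)

    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue
        p_pair = count / n_bi
        p_w1 = unigrams[w1] / n_uni
        p_w2 = unigrams[w2] / n_uni
        # PMI compares the observed pair probability with the probability
        # expected if the two words occurred independently.
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))
    return scores


tokens = "no stranger to controversy and no stranger to fame".split()
for pair, score in sorted(pmi_scores(tokens).items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 2))
```

A positive score means the pair occurs together more often than independence would predict. On a real corpus a higher `min_count` is usually needed, because PMI tends to overrate rare pairs.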
Corpus linguists specify a key word in context and identify the words immediately surrounding it, to illustrate the way words are used in practice. The processing of collocations involves a number of parameters, the most important of which is the measure of association, which evaluates whether the co-occurrence is purely by chance or ...
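The key-word-in-context step can be sketched as follows. This is a minimal illustration assuming a pre-tokenized text and a fixed window size; the function name `kwic` and the `window` parameter are hypothetical and do not come from any particular concordancer.

```python
def kwic(tokens, keyword, window=2):
    """Return (left context, key word, right context) for each match.

    window is an assumed parameter: the number of words kept on each side.
    """
    rows = []
    for i, tok in enumerate(tokens):
        if tok.lower() == keyword.lower():
            left = tokens[max(0, i - window):i]
            right = tokens[i + 1:i + 1 + window]
            rows.append((left, tok, right))
    return rows


tokens = "he is no stranger to controversy".split()
for left, key, right in kwic(tokens, "stranger"):
    print(" ".join(left), f"[{key}]", " ".join(right))
```

The context words collected this way are the raw counts that a measure of association would then evaluate.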
An example given by John Sinclair is the verb set in, which has a negative prosody: e.g. rot (with negative associations) is a prime example of what is going to 'set in'. [1] Another well-known example is the verb sense of cause, which is also used mostly in a negative context (accident, catastrophe, etc.), [2] though one can also say that ...
A word sketch triple is a triple consisting of headword, grammatical relation, collocation (e.g. man, modifier, young). Considering an underlying text corpus, a word sketch quintuple is a quintuple consisting of headword, grammatical relation, collocation, position of headword in the corpus, position of collocation in the corpus (e.g. man, modifier, young, 104, 103).
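The two structures can be modelled as plain records. The sketch below is one possible representation, not the format used by any particular word sketch tool; the class and field names are hypothetical.

```python
from collections import Counter
from typing import NamedTuple


class SketchTriple(NamedTuple):
    headword: str
    relation: str
    collocation: str


class SketchQuintuple(NamedTuple):
    headword: str
    relation: str
    collocation: str
    head_pos: int    # position of the headword in the corpus
    colloc_pos: int  # position of the collocation in the corpus

    def triple(self):
        """Drop the corpus positions, leaving the word sketch triple."""
        return SketchTriple(self.headword, self.relation, self.collocation)


# The first quintuple is the example from the text; the second is made-up
# illustrative data for a later occurrence of the same triple.
occurrences = [
    SketchQuintuple("man", "modifier", "young", 104, 103),
    SketchQuintuple("man", "modifier", "young", 2310, 2309),
]

# Counting quintuples per triple gives the frequency of each collocation.
triple_counts = Counter(q.triple() for q in occurrences)
print(triple_counts[SketchTriple("man", "modifier", "young")])
```

Each quintuple records one occurrence in the corpus, so grouping quintuples by their triple yields the counts on which a collocation score could be based.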
Corpus linguistics and its statistical analyses reveal patterns of co-occurrence within a language and make it possible to work out typical collocations for its lexical items. A co-occurrence restriction is identified when linguistic elements never occur together.
His Corpus, Concordance, Collocation formulated the "idiom principle". [4] Though he had written many books, at his valedictory lecture in 2000 he stated that none of his many published articles passed successfully through peer-review, and that even an article he had been invited to write for a journal was peer-reviewed by mistake and rejected.
The Cambridge International Corpus (CIC) is a collection of over 2 billion words [1] of real spoken and written English. The texts are stored in a database that can be searched to see how English is used. The CIC also contains the Cambridge Learner Corpus, a unique collection of over 60,000 exam papers from Cambridge ESOL.