Search results
Results from the WOW.Com Content Network
Ancient text corpora are the entire collection of texts from the period of ancient history, defined in this article as the period from the beginning of writing up to 300 AD. These corpora are important for the study of literature , history , linguistics , and other fields, and are a fundamental component of the world's cultural heritage .
Provides searchable transliterations and translations of the compositions published in the series State Archives of Assyria, which include many corpora of Neo-Assyrian and Neo-Babylonian texts. various scholars (transliterations and translations from the Neo-Assyrian Text Corpus Project, directed by Simo Parpola) Xcat: The X Catalogue
Text corpora (singular: text corpus) are large and structured sets of texts, which have been systematically collected.Text corpora are used by both AI developers to train large language models and corpus linguists and within other branches of linguistics for statistical analysis, hypothesis testing, finding patterns of language use, investigating language change and variation, and teaching ...
Corpus Corporum (Lat. "the collection of collections") or in full, Corpus Córporum: repositorium operum latinorum apud universitatem Turicensem, is a digital Medieval Latin library developed by the University of Zurich, Institute for Greek and Latin Philology.
The main collection focuses on the classical materials of ancient Greece and ancient Rome, and features an extensive number of texts written in Ancient Greek and Latin chosen for their status as a canonical literary text, in a degree of completeness and representativeness no other digital library can claim. [1]
The TenTen Corpus Family (also called TenTen corpora) is a set of comparable web text corpora, i.e. collections of texts that have been crawled from the World Wide Web and processed to match the same standards. These corpora are made available through the Sketch Engine corpus manager. There are TenTen corpora for more than 35 languages.
The Czech National Corpus (CNC) (Czech : Český národní korpus) is a large electronic corpus of written and spoken Czech language, developed by the Institute of the Czech National Corpus (ICNC) in the Faculty of Arts at Charles University in Prague.
Text corpora are also used in the study of historical documents, for example in attempts to decipher ancient scripts, or in Biblical scholarship. Some archaeological corpora can be of such short duration that they provide a snapshot in time. One of the shortest corpora in time may be the 15–30 year Amarna letters texts .