OTA Core Collection
Datasets and texts collected from a variety of people and projects since 1976. This collection excludes 'Legacy' and 'Text Creation Partnership' items in the Oxford Text Archive; its contents are thought to be of reasonable quality and usefulness.
Recent Submissions
- Lexicon
  Oxford Text Archive Core Collection
  Date of publication: 1450-1700
  Description: Lists of repeated clusters of words, lemmata and part-of-speech tags derived from the 60238 works in the public domain from the Early English Books Online collection, as made available in the Oxford Text Archive collections ...
  This item contains 28 files (3.94 MB). Publicly Available
- Lexicon
  Oxford Text Archive Core Collection
  Date of publication: 200 BCE-2000
  Description: This dataset is a collection of lexical annotations of the corpus occurrences of 40 Latin lemmas. The corpus instances are from LatinISE and the process is described in Schlechtweg et al. (2020, 2021). The annotation was ...
  This item contains 1 file (1.94 MB).
- Lexicon
  Oxford Text Archive Core Collection
  Date of publication: 800 BCE-500
  Description: Datasets containing semantic annotation of the Ancient Greek words mus, harmonia, and kosmos in the Diorisis Ancient Greek corpus. The files are in a tab-separated format. Authors: Viivi Lähteenoja (dataset for ...
  This item contains 3 files (1.29 MB).
- Corpus
  Oxford Text Archive Core Collection
  Date of publication: 1801-1807
  Description: Transcription of letters from Elizabeth Gwillim and Mary Symonds, 1801-1807, British Library APAC IOR Mss.Eur.C.240/1-4. These transcriptions were made by members of the Gwillim Project (2019-2022) from images of the ...
  This item contains 3 files (1.28 GB). Publicly Available
- Corpus
  Oxford Text Archive Core Collection
  Date of publication: 2020
  Description: The CorCenCC corpus contains over 11 million words (circa 14.4m tokens) from written, spoken and electronic (online, digital texts) Welsh language sources, taken from a range of genres, language varieties (regional and ...
  This item contains 1 file (49.41 KB). Publicly Available