GECEG Links


CorpusSearch


CorpusSearch 2 Homepage
CorpusSearch 2 is a tool for linguistic research on syntactically parsed corpora. The GeCeG can be searched with this program.


Early German Corpora


   Research corpora with annotations

DDD Referenzkorpus Altdeutsch
The Old German Reference Corpus collects and annotates the entire extant High and Low German material written between c. 750 and 1050. The texts will be fully annotated for parts of speech, lemmata, sentence tokens, syntax and manuscript layout. A short description of the project can be found here (in German). The corpus is not expected to reach completion for several years.


Reference Corpus Middle High German (1050--1350)
The project Reference Corpus MHG aims at creating a reference corpus of Middle High German annotated with morpho-syntactic information. The corpus comprises all available High German texts composed between 1050 and 1200 and a balanced sample for the period 1200 to 1350. The corpus is scheduled for release in 2014. More information is available here.


Corpus of Historical Low German (CHLG)
The Corpus of Historical Low German is currently under development at the universities of Gent, Manchester and Cambridge. The project has received a £400,000 grant and is expected to be completed by 2017. A project outline can be found here.


Kali Korpus
Kali is a diachronic corpus of German, developed at the German Department of Leibniz Universität Hannover. It contains 25 texts (213,798 words). A list of the texts can be found here. Furthermore, all verb tokens are morphologically annotated and lemmatized. Click here for the annotation guidelines.


   Online text editions

Thesaurus Indogermanischer Text- und Sprachmaterialien (TITUS)
TITUS stores digitized editions of ancient Indo-European texts, including a large number of Old High German works. Some are available only to TITUS members.


Mediaevum - Althochdeutsche Texte im Internet
Mediaevum offers an overview over Old and Middle High German text editions and collections online (in German). Every entry comes with a short description and a link to the complete text.


Digitales Mittelhochdeutsches Textarchiv (MHGTA)
The MHGTA offers an extensive collection of Middle High German texts based on reliable standard editions.


Middle High German Source Materials
A reference list of all texts used for the compilation of the Middle High German Dictionary, many of which are available online.


Manuscripts


Paderborner Repertorium
The Paderborner Repertorium der deutschsprachigen Textüberlieferung des 8. bis 12. Jahrhunderts is a comprehensive database of all c. 300 manuscripts containing German written between the 8th to 12th centuries. The manuscript entries contain very rich information on content, dialect, date of origin, manuscript layout, editions and more. The site also has a very convenient search interface.


Dictionaries


Middle High German Dictionary
The Mittelhochdeutsches Wörterbuch (MWB) is the standard resource for Middle High German lexicography. It is based on a near-exhaustive corpus of Middle High German prose and verse texts.


Gerhardt Köbler's Old High German Dictionary
Köbler's Althochdeutsches Wörterbuch lists for many early German words useful grammatical information, for example declension classes, as well as reconstructions and cognates in other languages.