DraCor

DraCor
Type of site	Science
Available in	English
Owner	Freie Universität Berlin, University of Potsdam
URL	https://dracor.org/
Commercial	No
Launched	2017; 8 years ago
Current status	Online

DraCor (Drama Corpora) is an open digital infrastructure developed for the computational study of European drama from Greco-Roman antiquity to the 20th century. The platform hosts plays encoded in the TEI format across various languages, supporting comparative and computational methods in drama studies. As of 2025, the collection comprised over 4,000 texts in more than 20 languages. Data provided by DraCor has seen widespread use in digital humanities research.^[1] The project received the Rahtz Prize for TEI Ingenuity by the TEI Consortium in 2022.^[2]

Overview

DraCor aims to create reliable, expandable, and interoperable corpora of dramatic literature. The project emphasizes the concept of Programmable Corpora,^[3] where the data is not only accessible but also designed for computational analysis through APIs and integration with other tools. The platform strives to adhere to FAIR data principles (Findable, Accessible, Interoperable, Reusable).

Key features

Multilingual corpora: Contains drama corpora in more than 20 languages, primarily European.
TEI encoding: Texts are encoded according to the TEI guidelines to maintain structural and semantic consistency.
API access: Provides a documented Application Programming Interface for programmatic access to texts and metadata.
Network visualizations: Generates network graphs representing character co-occurrence within plays.
Data download: Offers options to download subsets of texts, such as speeches or stage directions, as well as network data.
Open access: Data is openly available for research and related purposes.
Programmable Corpora: Supports integration with external analytical tools and programming languages, with API wrappers available for Python (pydracor^[4]) and R (rdracor^[5]).

Corpora

DraCor’s collection of corpora is continuously growing and covers plays in French, German, English, Russian, Spanish, Dutch, Italian, Hungarian, Swedish, Polish, Ukrainian, Ancient Greek, Latin, and other languages. Each corpus is curated by individual scholars or teams^[6] and provides rich metadata alongside TEI-encoded texts, supporting analyses of dramatic structures, character interactions, and related topics.

Tools and usage

The DraCor platform includes basic visualization tools, particularly for network analysis. It also supports programmatic access to the corpora, enabling integration into computational research workflows. This facilitates various types of analyses, including:

Quantitative text analysis
Social network analysis of characters
Stylometric studies
Comparative drama research across languages

Community, development, impact

DraCor is developed through collaboration among researchers from multiple institutions. It is maintained by teams at the Freie Universität Berlin and the University of Potsdam.^[7] As an open-source project, it actively encourages community contributions and feedback.

External links

References

^ "DraCor Research". dracor.org. Retrieved 16 May 2025.
^ "Rahtz Prize for TEI Ingenuity". tei-c.org. Retrieved 16 May 2025.
^ Fischer, Frank; Börner, Ingo; et al. (2019). Programmable Corpora: Introducing DraCor, an Infrastructure for the Research on European Drama. DH2019: “Complexities”. Utrecht University. doi:10.5281/zenodo.4284002.
^ "pydracor". Python Package Index. Retrieved 21 May 2025.
^ "rdracor". Comprehensive R Archive Network. Retrieved 21 May 2025.
^ "DraCor Corpus Registry". dracor.org. Retrieved 21 May 2025.
^ "DraCor Credits". dracor.org. Retrieved 21 May 2025.

[Research-1] "DraCor Research". dracor.org. Retrieved 16 May 2025.

[Rahtz-2] "Rahtz Prize for TEI Ingenuity". tei-c.org. Retrieved 16 May 2025.

[Fischer-3] Fischer, Frank; Börner, Ingo; et al. (2019). Programmable Corpora: Introducing DraCor, an Infrastructure for the Research on European Drama. DH2019: “Complexities”. Utrecht University. doi:10.5281/zenodo.4284002.

[pydracor-4] "pydracor". Python Package Index. Retrieved 21 May 2025.

[rdracor-5] "rdracor". Comprehensive R Archive Network. Retrieved 21 May 2025.

[Corpora-6] "DraCor Corpus Registry". dracor.org. Retrieved 21 May 2025.

[Credits-7] "DraCor Credits". dracor.org. Retrieved 21 May 2025.

[1]

[2]

[3]

[4]

[5]

[6]

[7]