Case study
ParlaMint, a CLARIN ERIC flagship project on comparable and interoperable parliamentary corpora, developed uniformly annotated corpora of parliamentary debates of 29 European countries and autonomous regions. ParlaMint I ran between 2020–2021. Its successor, ParlaMint II (2022–2023), not only extended the timespan and geographical coverage of the project, but also provided machine translations into English and improved the usability of the datasets.
Problem addressed
ParlaMint aims to address the growing importance of access, for machines and humans alike, to parliamentary interactions by providing interoperable and comparable datasets of debates across 29 countries and regions of Europe. Enriched with linguistic annotations, named entities, and speaker metadata, these datasets facilitate transnational analyses and enhance the understanding of parliamentary discourse and its societal impact.
Added value
Offers interoperable and comparable corpora for parliamentary discourse.
Enables transnational, cross-lingual, and multidisciplinary analyses.
Provides insights into societal dynamics and enhances understanding locally and globally.
Corpus size in millions of words of ParlaMint I and ParlaMint II, covering the period between 1998 and 2022. Graph by: Matyáš Kopp.