The 8th International Conference


Kraków, Poland (online)

Keynote speakers:

Dagmar Divjak

Jan Rybicki

Benedikt Szmrecsanyi

Unsplashed background img 1

Call for Papers

In the recent years, the availability of large annotated and searchable corpora, together with a new interest in the empirical foundation and validation of linguistic theory and description, has sparked a surge of novel and interesting work using corpus methods to study the grammar of natural languages. However, a look at the relevant research on the grammars of English, as well as other Germanic, Romance or Slavic languages which is currently available, reveals a variety of different theoretical approaches and empirical foci, which can be traced back to different philological and linguistic traditions. Still, this state of affairs should not be seen as an obstacle; it arguably provides an ideal basis for a fruitful exchange of ideas between different research paradigms.

In addition to deepening our knowledge and understanding of individual languages, corpus-oriented work on grammar has wider implications, concerning methodological as well as theoretical aspects. Relevant topics and research questions include e.g. annotation schemata for (larger) syntactic units and syntactic relations, the increased use of (advanced) statistical methods and models in linguistics, the relation and boundary between grammar and discourse, and more generally the interface between corpus linguistics and linguistic theory.

Unsplashed background img 2

Important Details

New deadline for abstract submission: 4 May 2020
Notification of acceptance: 26 June 2020
Registration deadline: 15 August 2020
Conference: 25-27 November 2020

Unsplashed background img 3


We welcome submissions that explore the use of corpus methods in the description and theoretical analysis of the grammar of natural languages. Focal areas of interest include, but are not limited to:

  • Corpus-based studies on the grammar of natural languages, with the focus on Slavic, Germanic and Romance, however contributions pertaining other languages are welcome:
    • The use of (large) corpora in the description of patterns of grammar from both a language-specific and a contrastive/cross-linguistic perspective
    • The identification and formal modelling of (different types of) synchronic linguistic variation using corpus methods
    • New insights into the connection between linguistic variation and change made available by inspecting “language change in progress” in large corpora
    • The use of advanced corpus-linguistic and statistical methods in historical linguistics as a means to compensate for the relative scarcity of data
  • Theoretical and methodological issues pertaining to corpus-oriented research on grammar:
    • Tools, methods and techniques in corpus assembly, annotation and analysis
    • The interaction between corpus linguistics and computational linguistics
    • The interaction between corpus linguistics and linguistic theory
    • The use of statistical and quantitative methods in detecting patterns of grammar
    • The impact of corpus-based vs. corpus-driven approaches on our view/understanding of grammar

Unsplashed background img 3

Book of Abstracts

Keynote speaker
Dagmar Divjak What can be learned from usage? Reconciling descriptive accuracy and cognitive reality using corpus data, large and small.
Jan Rybicki What Else Can Be Done with Lots of Texts: Distant Reading by Counting Words, Lemmas and Part-of-Speech Tags
Benedikt Szmrecsanyi How difficult is grammatical variation, really?

General Session
Tatiana Bladier and Laura Kallmeyer Automatic Extraction of Tree-Wrapping Grammars for German
Renata Bronikowska Unfinished "verbization" - development of predicative constructions with a feminine adjective in the 17th and 18th centuries in the light of corpus data
Gasparde Coutanson Why are there more pataquès in French folk songs? The example of postverbal pataquès
Malgorzata Czachor Different Approaches to Prototypical Transitivity and the Evidence drawn from Polish Participles
Izabela Czerniak Relativisation strategies in English – Output from Penn Parsed Corpora of Historical English
Christian Ebert, Paul Widmer and Balthasar Bickel A usage-based approach to word order variation in Baltic, Slavic, Germanic and Romance
Gauthier Delaby and Timothy Colleman Word order variation in verb clusters with receptive krijgen ‘to get’ + past participle in Dutch
Magdalena Derwojedowa Narrative modal MIEĆ in Polish
Irene Elmerot Why nationality matters – in- and outgroups in the news after 1989
Serhii Fokin Analytical grammatical forms extraction as a new challenge (case of conditional mood in Polish and Ukrainian)
Maite Gil and Augusto Soares da Silva A corpus-based analysis of prepositions and goal-oriented motion verbs in Brazilian Portuguese
Lukasz Grabowski and Nicholas Groom Grammar patterns as an exploratory tool for studying formulaicity in English-to-Polish translation: a corpus-based study
Helena Grochola-Szczepanek and Rafal L. Górski The dialect and the standard – preservation and abandonment of grammatical features of the dialect of Spisz
Wojciech Guz and Łukasz Jędrzejowski On the need for spoken corpora in grammatical description: the case of Polish że as an elaboration marker
Stefan Hartmann and Tobias Ungerer When pink is the new black and data scientists are the new rock stars: Corpus approaches to the productivity of ‘snowclones’
Witold Kieraś, Bartłomiej Nitoń and Marcin Woliński New linguistic annotation in the National Corpus of Polish
Nikolas Koch, Antje Endesfelder Quick and Stefan Hartmann Quantifying early syntactic productivity in child language corpora: A critical evaluation of the Traceback method
Kristin Kopf and Felix Bildhauer What’s driving German genitive alternation? New findings on variation in placement and phrase type
Torsten Leuschner Grammar and Corpora in Historical-Contrastive Linguistics: a Hypothesis-Driven Approach to V1-Conditionals in English and German
Marek Łaziński and Michał Woźniak Verbal aspect in dictionaries and corpora. Tagging aspect pairs
Jagoda Marszałek Textual Cohesion of Medieval Latin Treatises. A corpus-based study.
Dorota Mika and Rafał Ludwik Górski Comparative Constructions in Middle Polish
Matías Guzmán Naranjo and Laura Becker Word order flexibility across types of argument realizations and argument structures
Matías Guzmán Naranjo A distance-based method for analogical classification
Krzysztof Nowak The Sense of Being. Copular and existential meaning of the verb esse ‘to be’ in a medieval Latin corpus
Beatrice Andreea Pahontu A corpus study of the Romanian periphrasis a fi pe cale
Nilo Pedrazzini Same question, different annotation depths: early Slavic dative absolutes in deeply versus shallowly annotated treebanks
Oliver Schallert, Carsten Becker and Helmut Schmid Areal variation in Middle High German: Methodological and quantitative aspects
Dorota Sieroń Italian article and word order in Polish. A contrastive study
Magdalena Szczyrbak 'No man can be supposed to be indifferent to the knowledge of facts.' The passive of reporting verbs in judicial writing: “be expected to” vs “be supposed to”
Beata Trawinski Negation Raising and Mood. Corpus Evidence from Polish
Vojtěch Veselý On modal meanings of the Czech verb muset in constructions with non-agentive verbs
Thilo Weber NP-functions and Grammatical Case in German – A Quantitative Perspective
Björn Wiemer and Joanna Wrzesien-Kwiatkowska Employing cluster analysis to explore the diachronic dynamics of Slavic aspect
Michał Woźniak Corpus similarity measure in grammatical analysis
Piotr Wyroślak Polish reflexive pronoun ‘sobie’ : understanding the ‘ethical’ uses
Daria Bębeniec Prototype identification methods in cognitive corpus linguistics – an overview

SlaviCorp Session
Martina Berrocal, Václav Cvrček and David Lukeš MDA of Czech parliamentary discourse as way to understand populism
Tora Hedin Translating politeness in fiction. A study using Czech parallel corpora.
Michal Křen Czech National Corpus in 2020: Corpora and Applications
Zuzana Laubeová and David Lukeš ORATOR: A new corpus of spoken Czech
Thomas Samuelsson Designing and Building a Corpus of Russian On-line Media
Maria Shvedova General Regionally Annotated Corpus of Ukrainian (GRAC): grammatical diachrony, competing norms, and diaspora language
Dmitri Sitchinava Parallel corpora within the Russian National Corpus: current state and the study of Russian functional words
Vasyl Starko Implementing Semantic Annotation for a Ukrainian Corpus

Unsplashed background img 3
Organizing Committee:
  • Rafał L. Górski (chair)
  • Małgorzata W. Czachor
  • Łukasz Halida
  • Zdzisław Koczarski
  • Dorota Mika
Reading Committee:
  • Anne Abeillé, Piotr Bański, Tilman Berger, Neil Bermel, Felix Bildhauer, Gosse Bouma, Anna Cermakova, Benoit Crabbé, Václav Cvrček, Magdalena Derwojedowa, Achille Falaise, Sandra Hansen-Morath, Milena Hebal-Jezierska, Gerrit Kentner, Michal Křen, Marc Kupietz, Marek Łaziński, Christian Mair, Roland Meyer, John Nerbonne, Krzysztof Nowak, Adam Pawłowski, Vladimir Petkevic, Sophie Prévost, Alexandr Rosen, Paweł Rutkowski, Susan Schlotthauer, Thomas Schmidt, Barbara Sonnenhauser, Juliette Thuilier, Beata Trawinski, Anna Volodina, Ruprecht von Waldenfels, Björn Wiemer, Christoph Wolk, Michał Woźniak