The 8th International Conference

GRAMMAR&CORPORA

-
Kraków, Poland (online)

Keynote speakers:


Dagmar Divjak

Jan Rybicki

Benedikt Szmrecsanyi


Unsplashed background img 1

Call for Papers

In the recent years, the availability of large annotated and searchable corpora, together with a new interest in the empirical foundation and validation of linguistic theory and description, has sparked a surge of novel and interesting work using corpus methods to study the grammar of natural languages. However, a look at the relevant research on the grammars of English, as well as other Germanic, Romance or Slavic languages which is currently available, reveals a variety of different theoretical approaches and empirical foci, which can be traced back to different philological and linguistic traditions. Still, this state of affairs should not be seen as an obstacle; it arguably provides an ideal basis for a fruitful exchange of ideas between different research paradigms.

In addition to deepening our knowledge and understanding of individual languages, corpus-oriented work on grammar has wider implications, concerning methodological as well as theoretical aspects. Relevant topics and research questions include e.g. annotation schemata for (larger) syntactic units and syntactic relations, the increased use of (advanced) statistical methods and models in linguistics, the relation and boundary between grammar and discourse, and more generally the interface between corpus linguistics and linguistic theory.

Unsplashed background img 2

Important Details

New deadline for abstract submission: 4 May 2020
Notification of acceptance: 26 June 2020
Registration deadline: 15 August 2020
Registration deadline for non-presenters: 23 November 2020
Conference: 25-27 November 2020





Registration:
If you wish to listen on-line to the presentations via Zoom and you are not (co-author), nor have you already registered via EasyChair, please register free of charge at https://ankiety.ijp.pan.pl/index.php/697186?lang=en by 23 November 2020. The day before the Conference you will receive an e-mail with the passwords to Zoom-meetings.
Unsplashed background img 3

Submissions

We welcome submissions that explore the use of corpus methods in the description and theoretical analysis of the grammar of natural languages. Focal areas of interest include, but are not limited to:

  • Corpus-based studies on the grammar of natural languages, with the focus on Slavic, Germanic and Romance, however contributions pertaining other languages are welcome:
    • The use of (large) corpora in the description of patterns of grammar from both a language-specific and a contrastive/cross-linguistic perspective
    • The identification and formal modelling of (different types of) synchronic linguistic variation using corpus methods
    • New insights into the connection between linguistic variation and change made available by inspecting “language change in progress” in large corpora
    • The use of advanced corpus-linguistic and statistical methods in historical linguistics as a means to compensate for the relative scarcity of data
  • Theoretical and methodological issues pertaining to corpus-oriented research on grammar:
    • Tools, methods and techniques in corpus assembly, annotation and analysis
    • The interaction between corpus linguistics and computational linguistics
    • The interaction between corpus linguistics and linguistic theory
    • The use of statistical and quantitative methods in detecting patterns of grammar
    • The impact of corpus-based vs. corpus-driven approaches on our view/understanding of grammar

Unsplashed background img 3

Grammar and Corpora 2020 Conference Programme


UTC +1 Day 1: 25/11/2020
link to Zoom meeting
8.30-9.15 Organisational and technical details
9.15-9.30 Opening ceremony
9.30-10.30 Plenary talk:
Dagmar Divjak: What can be learned from usage? Reconciling descriptive accuracy and cognitive reality using corpus data, large and small.
10.30-10.45 Virtual coffee break (free thought exchange)
Track 1
Zoom
Track 2
Zoom
10.45-11.15 Oliver Schallert, Carsten Becker and Helmut Schmid: Areal variation in Middle High German: Methodological and quantitative aspects Witold Kieraś, Bartłomiej Nitoń and Marcin Woliński: New linguistic annotation in the National Corpus of Polish
11.15-11.45 Torsten Leuschner: Grammar and Corpora in Historical-Contrastive Linguistics: a Hypothesis-Driven Approach to V1-Conditionals in English and German Małgorzata Czachor:Different Approaches to Prototypical Transitivity and the Evidence drawn from Polish Participles
11.45-14.00 Lunch break
14.00-14.30 Izabela Czerniak: Relativisation strategies in English – Output from Penn Parsed Corpora of Historical English Vojtěch Veselý On modal meanings of the Czech verb muset in constructions with non-agentive verbs
14.30-15.00 Nilo Pedrazzini: Same question, different annotation depths: early Slavonic dative absolute in deeply versus shallowly annotated treebanks Stefan Hartmann and Tobias Ungerer: When pink is the new black and data scientists are the new rock stars: Corpus approaches to the productivity of ‘snowclones’
15.00-15.30 Virtual coffee break (free thought exchange)
15.30-16.00 Thilo Weber: NP-functions and Grammatical Case in German – A Quantitative Perspective Renata Bronikowska: Unfinished "verbization" - development of predicative constructions with a feminine adjective in the 17th and 18th centuries in the light of corpus data
UTC +1 Day 2: 26/11/2020
Track 1
Zoom
Track 2
Zoom
9.00-9.30 Björn Wiemer and Joanna Wrzesien-Kwiatkowska: Employing cluster analysis to explore the diachronic dynamics of Slavic aspect Daria Bębeniec: Prototype identification methods in cognitive corpus linguistics – an overview
9.30-10.00 Beata Trawiński: Negation Raising and Mood. Corpus Evidence from Polish Matías Guzmán Naranjo: A distance-based method for analogical classification
10.00-10.30 Łukasz Grabowski and Nicholas Groom: Grammar patterns as an exploratory tool for studying formulaicity in English-to-Polish translation: a corpus-based study Marek Łaziński and Michał Woźniak: Verbal aspect in dictionaries and corpora. Tagging aspect pairs
10.30-11.30 Virtual coffee break (free thought exchange)
11.30-12.30 Plenary talk:
Benedikt Szmrecsanyi: How difficult is grammatical variation, really?
Zoom
12.30-14.00 Lunch break
14.00-14.30 Beatrice Andreea Pahontu: A corpus study of the Romanian periphrasis a fi pe cale Dorota Mika and Rafał L. Górski: Comparative Constructions in Middle Polish
14.30-15.00 Maite Gil and Augusto Soares da Silva A corpus-based analysis of prepositions and goal-oriented motion verbs in Brazilian Portuguese Magdalena Derwojedowa Narrative modal MIEĆ in Polish
15.00-15.30 Virtual coffee break (free thought exchange)
15.30-16.00 Christian Ebert, Paul Widmer and Balthasar Bickel A usage-based approach to word order variation in Baltic, Slavic, Germanic and Romance Serhii Fokin: Analytical grammatical forms extraction as a new challenge (case of conditional mood in Polish and Ukrainian)
16.00-16.30 Dorota Sieroń Italian article and word order in Polish. A contrastive study Michał Woźniak: Corpus similarity measure in grammatical analysis
16.30-17.00 Gauthier Delaby and Timothy Colleman Word order variation in verb clusters with receptive krijgen ‘to get’ + past participle in Dutch Magdalena Szczyrbak:'No man can be supposed to be indifferent to the knowledge of facts.' The passive of reporting verbs in judicial writing: “be expected to” vs “be supposed to”
UTC +1 Day 3: 27/11/2020
G&C
Zoom
9.00-9.30 Kristin Kopf and Felix Bildhauer: What’s driving German genitive alternation? New findings on variation in placement and phrase type
9.30-10.00 Gasparde Coutanson: Why are there more pataquès in French folk songs? The example of postverbal pataquès
10.00-10.30 Wojciech Guz and Łukasz Jędrzejowski: On the need for spoken corpora in grammatical description: the case of Polish że as an elaboration marker
10.30-11.00 Virtual coffee break (free thought exchange)
11.00-12.00 Plenary talk:
Jan Rybicki: What Else Can Be Done with Lots of Texts: Distant Reading by Counting Words, Lemmas and Part-of-Speech TagsZoom
12.00-14.00 Lunch break
14.00-14.30 Piotr Wyroślak: Polish reflexive pronoun sobie: understanding the ‘ethical’ use
14.30-15.00 Tatiana Bladier, Laura Kallmeyer: Automatic Extraction of Tree-Wrapping Grammars for German.
15.00-15.30 Nikolas Koch, Antje Endesfelder Quick and Stefan Hartmann: Quantifying early syntactic productivity in child language corpora: A critical evaluation of the Traceback method
15.30-16.00 Helena Grochola-Szczepanek and Rafal L. Górski: The dialect and the standard – preservation and abandonment of grammatical features of the dialect of Spisz
16.00-16.30 Closing remarks
UTC +1 Day 3: 27/11/2020
SlaviCorp
Zoom
9.00-09.30 Introduction by Marek Łaziński
9.30-10.00 Martina Berrocal, Václav Cvrček and David Lukeš: MDA of Czech parliamentary discourse as way to understand populism
10.00-10.30 Thomas Samuelsson: Designing and Building a Corpus of Russian On-line Media
10.30-11.00 Virtual coffee break (free thought exchange)
11.00-11.30 Zuzana Laubeová and David Lukeš: ORATOR: A new corpus of spoken Czech
11.30-12.00 Maria Shvedova: General Regionally Annotated Corpus of Ukrainian (GRAC): grammatical diachrony, competing norms, and diaspora language
12.00-12.30 Vasyl Starko: Implementing Semantic Annotation for a Ukrainian Corpus
12.30-14.00 Lunch break
14.00-14.30 Dmitri Sitchinava: Parallel corpora within the Russian National Corpus: current state and the study of Russian functional words
14.30-15.00 Irene Elmerot: Why nationality matters – in- and outgroups in the news after 1989
15.00-15.30 Virtual coffee break (free thought exchange)
15.30-16.00 Michal Křen: Czech National Corpus in 2020: Corpora and Applications
16.00-16.30 Tora Hedin: Translating politeness in fiction. A study using Czech parallel corpora
16.30-17.00 Closing remarks
Unsplashed background img 3

Book of Abstracts


Keynote speaker
Dagmar Divjak What can be learned from usage? Reconciling descriptive accuracy and cognitive reality using corpus data, large and small.
Jan Rybicki What Else Can Be Done with Lots of Texts: Distant Reading by Counting Words, Lemmas and Part-of-Speech Tags
Benedikt Szmrecsanyi How difficult is grammatical variation, really?


General Session
Daria Bębeniec
Maria Curie-Skłodowska University
Prototype identification methods in cognitive corpus linguistics – an overview
Laura Becker
University of Erlangen-Nuremberg
Subordination in spoken language: A crosslinguistic corpus study
Laura Becker
University of Erlangen-Nuremberg
Grammatical properties that favor the development of indefinite articles
Tatiana Bladier and Laura Kallmeyer
Heinrich Heine University Düsseldorf
Automatic Extraction of Tree-Wrapping Grammars for German
Renata Bronikowska
Institute of Polish Language PAS
Unfinished "verbization" - development of predicative constructions with a feminine adjective in the 17th and 18th centuries in the light of corpus data
Gasparde Coutanson
Université Paris Nanterre
Why are there more pataquès in French folk songs? The example of postverbal pataquès
Małgorzata Czachor
Institute of Polish Language PAS
Different Approaches to Prototypical Transitivity and the Evidence drawn from Polish Participles
Izabela Czerniak
Åbo Akademi University
Relativisation strategies in English – Output from Penn Parsed Corpora of Historical English
Christian Ebert, Paul Widmer and Balthasar Bickel
University of Zurich
A usage-based approach to word order variation in Baltic, Slavic, Germanic and Romance
Gauthier Delaby and Timothy Colleman
Ghent University
Word order variation in verb clusters with receptive krijgen ‘to get’ + past participle in Dutch
Magdalena Derwojedowa
University of Warsaw
Narrative modal MIEĆ in Polish
Serhii Fokin
Taras Shevchenko National University of Kyiv
Analytical grammatical forms extraction as a new challenge (case of conditional mood in Polish and Ukrainian)
Maite Gil and Augusto Soares da Silva
Universidade Católica Portuguesa & Universidade do Minho, Universidade Católica Portuguesa
A corpus-based analysis of prepositions and goal-oriented motion verbs in Brazilian Portuguese
Lukasz Grabowski and Nicholas Groom
University of Opole, University of Birmingham
Grammar patterns as an exploratory tool for studying formulaicity in English-to-Polish translation: a corpus-based study
Helena Grochola-Szczepanek and Rafal L. Górski
Institute of Polish Language PAS
The dialect and the standard – preservation and abandonment of grammatical features of the dialect of Spisz
Wojciech Guz and Łukasz Jędrzejowski
Lublin Catholic University, University of Cologne
On the need for spoken corpora in grammatical description: the case of Polish że as an elaboration marker
Stefan Hartmann and Tobias Ungerer
University of Düsseldorf, The University of Edinburgh
When pink is the new black and data scientists are the new rock stars: Corpus approaches to the productivity of ‘snowclones’
Witold Kieraś, Bartłomiej Nitoń and Marcin Woliński
Institute of Computer Science PAS
New linguistic annotation in the National Corpus of Polish
Nikolas Koch, Antje Endesfelder Quick and Stefan Hartmann
Ludwig Maximilian University of Munich, University of Leipzig, University of Düsseldorf
Quantifying early syntactic productivity in child language corpora: A critical evaluation of the Traceback method
Kristin Kopf and Felix Bildhauer
Leibniz-Institut für Deutsche Sprache
What’s driving German genitive alternation? New findings on variation in placement and phrase type
Torsten Leuschner
Ghent University
Grammar and Corpora in Historical-Contrastive Linguistics: a Hypothesis-Driven Approach to V1-Conditionals in English and German
Marek Łaziński and Michał Woźniak
University of Warsaw, Institute of Polish Language PAS
Verbal aspect in dictionaries and corpora. Tagging aspect pairs
Jagoda Marszałek
Institute of Polish Language PAS
Textual Cohesion of Medieval Latin Treatises. A corpus-based study.
Dorota Mika and Rafał Ludwik Górski
Institute of Polish Language PAS
Comparative Constructions in Middle Polish
Matías Guzmán Naranjo and Laura Becker
Université de Paris, University of Erlangen-Nuremberg
Word order flexibility across types of argument realizations and argument structures
Matías Guzmán Naranjo
Université de Paris
A distance-based method for analogical classification
Krzysztof Nowak
Institute of Polish Language PAS
The Sense of Being. Copular and existential meaning of the verb esse ‘to be’ in a medieval Latin corpus
Beatrice Andreea Pahontu
Paris Diderot University and University of Bucharest
A corpus study of the Romanian periphrasis a fi pe cale
Nilo Pedrazzini
University of Oxford
Same question, different annotation depths: early Slavic dative absolutes in deeply versus shallowly annotated treebanks
Oliver Schallert, Carsten Becker and Helmut Schmid
University of Munich, University of Marburg, University of Munich
Areal variation in Middle High German: Methodological and quantitative aspects
Dorota Sieroń
Jagiellonian University
Italian article and word order in Polish. A contrastive study
Magdalena Szczyrbak
Jagiellonian University<
'No man can be supposed to be indifferent to the knowledge of facts.' The passive of reporting verbs in judicial writing: “be expected to” vs “be supposed to”
Beata Trawinski
Leibniz-Institut für Deutsche Sprache
Negation Raising and Mood. Corpus Evidence from Polish
Vojtěch Veselý
Czech Language Institute CAS
On modal meanings of the Czech verb muset in constructions with non-agentive verbs
Thilo Weber
Leibniz-Institut für Deutsche Sprache Mannheim
NP-functions and Grammatical Case in German – A Quantitative Perspective
Björn Wiemer and Joanna Wrzesien-Kwiatkowska
Johannes-Gutenberg University Mainz
Employing cluster analysis to explore the diachronic dynamics of Slavic aspect
Michał Woźniak
Institute of Polish Language PAS
Corpus similarity measure in grammatical analysis
Piotr Wyroślak
Adam Mickiewicz University in Poznań and Universitè Paris 8
Polish reflexive pronoun ‘sobie’ : understanding the ‘ethical’ uses


SlaviCorp Session
Martina Berrocal, Václav Cvrček and David Lukeš
Friedrich Schiller University Jena, Czech National Corpus, Czech National Corpus
MDA of Czech parliamentary discourse as way to understand populism
Irene Elmerot
Stockholm University
Why nationality matters – in- and outgroups in the news after 1989
Tora Hedin
Stockholm University
Translating politeness in fiction. A study using Czech parallel corpora.
Michal Křen
Czech National Corpus
Czech National Corpus in 2020: Corpora and Applications
Zuzana Laubeová and David Lukeš
Czech National Corpus
ORATOR: A new corpus of spoken Czech
Thomas Samuelsson
Stockholm University
Designing and Building a Corpus of Russian On-line Media
Maria Shvedova
Kyiv National Linguistic University
General Regionally Annotated Corpus of Ukrainian (GRAC): grammatical diachrony, competing norms, and diaspora language
Dmitri Sitchinava
Russian Language Institute / Higher School of Economics, Moscow
Parallel corpora within the Russian National Corpus: current state and the study of Russian functional words
Vasyl Starko
Ukrainian Catholic University
Implementing Semantic Annotation for a Ukrainian Corpus
Unsplashed background img 3
Organizing Committee:
  • Rafał L. Górski (chair)
  • Małgorzata W. Czachor
  • Łukasz Halida
  • Zdzisław Koczarski
  • Albert Leśniak
  • Dorota Mika
  • Krzysztof Nowak
Reading Committee:
  • Anne Abeillé, Piotr Bański, Tilman Berger, Neil Bermel, Felix Bildhauer, Gosse Bouma, Anna Cermakova, Benoit Crabbé, Václav Cvrček, Magdalena Derwojedowa, Achille Falaise, Sandra Hansen-Morath, Milena Hebal-Jezierska, Gerrit Kentner, Michal Křen, Marc Kupietz, Marek Łaziński, Christian Mair, Roland Meyer, John Nerbonne, Krzysztof Nowak, Adam Pawłowski, Vladimir Petkevic, Sophie Prévost, Alexandr Rosen, Paweł Rutkowski, Susan Schlotthauer, Thomas Schmidt, Barbara Sonnenhauser, Juliette Thuilier, Beata Trawinski, Anna Volodina, Ruprecht von Waldenfels, Björn Wiemer, Christoph Wolk, Michał Woźniak