Grammar & Corpora 2020

The 8^th International Conference

GRAMMAR&CORPORA

25-27 November 2020

Kraków, Poland (online)

Keynote speakers:

Dagmar Divjak

Jan Rybicki

Benedikt Szmrecsanyi

Call for Papers

In the recent years, the availability of large annotated and searchable corpora, together with a new interest in the empirical foundation and validation of linguistic theory and description, has sparked a surge of novel and interesting work using corpus methods to study the grammar of natural languages. However, a look at the relevant research on the grammars of English, as well as other Germanic, Romance or Slavic languages which is currently available, reveals a variety of different theoretical approaches and empirical foci, which can be traced back to different philological and linguistic traditions. Still, this state of affairs should not be seen as an obstacle; it arguably provides an ideal basis for a fruitful exchange of ideas between different research paradigms.

In addition to deepening our knowledge and understanding of individual languages, corpus-oriented work on grammar has wider implications, concerning methodological as well as theoretical aspects. Relevant topics and research questions include e.g. annotation schemata for (larger) syntactic units and syntactic relations, the increased use of (advanced) statistical methods and models in linguistics, the relation and boundary between grammar and discourse, and more generally the interface between corpus linguistics and linguistic theory.

Important Details

New deadline for abstract submission: 4 May 2020
Notification of acceptance: 26 June 2020
Registration deadline: 15 August 2020
Registration deadline for non-presenters: 23 November 2020
Conference: 25-27 November 2020

Registration:
If you wish to listen on-line to the presentations via Zoom and you are not (co-author), nor have you already registered via EasyChair, please register free of charge at https://ankiety.ijp.pan.pl/index.php/697186?lang=en by 23 November 2020. The day before the Conference you will receive an e-mail with the passwords to Zoom-meetings.

Submissions

We welcome submissions that explore the use of corpus methods in the description and theoretical analysis of the grammar of natural languages. Focal areas of interest include, but are not limited to:

Corpus-based studies on the grammar of natural languages, with the focus on Slavic, Germanic and Romance, however contributions pertaining other languages are welcome:
- The use of (large) corpora in the description of patterns of grammar from both a language-specific and a contrastive/cross-linguistic perspective
- The identification and formal modelling of (different types of) synchronic linguistic variation using corpus methods
- New insights into the connection between linguistic variation and change made available by inspecting “language change in progress” in large corpora
- The use of advanced corpus-linguistic and statistical methods in historical linguistics as a means to compensate for the relative scarcity of data
Theoretical and methodological issues pertaining to corpus-oriented research on grammar:
- Tools, methods and techniques in corpus assembly, annotation and analysis
- The interaction between corpus linguistics and computational linguistics
- The interaction between corpus linguistics and linguistic theory
- The use of statistical and quantitative methods in detecting patterns of grammar
- The impact of corpus-based vs. corpus-driven approaches on our view/understanding of grammar

Grammar and Corpora 2020 Conference Programme

UTC +1	Day 1: 25/11/2020 link to Zoom meeting
8.30-9.15	Organisational and technical details
9.15-9.30	Opening ceremony
9.30-10.30	Plenary talk: Dagmar Divjak: What can be learned from usage? Reconciling descriptive accuracy and cognitive reality using corpus data, large and small.
10.30-10.45	Virtual coffee break (free thought exchange)
	Track 1 Zoom	Track 2 Zoom
10.45-11.15	Oliver Schallert, Carsten Becker and Helmut Schmid: Areal variation in Middle High German: Methodological and quantitative aspects	Witold Kieraś, Bartłomiej Nitoń and Marcin Woliński: New linguistic annotation in the National Corpus of Polish
11.15-11.45	Torsten Leuschner: Grammar and Corpora in Historical-Contrastive Linguistics: a Hypothesis-Driven Approach to V1-Conditionals in English and German	Małgorzata Czachor:Different Approaches to Prototypical Transitivity and the Evidence drawn from Polish Participles
11.45-14.00	Lunch break
14.00-14.30	Izabela Czerniak: Relativisation strategies in English – Output from Penn Parsed Corpora of Historical English	Vojtěch Veselý On modal meanings of the Czech verb muset in constructions with non-agentive verbs
14.30-15.00	Nilo Pedrazzini: Same question, different annotation depths: early Slavonic dative absolute in deeply versus shallowly annotated treebanks	Stefan Hartmann and Tobias Ungerer: When pink is the new black and data scientists are the new rock stars: Corpus approaches to the productivity of ‘snowclones’
15.00-15.30	Virtual coffee break (free thought exchange)
15.30-16.00	Thilo Weber: NP-functions and Grammatical Case in German – A Quantitative Perspective	Renata Bronikowska: Unfinished "verbization" - development of predicative constructions with a feminine adjective in the 17th and 18th centuries in the light of corpus data
UTC +1	Day 2: 26/11/2020
	Track 1 Zoom	Track 2 Zoom
9.00-9.30	Björn Wiemer and Joanna Wrzesien-Kwiatkowska: Employing cluster analysis to explore the diachronic dynamics of Slavic aspect	Daria Bębeniec: Prototype identification methods in cognitive corpus linguistics – an overview
9.30-10.00	Beata Trawiński: Negation Raising and Mood. Corpus Evidence from Polish	Matías Guzmán Naranjo: A distance-based method for analogical classification
10.00-10.30	Łukasz Grabowski and Nicholas Groom: Grammar patterns as an exploratory tool for studying formulaicity in English-to-Polish translation: a corpus-based study	Marek Łaziński and Michał Woźniak: Verbal aspect in dictionaries and corpora. Tagging aspect pairs
10.30-11.30	Virtual coffee break (free thought exchange)
11.30-12.30	Plenary talk: Benedikt Szmrecsanyi: How difficult is grammatical variation, really? Zoom
12.30-14.00	Lunch break
14.00-14.30	Beatrice Andreea Pahontu: A corpus study of the Romanian periphrasis a fi pe cale	Dorota Mika and Rafał L. Górski: Comparative Constructions in Middle Polish
14.30-15.00	Maite Gil and Augusto Soares da Silva A corpus-based analysis of prepositions and goal-oriented motion verbs in Brazilian Portuguese	Magdalena Derwojedowa Narrative modal MIEĆ in Polish
15.00-15.30	Virtual coffee break (free thought exchange)
15.30-16.00	Christian Ebert, Paul Widmer and Balthasar Bickel A usage-based approach to word order variation in Baltic, Slavic, Germanic and Romance	Serhii Fokin: Analytical grammatical forms extraction as a new challenge (case of conditional mood in Polish and Ukrainian)
16.00-16.30	Dorota Sieroń Italian article and word order in Polish. A contrastive study	Michał Woźniak: Corpus similarity measure in grammatical analysis
16.30-17.00	Gauthier Delaby and Timothy Colleman Word order variation in verb clusters with receptive krijgen ‘to get’ + past participle in Dutch	Magdalena Szczyrbak:'No man can be supposed to be indifferent to the knowledge of facts.' The passive of reporting verbs in judicial writing: “be expected to” vs “be supposed to”
UTC +1	Day 3: 27/11/2020 G&C Zoom
9.00-9.30	Kristin Kopf and Felix Bildhauer: What’s driving German genitive alternation? New findings on variation in placement and phrase type
9.30-10.00	Gasparde Coutanson: Why are there more pataquès in French folk songs? The example of postverbal pataquès
10.00-10.30	Wojciech Guz and Łukasz Jędrzejowski: On the need for spoken corpora in grammatical description: the case of Polish że as an elaboration marker
10.30-11.00	Virtual coffee break (free thought exchange)
11.00-12.00	Plenary talk: Jan Rybicki: What Else Can Be Done with Lots of Texts: Distant Reading by Counting Words, Lemmas and Part-of-Speech Tags Zoom
12.00-14.00	Lunch break
14.00-14.30	Piotr Wyroślak: Polish reflexive pronoun sobie: understanding the ‘ethical’ use
14.30-15.00	Tatiana Bladier, Laura Kallmeyer: Automatic Extraction of Tree-Wrapping Grammars for German.
15.00-15.30	Nikolas Koch, Antje Endesfelder Quick and Stefan Hartmann: Quantifying early syntactic productivity in child language corpora: A critical evaluation of the Traceback method
15.30-16.00	Helena Grochola-Szczepanek and Rafal L. Górski: The dialect and the standard – preservation and abandonment of grammatical features of the dialect of Spisz
16.00-16.30	Closing remarks
UTC +1	Day 3: 27/11/2020 SlaviCorp Zoom
9.00-09.30		Introduction by Marek Łaziński
9.30-10.00		Martina Berrocal, Václav Cvrček and David Lukeš: MDA of Czech parliamentary discourse as way to understand populism
10.00-10.30		Thomas Samuelsson: Designing and Building a Corpus of Russian On-line Media
10.30-11.00	Virtual coffee break (free thought exchange)
11.00-11.30		Zuzana Laubeová and David Lukeš: ORATOR: A new corpus of spoken Czech
11.30-12.00		Maria Shvedova: General Regionally Annotated Corpus of Ukrainian (GRAC): grammatical diachrony, competing norms, and diaspora language
12.00-12.30		Vasyl Starko: Implementing Semantic Annotation for a Ukrainian Corpus
12.30-14.00	Lunch break
14.00-14.30		Dmitri Sitchinava: Parallel corpora within the Russian National Corpus: current state and the study of Russian functional words
14.30-15.00		Irene Elmerot: Why nationality matters – in- and outgroups in the news after 1989
15.00-15.30	Virtual coffee break (free thought exchange)
15.30-16.00		Michal Křen: Czech National Corpus in 2020: Corpora and Applications
16.00-16.30		Tora Hedin: Translating politeness in fiction. A study using Czech parallel corpora
16.30-17.00	Closing remarks

Book of Abstracts

Keynote speaker
Dagmar Divjak	What can be learned from usage? Reconciling descriptive accuracy and cognitive reality using corpus data, large and small.
Jan Rybicki	What Else Can Be Done with Lots of Texts: Distant Reading by Counting Words, Lemmas and Part-of-Speech Tags
Benedikt Szmrecsanyi	How difficult is grammatical variation, really?

General Session
Daria Bębeniec Maria Curie-Skłodowska University	Prototype identification methods in cognitive corpus linguistics – an overview
Laura Becker University of Erlangen-Nuremberg	Subordination in spoken language: A crosslinguistic corpus study
Laura Becker University of Erlangen-Nuremberg	Grammatical properties that favor the development of indefinite articles
Tatiana Bladier and Laura Kallmeyer Heinrich Heine University Düsseldorf	Automatic Extraction of Tree-Wrapping Grammars for German
Renata Bronikowska Institute of Polish Language PAS	Unfinished "verbization" - development of predicative constructions with a feminine adjective in the 17th and 18th centuries in the light of corpus data
Gasparde Coutanson Université Paris Nanterre	Why are there more pataquès in French folk songs? The example of postverbal pataquès
Małgorzata Czachor Institute of Polish Language PAS	Different Approaches to Prototypical Transitivity and the Evidence drawn from Polish Participles
Izabela Czerniak Åbo Akademi University	Relativisation strategies in English – Output from Penn Parsed Corpora of Historical English
Christian Ebert, Paul Widmer and Balthasar Bickel University of Zurich	A usage-based approach to word order variation in Baltic, Slavic, Germanic and Romance
Gauthier Delaby and Timothy Colleman Ghent University	Word order variation in verb clusters with receptive krijgen ‘to get’ + past participle in Dutch
Magdalena Derwojedowa University of Warsaw	Narrative modal MIEĆ in Polish
Serhii Fokin Taras Shevchenko National University of Kyiv	Analytical grammatical forms extraction as a new challenge (case of conditional mood in Polish and Ukrainian)
Maite Gil and Augusto Soares da Silva Universidade Católica Portuguesa & Universidade do Minho, Universidade Católica Portuguesa	A corpus-based analysis of prepositions and goal-oriented motion verbs in Brazilian Portuguese
Lukasz Grabowski and Nicholas Groom University of Opole, University of Birmingham	Grammar patterns as an exploratory tool for studying formulaicity in English-to-Polish translation: a corpus-based study
Helena Grochola-Szczepanek and Rafal L. Górski Institute of Polish Language PAS	The dialect and the standard – preservation and abandonment of grammatical features of the dialect of Spisz
Wojciech Guz and Łukasz Jędrzejowski Lublin Catholic University, University of Cologne	On the need for spoken corpora in grammatical description: the case of Polish że as an elaboration marker
Stefan Hartmann and Tobias Ungerer University of Düsseldorf, The University of Edinburgh	When pink is the new black and data scientists are the new rock stars: Corpus approaches to the productivity of ‘snowclones’
Witold Kieraś, Bartłomiej Nitoń and Marcin Woliński Institute of Computer Science PAS	New linguistic annotation in the National Corpus of Polish
Nikolas Koch, Antje Endesfelder Quick and Stefan Hartmann Ludwig Maximilian University of Munich, University of Leipzig, University of Düsseldorf	Quantifying early syntactic productivity in child language corpora: A critical evaluation of the Traceback method
Kristin Kopf and Felix Bildhauer Leibniz-Institut für Deutsche Sprache	What’s driving German genitive alternation? New findings on variation in placement and phrase type
Torsten Leuschner Ghent University	Grammar and Corpora in Historical-Contrastive Linguistics: a Hypothesis-Driven Approach to V1-Conditionals in English and German
Marek Łaziński and Michał Woźniak University of Warsaw, Institute of Polish Language PAS	Verbal aspect in dictionaries and corpora. Tagging aspect pairs
Jagoda Marszałek Institute of Polish Language PAS	Textual Cohesion of Medieval Latin Treatises. A corpus-based study.
Dorota Mika and Rafał Ludwik Górski Institute of Polish Language PAS	Comparative Constructions in Middle Polish
Matías Guzmán Naranjo and Laura Becker Université de Paris, University of Erlangen-Nuremberg	Word order flexibility across types of argument realizations and argument structures
Matías Guzmán Naranjo Université de Paris	A distance-based method for analogical classification
Krzysztof Nowak Institute of Polish Language PAS	The Sense of Being. Copular and existential meaning of the verb esse ‘to be’ in a medieval Latin corpus
Beatrice Andreea Pahontu Paris Diderot University and University of Bucharest	A corpus study of the Romanian periphrasis a fi pe cale
Nilo Pedrazzini University of Oxford	Same question, different annotation depths: early Slavic dative absolutes in deeply versus shallowly annotated treebanks
Oliver Schallert, Carsten Becker and Helmut Schmid University of Munich, University of Marburg, University of Munich	Areal variation in Middle High German: Methodological and quantitative aspects
Dorota Sieroń Jagiellonian University	Italian article and word order in Polish. A contrastive study
Magdalena Szczyrbak Jagiellonian University<	'No man can be supposed to be indifferent to the knowledge of facts.' The passive of reporting verbs in judicial writing: “be expected to” vs “be supposed to”
Beata Trawinski Leibniz-Institut für Deutsche Sprache	Negation Raising and Mood. Corpus Evidence from Polish
Vojtěch Veselý Czech Language Institute CAS	On modal meanings of the Czech verb muset in constructions with non-agentive verbs
Thilo Weber Leibniz-Institut für Deutsche Sprache Mannheim	NP-functions and Grammatical Case in German – A Quantitative Perspective
Björn Wiemer and Joanna Wrzesien-Kwiatkowska Johannes-Gutenberg University Mainz	Employing cluster analysis to explore the diachronic dynamics of Slavic aspect
Michał Woźniak Institute of Polish Language PAS	Corpus similarity measure in grammatical analysis
Piotr Wyroślak Adam Mickiewicz University in Poznań and Universitè Paris 8	Polish reflexive pronoun ‘sobie’ : understanding the ‘ethical’ uses

SlaviCorp Session
Martina Berrocal, Václav Cvrček and David Lukeš Friedrich Schiller University Jena, Czech National Corpus, Czech National Corpus	MDA of Czech parliamentary discourse as way to understand populism
Irene Elmerot Stockholm University	Why nationality matters – in- and outgroups in the news after 1989
Tora Hedin Stockholm University	Translating politeness in fiction. A study using Czech parallel corpora.
Michal Křen Czech National Corpus	Czech National Corpus in 2020: Corpora and Applications
Zuzana Laubeová and David Lukeš Czech National Corpus	ORATOR: A new corpus of spoken Czech
Thomas Samuelsson Stockholm University	Designing and Building a Corpus of Russian On-line Media
Maria Shvedova Kyiv National Linguistic University	General Regionally Annotated Corpus of Ukrainian (GRAC): grammatical diachrony, competing norms, and diaspora language
Dmitri Sitchinava Russian Language Institute / Higher School of Economics, Moscow	Parallel corpora within the Russian National Corpus: current state and the study of Russian functional words
Vasyl Starko Ukrainian Catholic University	Implementing Semantic Annotation for a Ukrainian Corpus

Organizing Committee:

Rafał L. Górski (chair)
Małgorzata W. Czachor
Łukasz Halida
Zdzisław Koczarski
Albert Leśniak
Dorota Mika
Krzysztof Nowak

Reading Committee:

Anne Abeillé, Piotr Bański, Tilman Berger, Neil Bermel, Felix Bildhauer, Gosse Bouma, Anna Cermakova, Benoit Crabbé, Václav Cvrček, Magdalena Derwojedowa, Achille Falaise, Sandra Hansen-Morath, Milena Hebal-Jezierska, Gerrit Kentner, Michal Křen, Marc Kupietz, Marek Łaziński, Christian Mair, Roland Meyer, John Nerbonne, Krzysztof Nowak, Adam Pawłowski, Vladimir Petkevic, Sophie Prévost, Alexandr Rosen, Paweł Rutkowski, Susan Schlotthauer, Thomas Schmidt, Barbara Sonnenhauser, Juliette Thuilier, Beata Trawinski, Anna Volodina, Ruprecht von Waldenfels, Björn Wiemer, Christoph Wolk, Michał Woźniak

The 8th International Conference