- Oxford Handbooks in Linguistics
- The Oxford Handbook of Information Structure
- The Contributors
- Alternative Semantics
- (Contrastive) Topic
- Question-based Models of Information Structure
- Information Structure and the Landscape of (Non-)at-issue Meaning
- Information Structure and Presupposition
- Information Structure: A Cartographic Perspective
- Nuclear Stress and Information Structure
- Focus Projection Theories
- Constraint Conflict and Information Structure
- Focus Sensitive Operators
- Quantification and Information Structure
- Contrast: Dissecting an Elusive Information-structural Notion and its Role in Grammar
- Verum Focus
- Predicate Focus
- Information Structure and Discourse Particles
- Ellipsis and Information Structure
- Word Order and Information Structure
- Dislocations and Information Structure
- On the Expression of Focus in the Metrical Grid and in the Prosodic Hierarchy
- Focus, Intonation, and Tonal Height
- Second Occurrence Focus
- Information Structure and Language Change
- Information Structure and Language Comprehension: Insights from Psycholinguistics
- Information Structure and Production Planning
- Information Structure in First Language Acquisition
- Towards a Neurobiology of Information Structure
- Corpus Linguistics and Information Structure Research
- Syntactic and Prosodic Reflexes of Information Structure in Germanic
- Syntactic and Prosodic Effects of Information Structure in Romance
- Discourse Functions: The Case of Hungarian
- Information Structure in Modern Greek
- Information Structure in Slavic
- Topic and Focus Marking in Chinese
- Information Structure in Japanese
- Information Structure in Asia: Yongning Na (Sino-Tibetan) and Vietnamese (Austroasiatic)
- Information Structure in Bantu
- Information Structure in Sign Languages
- Subject Index
- Language Index
- Oxford Handbooks in Linguistics
Abstract and Keywords
This chapter describes the contributions that Corpus Linguistics (the study of linguistic phenomena by means of systematically exploiting collections of naturally-occurring linguistic data) can make to IS research. It discusses issues of designing a corpus that can serve as a basis for qualitative or quantitative studies, and then turns to the central issue of data annotation: what corpora are available that have been annotated with IS-related annotations, and how can such annotations be evaluated? In case a corpus does not have direct IS annotation, can other types of annotations, especially in the form of multi-layer annotation, be used as indirect evidence for the presence of IS phenomena? Next, the present state of the art in automatic IS annotation (by means of techniques from computational linguistics) is sketched, and finally, several sample studies that exploit IS annotations are introduced briefly.
Anke Lüdeling is Professor of Corpus Linguistics and Morphology at Humboldt-Universität zu Berlin. She is interested in variation and the interplay between qualitative and quantitative knowledge, as well as in methodologies for analyzing and annotating corpora of ‘non-standard’ varieties of Germans such as learner language, or spoken language. She has been involved in building several corpora and tools for corpus search and analysis.
Julia Ritz is Development Manager at a software manufacturer in the media industry. Her areas of research include information status and the related concepts of coreference, specificity, reference and (non-)referentiality. She has worked on computational models for information structural categories, as well as on Information Extraction tasks, and the extraction of terminology and collocations.
Manfred Stede is Professor of Applied Computational Linguistics at Universität Potsdam. Besides computational applications of text mining, his research revolves around various aspects of discourse structure, ranging from more semantic phenomena, such as coreference, to pragmatic description, e.g. for the structure of argumentation. As a foundation for this work, he devises frameworks of multi-layer text annotation, and he is also interested in annotation methodology and practice.
Amir Zeldes is Assistant Professor of Computational Linguistics at Georgetown University. He works on corpus annotation, search, and visualization, especially for multilayer corpora. His work focuses on the syntax¬-semantics interface, where meaning and world knowledge are mapped onto lexical choice. His research explores the idea that constructions have idiosyncratic degrees of innovation that speakers must learn. He has worked on many topics, including German SLA and NLP for under-resourced languages.
Access to the complete content on Oxford Handbooks Online requires a subscription or purchase. Public users are able to search the site and view the abstracts and keywords for each book and chapter without a subscription.
If you have purchased a print title that contains an access token, please see the token for information about how to register your code.