- Oxford Handbooks in Linguistics
- List of Contributors
- Corpus Design
- Data Collection
- Corpus Annotation: Methodology and Transcription Systems
- On Automatic Phonological Transcription of Speech Corpora
- Statistical Corpus Exploitation
- Corpus Archiving and Dissemination
- Metadata Formats
- Data Formats for Phonological Corpora
- Corpus and Research in Phonetics and Phonology: Methodological and Formal Considerations
- A Corpus-Based Study of Apicalization of /s/ before /l/ in Oslo Norwegian
- Corpora, Variation, and Phonology: An Illustration from French Liaison
- Corpus-Based Investigations of Child Phonological Development: Formal and Practical Considerations
- Corpus Phonology and Second Language Acquisition
- ELAN: Multimedia Annotation Application
- The Use of Praat in Corpus Research
- Praat Scripting
- The PhonBank Project: Data and Software-Assisted Methods for the Study of Phonology and Phonological Development
- ANVIL: The Video Annotation Research Tool
- Web-Based Archiving and Sharing of Phonological Corpora
- The IViE Corpus
- French Phonology from a Corpus Perspective: The PFC Programme
- Two Norwegian Speech Corpora: NoTa-Oslo and TAUS
- The LeaP Corpus
- The Diachronic Electronic Corpus of Tyneside English: Annotation Practices and Dissemination Strategies
- The Lanchart Corpus
- Phonological and Phonetic Databases at the Meertens Institute
- The VALIBEL Speech Database
- Prosody and Discourse in the Australian Map Task Corpus
- A Phonological Corpus of L1 Acquisition of Taiwan Southern Min
Abstract and Keywords
This chapter introduces the term ‘metadata’ and traces its origins. It presents the Dublin Core, OLAC, and IMDI metadata sets, and discusses how the TEI can be used for phonological corpora. Furthermore, the Component Metadata Infrastructure (CMDI) is introduced and practical matters relating to the design and creation of corpus metadata are discussed.
Daan Broeder has a background in electrical engineering, is deputy head of TLA unit at the MPI for Psycholinguistics and as senior developer responsible for all infrastructure and metadata development since many years. He has leading tasks in European and national projects such as all metadata related work in TLA and CLARIN and is the responsible convener for ISO standards on metadata and persistent identifiers.
Dieter Van Uytvanck studied computer science at Ghent University and linguistics at the Radboud University Nijmegen. After graduating he started working at the Max Planck Institute for Psycholinguistics in Nijmegen. Since 2008 he has been active in the technical setup of the CLARIN research infrastructure (www.clarin.eu) and as of 2012 he is director at the CLARIN-ERIC.
Access to the complete content on Oxford Handbooks Online requires a subscription or purchase. Public users are able to search the site and view the abstracts and keywords for each book and chapter without a subscription.
If you have purchased a print title that contains an access token, please see the token for information about how to register your code.