- Oxford Handbooks in Linguistics
- List of Contributors
- Corpus Design
- Data Collection
- Corpus Annotation: Methodology and Transcription Systems
- On Automatic Phonological Transcription of Speech Corpora
- Statistical Corpus Exploitation
- Corpus Archiving and Dissemination
- Metadata Formats
- Data Formats for Phonological Corpora
- Corpus and Research in Phonetics and Phonology: Methodological and Formal Considerations
- A Corpus-Based Study of Apicalization of /s/ before /l/ in Oslo Norwegian
- Corpora, Variation, and Phonology: An Illustration from French Liaison
- Corpus-Based Investigations of Child Phonological Development: Formal and Practical Considerations
- Corpus Phonology and Second Language Acquisition
- ELAN: Multimedia Annotation Application
- The Use of Praat in Corpus Research
- Praat Scripting
- The PhonBank Project: Data and Software-Assisted Methods for the Study of Phonology and Phonological Development
- ANVIL: The Video Annotation Research Tool
- Web-Based Archiving and Sharing of Phonological Corpora
- The IViE Corpus
- French Phonology from a Corpus Perspective: The PFC Programme
- Two Norwegian Speech Corpora: NoTa-Oslo and TAUS
- The LeaP Corpus
- The Diachronic Electronic Corpus of Tyneside English: Annotation Practices and Dissemination Strategies
- The Lanchart Corpus
- Phonological and Phonetic Databases at the Meertens Institute
- The VALIBEL Speech Database
- Prosody and Discourse in the Australian Map Task Corpus
- A Phonological Corpus of L1 Acquisition of Taiwan Southern Min
Abstract and Keywords
This chapter describes modern methods of corpus archiving and dissemination. It discusses the traditional model of corpus archiving and dissemination, before analysing in detail what has changed in these processes due to the digital innovation. For corpus compilers, the curation and preservation requirements that need to be met are discussed, and the need for advanced infrastructures is described. The chapter also presents the MPI archive that meets some of the modern requirements.
Peter Wittenburg has a diploma degree in Electrical Engineering from the Technical University Berlin and became head of the technical group at the newly founded Max-Planck-Institute for Psycholinguistics in 1976. He has had leading roles in various European and national reearch projects including the DOBES programme, CLARIN and EUDAT as well as ISO initiatives. He is the head of The Language Archive, a collaboration between Max-Planck-Society, Berlin-Brandenburg-Academy of Sciences and the Royal Dutch Academy of Sciences.
Paul Trilsbeek is currently head of archive management at The Language Archive, Max Planck Institute for Psycholinguistics in Nijmegen. He studied Sonology at the Royal Conservatory in The Hague, after which he worked at the Radboud University in Nijmegen as a Music Technologist within the Music, Mind, Machine project. His experience in working with audiovisual media turned out to be of great value in the domain of language resource archiving, in which he has been working since 2003.
Florian Wittenburg works in the Language Archive at the MPI in Nijmegen in collaboration with the Max-Planck-Society, the Berlin-Brandenburg-Academy of Sciences and the Royal Dutch Academy of Sciences.
Access to the complete content on Oxford Handbooks Online requires a subscription or purchase. Public users are able to search the site and view the abstracts and keywords for each book and chapter without a subscription.
If you have purchased a print title that contains an access token, please see the token for information about how to register your code.