- Oxford Handbooks in Linguistics
- [UNTITLED]
- List of Contributors
- Introduction
- Corpus Design
- Data Collection
- Corpus Annotation: Methodology and Transcription Systems
- On Automatic Phonological Transcription of Speech Corpora
- Statistical Corpus Exploitation
- Corpus Archiving and Dissemination
- Metadata Formats
- Data Formats for Phonological Corpora
- Corpus and Research in Phonetics and Phonology: Methodological and Formal Considerations
- A Corpus-Based Study of Apicalization of /s/ before /l/ in Oslo Norwegian
- Corpora, Variation, and Phonology: An Illustration from French Liaison
- Corpus-Based Investigations of Child Phonological Development: Formal and Practical Considerations
- Corpus Phonology and Second Language Acquisition
- ELAN: Multimedia Annotation Application
- EMU
- The Use of Praat in Corpus Research
- Praat Scripting
- The PhonBank Project: Data and Software-Assisted Methods for the Study of Phonology and Phonological Development
- EXMARaLDA
- ANVIL: The Video Annotation Research Tool
- Web-Based Archiving and Sharing of Phonological Corpora
- The IViE Corpus
- French Phonology from a Corpus Perspective: The PFC Programme
- Two Norwegian Speech Corpora: NoTa-Oslo and TAUS
- The LeaP Corpus
- The Diachronic Electronic Corpus of Tyneside English: Annotation Practices and Dissemination Strategies
- The Lanchart Corpus
- Phonological and Phonetic Databases at the Meertens Institute
- The VALIBEL Speech Database
- Prosody and Discourse in the Australian Map Task Corpus
- A Phonological Corpus of L1 Acquisition of Taiwan Southern Min
- References
- Index
Abstract and Keywords
Lexical information, especially frequency information regarding sound patterns, plays an important role in the acquisition of phonology by children. It is therefore desirable to have child language corpora that are both very large and rich in phonological detail. This chapter describes the data collection, transcription, and annotation of the Taiwanese Child Language Corpus. This corpus (in CHILDES format) is based on 330 hours of longitudinal recordings of spontaneous speech between fourteen Taiwanese children and their caretakers over a period of three years. The target language, Taiwanese, is a variety of Southern Min Chinese spoken in Taiwan. The corpus contains 1 646 503 word tokens in 497 426 lines (utterances). Computer programs developed for specific phonological analyses are also described briefly.
Keywords: child language, phonology, L1 acquisition, corpus linguistics, Southern Min Chinese, Taiwanese
Tsay, Jane S. (蔡素娟) is Professor at the Graduate Institute of Linguistics and Dean of the College of Humanities, National Chung Cheng University, Taiwan. Her primary research areas are theoretical phonology, experimental phonology and phonetics, child language acquisition, corpus linguistics, Chinese dialectology, and Taiwan Sign Language.