(p. ix) List of Contributors
(p. ix) List of Contributors
Joan C. Beal is Professor of English Language at the University of Sheffield. Her research interests are in the fields of sociolinguistics/dialectology and the history of the English language since 1700. She has published widely in both fields.
Bruce Birch is currently Departmental Visitor in Linguistics at the Australian National University in Canberra. His research has focused on the development of a usage-based approach to the analysis of prosodic structure including intonation, as well as issues involved in the documentation of endangered languages. He has been collecting data and contributing to the building of online corpora for Iwaidja and other highly endangered languages of Northwestern Arnhem Land, Australia, since 1999.
Paul Boersma received an MSc. in physics from the University of Nijmegen in 1988 and a Ph.D in linguistics from the University of Amsterdam in 1998 for a dissertation entitled ‘Functional phonology’. Since 2005 he has been Professor of Phonetic Sciences at the University of Amsterdam.
Lasse Bombien works as a researcher at the Universities of Munich and Potsdam. In 2006 he received his MA in Phonetics at the University of Kiel, and in 2011 a D.Phil. in Phonetics at the University of Munich. His areas of interest include speech production, articulatory coordination, sound change, effects of prosodic structure on phonetic detail, phonetics and phonology of Scandinavian languages, techniques for speech production investigation, and software development.
Caren Brinckmann studied computational linguistics and phonetics at Saarland University (Saarbrücken, Germany) and Tohoku University (Sendai, Japan). For her master’s thesis in 2004 she improved the prosody prediction module of a German speech synthesis system with corpus-based statistical methods. As a researcher at the national Institute for the German Language (IDS, Mannheim, Germany) she subsequently focused on regional phonetic variation, word phonology, and automatic text classification with statistical methods. In 2011 she left academia to apply her data-mining skills in a major German e-commerce company.
Daan Broeder has a background in electrical engineering, is deputy head of the TLA unit at the Max Planck Institute for Psycholinguistics (Nijmegen, The Netherlands) and has been senior developer responsible for all infrastructure and metadata development for many years. He plays leading roles in European and national projects, such as all (p. x) metadata-related work in TLA and CLARIN, and is the responsible convener for ISO standards on metadata and persistent identifiers.
Karen P. Corrigan is Professor of Linguistics and English Language at Newcastle University. She researches language variation and change in dialects of the British Isles with a particular focus on Northern Ireland and northeast England. She was principal investigator on the research project that created the Newcastle Electronic Corpus of Tyneside English (2000–2005), and fulfilled the same role for the Diachronic Electronic Corpus of Tyneside English project (2010–2012) at Newcastle University.
Catia Cucchiarini obtained her Ph.D in phonetics from the University of Nijmegen. She worked at the Centre for Language and Education of K.U. Leuven in Belgium, and has been working at the University of Nijmegen on various projects on speech processing and computer-assisted language learning. She has supervised Ph.D students and has published many articles in international journals. In addition to her research activities, she has since 1999 been working at the Nederlandse Taalunie (Dutch Language Union) as a senior project manager for language policy and human language technologies.
Elisabeth Delais-Roussarie is a senior researcher at the CNRS, Laboratoire de Linguistique Formelle, Paris (Université Paris-Diderot). She has worked on several topics in sentence phonology, such as the modelling of intonation and accentual patterns in French, the phonology–syntax interface, and prosodic phrasing in French. Her recent work has focused on the development and evaluation of prosodic annotation systems and tools that facilitate a corpus-based approach in sentence phonology and in the L2 acquisition of prosody.
Jacques Durand is Emeritus Professor of Linguistics at the University of Toulouse II – Le Mirail and a Member of the Institut Universitaire de France. He was formerly Professor at the University of Salford, Director of the CLLE-ERSS research centre in Toulouse, and in charge of linguistics at CNRS headquarters. His publications are mainly in phonology (particularly within the framework of Dependency Phonology, in collaboration with John Anderson), but he also worked in machine translation in the 1980s and 1990s within the Eurotra project. Since the late 1990s he has coordinated two major research programmes in corpus phonology: Phonology of Contemporary French, with M.-H. Côté, B. Laks, and C. Lyche, and Phonology of Contemporary English, with P. Carr and A. Przewozny.
Janet Fletcher is Associate Professor of Phonetics at the University of Melbourne. She completed her Ph.D at the University of Reading and has held research positions at the University of Edinburgh, Ohio State University, and Macquarie University. Her research interests include articulatory and acoustic modelling of coarticulation, and prosody and intonation in Australian English and Australian Indigenous languages.
Michel Francard is Professor of Linguistics at the Catholic University of Louvain (Louvain-la-Neuve) and founder of the VALIBEL research centre in 1989. His main (p. xi) research interests include linguistic variation (especially the lexicography of peripheral French varieties) and the evolution of endangered languages in the globalized linguistic market. His most recent book, Dictionnaire des belgicismes (2010) illustrates the emergence of an autonomous norm within a variety of French outside France.
Frans Gregersen is Professor of Danish Language at the Department of Scandinavian Studies and Linguistics, University of Copenhagen, and director of the Danish National Research Foundation LANCHART (LANguage CHAnge in Real Time) Centre since 2005. The centre webpage with current publications may be found at www.lanchart.dk.
Ulrike Gut holds the Chair of English Linguistics at the Westfälische Wilhelms- University in Münster. She received her Ph.D from Mannheim University and her postdoctoral degree (Habilitation) from Freiburg University. Her main research interests include phonetics and phonology, corpus linguistics, second language acquisition, and worldwide varieties of English. She has collected the LeaP corpus and the ICE-Nigeria and is currently involved in the compilation of the ICE-Scotland.
Kristin Hagen is Senior Engineer at the Text Laboratory, Department of Linguistic and Scandinavian Studies at the University of Oslo. For many years she has worked with the development of speech corpora such as NoTa-Oslo and the Nordic Dialect Corpus. She has also worked in other language technology domains like POS tagging (the Oslo–Bergen tagger), parsing and grammar checking. Her background is in linguistics and in computer science.
Philippe Hambye is Professor of French Linguistics at the University of Louvain. His research mainly includes work in sociolinguistics relating to variation of linguistic norms and practices in the French-speaking world, language practices in education and work, and language policies, with a special interest in questions of legitimacy, power, and social inequalities.
Tina John is a lecturer in linguistics at the University of Kiel. After MA graduation in Phonetics, Linguistics and Computer Science in 2004, she joined the developer team of the EMU System. In 2012 she obtained her Ph.D in Phonetics and Computer Linguistics at the University of Munich. Her areas of interest, in addition to the development of the EMU System and algorithms in general, are all kinds of speech data analysis (e.g. finding acoustic correlates) as well as analyses of text corpora.
Michael Kipp is Professor for Interactive Media at Augsburg University of Applied Sciences, Germany. Previously he was head of a junior research group at the German Research Center for AI (DFKI), Saarbrücken, and Saarland University. His research topics are embodied agents, multimodal annotation, and communication and interaction design. He developed and maintains the ANVIL video annotation tool.
Gjert Kristoffersen is Professor of Scandinavian Languages at the University of Bergen. His research interests are synchronic and diachronic aspects of Scandinavian phonology, (p. xii) especially Norwegian and Swedish prosody from a variationist perspective. He is the author of The Phonology of Norwegian, published by Oxford University Press in 2000.
Bernard Laks is Professor at Paris Ouest Nanterre University and senior member of the Institut Universitaire de France. Former director of the Nanterre linguistic laboratory, he is with Jacques Durand, Chantal Lyche, and Marie-Hélène Côté, director of the ‘Phonologie du français contemporain’ corpus and research program (PFC). He has published intensively in phonology, corpus linguistics, variation, cognitive linguistics, the history of linguistics, and modelling.
Chantal Lyche is currently Professor of French Linguistics at the University of Oslo. She has been adjunct Professor at the University of Tromsø and an associate member of CASTL (Center for Advanced Studies in Theoretical Linguistics, Tromsø). She has published extensively on French phonology and is the co-founder of a research programme in corpus phonology: ‘Phonology of contemporary French’ (with Jacques Durand and Bernard Laks). Since the 1990s she has focused more specifically on varieties of French outside France, particularly in Switzerland, Louisiana, Mauritius, and Africa. In addition, she has worked on the study of large corpora from a prosodic point of view. She is also the co-author of a standard textbook on the phonology of French, and is actively involved in the teaching of French as a foreign language.
Brian MacWhinney, Professor of Psychology, Computational Linguistics, and Modern Languages at Carnegie Mellon University, has developed a model of first and second language acquisition and processing called the Competition Model which he has also applied to aphasia and fMRI studies of children with focal lesions. He has developed databases such as CHILDES, SLABank, BilingBank, and CABank for the study of language learning and usage. He is currently developing methods for second language learning based on mobile devices and web-based tutors and games.
Marie Maegaard holds a Ph.D from the University of Copenhagen. She is currently Associate Professor of Danish Spoken Language at the Department of Scandinavian Research, University of Copenhagen, and is in charge of the phonetic studies at the Danish National Research Foundation LANCHART (LANguage CHAnge in Real Time) Centre. The centre webpage with current publications may be found at www.lanchart.dk.
Adam J. Mearns was postdoctoral research associate on the Diachronic Electronic Corpus of Tyneside English project (2010–2012) at Newcastle University. His background is in the lexical semantics of Old English and the history of the English language.
Hermann Moisl is a Senior Lecturer in Computational Linguistics at the University of Newcastle, UK. His background is in linguistics and computer science, and his research interests and publications are in neural language modelling using nonlinear attractor dynamics, and in methodologies for preparation and cluster analysis of data abstracted from natural language corpora.
(p. xiii) Francis Nolan is Professor of Phonetics in the Department of Linguistics at the University of Cambridge. He studied languages at Cambridge before specializing in phonetics. After a first post in Bangor (North Wales) he returned to Cambridge, developing research interests in phonetic theory, connected speech processes, speaker characteristics, variation in English, and prosody—the latter two united in the IViE project in the late 1990s. He has been active in forensic phonetic research and casework. He is currently President of the British Association of Academic Phoneticians.
Marc van Oostendorp is a researcher at the Meertens Instituut of the Royal Netherlands Academy of Arts and Sciences, and a Professor of Phonological Microvariation at Leiden University. His main interests are models of geographical and social variation, the relation between language as a property of the mind and language as a property of a community, and alternatives to derivational relations in the phonology–morphology interface.
Nicolai Pharao is Assistant Professor at the Danish National Research Foundation’s Centre for Language Change in Real Time, LANCHART. He received his Ph.D in linguistics with the dissertation ‘Consonant Reduction in Copenhagen Danish’ in 2010. His research includes corpus based studies of phonetic variation and change and experimental studies of the relationship between phonetic variation, social meaning, and language attitudes. He is particularly interested in how the usage and social evaluation of phonetic features influences the representation of word forms in the mental lexicon.
Brechtje Post is Lecturer in Phonetics and Phonology at the University of Cambridge. Her research interests centre around the syntax–phonology interface and intonation, which she investigates from a phonetic, phonological, acquisitional, cognitive, and neural perspective. She has published in journals such as Linguistics, Journal of Phonetics, Language and Speech, Cognition, and Neuropsychologia.
Laurent Romary is Directeur de Recherche at INRIA, France and guest scientist at Humboldt University in Berlin, Germany. He carries out research on the modelling of semi-structured documents, with a specific emphasis on texts and linguistic resources. He is the chairman of ISO committee TC 37/SC 4 on Language Resource Management, and has been active as member (2001–2007), then chair (2008–2011), of the TEI (Text Encoding Initiative) council. He currently contributes to the establishment and coordination of the European Dariah infrastructure for the arts and humanities.
Yvan Rose is currently Associate Professor of Linguistics at Memorial University and co-director of the PhonBank Project within CHILDES. He received his Ph.D in Linguistics from McGill University. His research concentrates on the nature of phonological representations and of their acquisition by young children. He investigates these questions through software-assisted methods, implemented in the Phon program for the analysis of transcript data on phonology and phonological development.
Thomas Schmidt holds a Ph.D from the University of Dortmund. His research interests are spoken language corpora, text and corpus technology, and computational (p. xiv) lexicography. He is one of the developers of EXMARaLDA and the author of the Kicktionary, a multilingual electronic dictionary of football language. He has spent most of his professional life as a researcher at the University of Hamburg. He also worked as a language resource engineer for a commercial company and as a research associate at ICSI Berkeley and at the Berlin-Brandenburg Academy of Sciences. Currently he heads the Archive for Spoken German (AGD) at the Institute for the German Language in Mannheim.
Anne Catherine Simon is Professor of French Linguistics at the Catholic University of Louvain (Louvain-la-Neuve, Belgium). She has been in charge of the VALIBEL research centre since 2009. Her research in French linguistics is in the areas of prosody and syntax of spoken speech, and their interaction in various speaking genres. Her dissertation, ‘Structuration prosodique du discours en français’, was published in 2004. She is co-author of La variation prosodique régionale en français (2012).
Hanne Gram Simonsen is Professor of Linguistics at the Department of Linguistics and Scandinavian Studies at the University of Oslo. Her research interests include language acquisition (in particular phonology, morphology, and lexicon) and instrumental and articulatory phonetics, as well as clinical linguistics (language disorders in children and adults). She has published on these topics in journals such as Journal of Child Language, Journal of Phonetics, and Clinical Linguistics and Phonetics.
Han Sloetjes is a software developer at the Language Archive, a department of the Max Planck Institute for Psycholinguistics. He has been involved in the development of the multimedia annotation tool ELAN since 2003. Currently he is the main responsible person for supporting, maintaining, and further developing this application.
Lesley Stirling is Associate Professor of Linguistics and Applied Linguistics at the University of Melbourne. She has a disciplinary background in linguistics and cognitive science, and has published work on a variety of topics in descriptive and typological linguistics, semantics, and discourse analysis. One research interest has been the relationship between dialogue structure and prosody, involving collaborative cross-disciplinary research funded by the Australian Research Council.
Helmer Strik received his Ph.D in physics from the University of Nijmegen, where he is now Associate Professor of Speech Science and Technology. His research addresses both human speech processing (voice source modelling, intonation, pronunciation variation, speech pathology) and speech technology (automatic speech recognition and transcription, spoken dialogue systems, and computer-assisted language learning and therapy). He has published over 150 refereed papers, has coordinated national and international projects, and has been an invited speaker at international events.
Atanas Tchobanov is a research engineer in MoDyCo CNRS lab. He has been active in the field of oral corpora web implementations since 2001. His research interests also include data analysis and unsupervised learning of phonological invariants.
(p. xv) Paul Trilsbeek is currently head of archive management at the Language Archive, Max Planck Institute for Psycholinguistics, Nijmegen. He studied sonology at the Royal Conservatory in The Hague, after which he worked at the Radboud University in Nijmegen as a music technologist in the Music, Mind, Machine project. His experience in working with audiovisual media turned out to be of great value in the domain of language resource archiving, in which he has been working since 2003.
Jane S. Tsay received her Ph.D in Linguistics from the University of Arizona. She was a postdoctoral research fellow at the State University of New York at Buffalo 1993–1995. She is currently a Professor of Linguistics and the Dean of the College of Humanities at the National Chung Cheng University in Taiwan. Her research interests include phonology acquisition, experimental phonetics, corpus linguistics, and sign language phonology. She has constructed the Taiwanese Child Language Corpus, based on 330 hours of recordings of young children’s spontaneous speech. She is also the co-director of the Sign Language Research Group at the University and has compiled a Taiwan Sign Language online dictionary. Her recent research, besides phonology acquisition, is on the phonological structure of spoken and signed languages.
Dieter van Uytvanck studied computer science at Ghent University and linguistics at the Radboud University, Nijmegen. After graduating he worked at the Max Planck Institute for Psycholinguistics in Nijmegen. Since 2008 he has been active in the technical setup of the CLARIN research infrastructure (www.clarin.eu) and as of 2012 he is director at CLARIN-ERIC.
Holger Voormann received a degree in computer science from the University of Stuttgart. He worked as a research associate at the IMS Stuttgart and held several positions in IT companies. He is now a freelance software developer and consultant, and is involved in the development of several open source projects, such as the Platform for Annotated Corpora in XML (Pacx).
Andreas Witt received his Ph.D in Computational Linguistics and Text Technology from Bielefeld University in 2002, and continued there for the next four years as an instructor and researcher in those fields. In 2006 he moved to Tübingen University, where he participated in a project on ‘Sustainability of Linguistic Resources’ and in projects on the interoperability of language data. Since 2009 he has headed the Research Infrastructure group at the Institute for the German Language in Mannheim.
Florian Wittenburg works in the Language Archive at the MPI in Nijmegen in collaboration with the Max Planck Society, the Berlin Brandenburg Academy of Sciences, and the Royal Dutch Academy of Sciences.
Peter Wittenburg has a diploma degree in Electrical Engineering from the Technical University Berlin and in 1976 became head of the technical group at the newly founded Max Planck Institute for Psycholinguistics. He has had leading roles in various European and national reearch projects including the DOBES programme, CLARIN and EUDAT (p. xvi) as well as ISO initiatives. He is the head of the Language Archive, a collaboration between the Max Planck Society, the Berlin Brandenburg Academy of Sciences, and the Royal Dutch Academy of Sciences.
Kai Wörner holds a Ph.D in text technology from the University of Bielefeld. After finishing his university studies at Gießen University, he worked as a web developer in Hamburg. He is currently the managing director of the Hamburg Centre for Language Corpora and a research assistant in the language resource infrastructure project CLARIN-D. His research interests are corpus and computational linguistics. He is one of the developers of the EXMARaLDA system.
Hiyon Yoo is Associate Professor of Linguistics and Phonetics at the University Paris Diderot - Paris 7. Her current research interests include the description and modelling of rhythm and intonation (especially French, Korean and Modern Greek), the study of the phonology-syntax-semantics interface, corpus phonology, and L2 acquisition of prosody.