PRINTED FROM OXFORD HANDBOOKS ONLINE (www.oxfordhandbooks.com). © Oxford University Press, 2018. All Rights Reserved. Under the terms of the licence agreement, an individual user may print out a PDF of a single chapter of a title in Oxford Handbooks Online for personal use (for details see Privacy Policy and Legal Notice).

date: 15 July 2019

Abstract and Keywords

This chapter addresses questions that arise when constructing a corpus intended to serve as a sound empirical basis for the creation of dictionaries and for linguistic research. Starting from theoretical considerations of corpus design and representativeness, it also discusses practical issues, such as how the primary data in corpora can be enriched with other kinds of information, how raw data can be converted to corpora, how the necessary rights can be acquired, and, in general, how to avoid possible pitfalls. Throughout, the construction of a corpus is viewed primarily as a complex optimization task that is best approached iteratively, for which there is typically no single optimal solution, and in which costs are a crucial factor.

Keywords: very large corpora, representativeness, corpus design, corpus construction, virtual corpora, primordial sample, German reference corpus
