Abstract and Keywords
This chapter describes the contributions that Corpus Linguistics (the study of linguistic phenomena through the systematic exploitation of collections of naturally occurring linguistic data) can make to research on information structure (IS). It discusses issues in designing a corpus that can serve as a basis for qualitative or quantitative studies, and then turns to the central issue of data annotation: which available corpora carry IS-related annotations, and how can such annotations be evaluated? If a corpus lacks direct IS annotation, can other types of annotation, especially multi-layer annotation, serve as indirect evidence for the presence of IS phenomena? Next, the chapter sketches the present state of the art in automatic IS annotation by means of techniques from computational linguistics, and finally it briefly introduces several sample studies that exploit IS annotations.