- The Oxford Handbook of Computational Linguistics
- Pragmatics and Dialogue
- Formal Grammars and Languages
- Text Segmentation
- Part-of-Speech Tagging
- Word-Sense Disambiguation
- Anaphora Resolution
- Natural Language Generation
- Speech Recognition
- Text-to-Speech Synthesis
- Finite-State Technology
- Statistical Methods
- Machine Learning
- Lexical Knowledge Acquisition
- Sublanguages and Controlled Languages
- Corpus Linguistics
- Tree-Adjoining Grammars
- Machine Translation: General Overview
- Machine Translation: Latest Developments
- Information Retrieval
- Information Extraction
- Question Answering
- Text Summarization
- Term Extraction and Automatic Indexing
- Text Data Mining
- Natural Language Interaction
- Natural Language in Multimodal and Multimedia Systems
- Natural Language Processing in Computer-Assisted Language Learning
- Multilingual On-Line Natural Language Processing
- Notes on Contributors
- Index of Authors
- Subject Index
Abstract and Keywords
Terms are pervasive in scientific and technical documents and their identification is a crucial issue for any application dealing with the analysis, understanding, generation, or translation of such documents. In particular, the ever-growing mass of specialized documentation available on-line, in industrial and governmental archives or in digital libraries, calls for advances in terminology processing for tasks such as information retrieval, cross-language querying, indexing of multimedia documents, translation aids, document routing and summarization, etc. This article presents a new domain of research and development in natural language processing (NLP) that is concerned with the representation, acquisition, and recognition of terms. It begins with presenting the basic notions about the concept of ‘terms’, ranging from the classical view, to the recent concepts. There are two main areas of research involving terminology in NLP, which are, term acquisition and term recognition. Finally, this article presents the recent advances and prospects in term acquisition and automatic indexing.
Christian Jacquemin is a Professor in Computer Science at the University of Paris XI and head of the Research Group ‘Language, Information, and Representation’ at CNRS-LIMSI. He has published at major conferences in computational linguistics and information retrieval (ACL, COLING, ACM/SIGIR, RIAO). Professor Jacquemin was involved in the organization of workshops on compound analysis and computational terminology. He authored a book on natural language processing for term recognition published by MIT Press in 2001, and co-edited a book on computational terminology published by John Benjamins in 2001.
Didier Bourigault is a researcher at CNRS-ERSS (Toulouse, France). He has developed terminology extraction software (Lexter as well as Syntex) and acts as a consultant in the field of language engineering and computational terminology. He was also instrumental in the setting up and development of the TIA group, a French research group on computational terminology and knowledge engineering. Didier Bourigault has published at major conferences in computational linguistics (EACL, COLING, KAW) and co-edited a book on computational terminology published by John Benjamins in 2001.
Access to the complete content on Oxford Handbooks Online requires a subscription or purchase. Public users are able to search the site and view the abstracts and keywords for each book and chapter without a subscription.
If you have purchased a print title that contains an access token, please see the token for information about how to register your code.