- The Oxford Handbook of Computational Linguistics
- Pragmatics and Dialogue
- Formal Grammars and Languages
- Text Segmentation
- Part-of-Speech Tagging
- Word-Sense Disambiguation
- Anaphora Resolution
- Natural Language Generation
- Speech Recognition
- Text-to-Speech Synthesis
- Finite-State Technology
- Statistical Methods
- Machine Learning
- Lexical Knowledge Acquisition
- Sublanguages and Controlled Languages
- Corpus Linguistics
- Tree-Adjoining Grammars
- Machine Translation: General Overview
- Machine Translation: Latest Developments
- Information Retrieval
- Information Extraction
- Question Answering
- Text Summarization
- Term Extraction and Automatic Indexing
- Text Data Mining
- Natural Language Interaction
- Natural Language in Multimodal and Multimedia Systems
- Natural Language Processing in Computer-Assisted Language Learning
- Multilingual On-Line Natural Language Processing
- Notes on Contributors
- Index of Authors
- Subject Index
Abstract and Keywords
Beyond the theoretical linguistics search for language universals, there exist a number of practical linguistic problems concerning multilingual environments. This article discusses a number of related multilingual natural language processing (NLP) applications, many connected with using Internet language identification. Internet users want to shop, interact in virtual communities, and find information without language being a barrier, with as much ease as they can. This article presents a technical overview of the research and development being performed on four multilingual applications: language identification, terminology alignment, cross-language information retrieval, and comprehension aids. It covers a wide range of tools that allows the user to understand text written in another language, without using the machinery of full automatic translation.
Gregory Grefenstette is Principal Research Scientist at Clairvoyance Corporation, Pittsburgh, USA. He is the author of Explorations in Automatic Thesaurus Discovery (Kluwer, 1994) and editor of Cross-Language Information Retrieval (Kluwer, 1998). Gregory Grefenstette has published numerous papers on computational linguistics, information retrieval, and corpus analysis. He is currently working on very large lexicons.
Frédérique Segond is managing the research group in Semantics at Xerox Resarch Centre Europe (XRCE). She joined the Xerox Research Centre Europe in 1993to work on computational linguistics and lead the Lexical Semantic project. For three years she has been in charge of e-learning projects at Xerox-New Starts, a business unit that was transferring research results into market place. Frédérique Segond received her Ph.D. in Applied Mathematics at the Ecole des Hautes Etudes en Sciences Sociales in Paris, implementing a categorical French grammar at IBM-France at the same time. After a postdoc at IBM Yorktown working on the links between syntax and semantics, Dr. Segond was Maitre de Conference at the Institut National des Telecommunications.
Access to the complete content on Oxford Handbooks Online requires a subscription or purchase. Public users are able to search the site and view the abstracts and keywords for each book and chapter without a subscription.
If you have purchased a print title that contains an access token, please see the token for information about how to register your code.