Metalinguistic Information Extraction
Description: This work presents an empirical study of the use and function of metalanguage in expert scientific knowledge and special-domain languages, with a special focus on how each field's terminology is established, modified and negotiated within its group of experts. Using discourse statements called Explicit Metalinguistic Operations (EMO), it formalizes and analyzes the dynamic nature of conceptual structures and of the sublanguages that embody them.
The work also presents an implementation of a system for the automatic extraction of metalinguistic information from specialized texts. The Metalinguistic Operation Processor (MOP) system extracts metalinguistic statements and definitions from special-domain documents, using finite-state machinery and machine-learning algorithms. The system creates semi-structured databases called Metalinguistic Information Databases (MID), which are useful for specialized lexicography, Natural Language Processing, and the empirical study of scientific knowledge.
About the author: Dr. Carlos Rodriguez Penagos is a senior researcher in the Language and Voice area of Barcelona Media (GLICOM). He holds a doctoral degree in Linguistics, with a specialty in Computational Linguistics. His major areas of expertise are Information Extraction, Text Mining, Natural Language Processing, Knowledge Engineering and Computer-Aided Translation. He has taught and has coordinated various international research projects at the National Autonomous University of Mexico (UNAM), the Universitat Pompeu Fabra (UPF) and the National Cancer Research Center (CNIO), where he mined biomedical literature. He has been awarded various research grants by the National Science and Technology Council (Mexico) and the Generalitat de Catalunya (Spain).
Subtitle: Exploiting Metalanguage in Text. Paperback. Language: English.
Publisher: VDM Verlag
Publication date: May 2008
Number of pages: 192