Rosa Estopà Bagot
Català | Español

 

iniciReference

Publishing information:

Barcelona: http://www.tdx.cbuc.es/ ISBN: 84-88042-53-1
Barcelona: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra (Sèrie Tesis, 2), 2002. ISBN: 84-89782-02-4; DL: B-52162-2002

Extracció de terminologia: elements per a la construcció d'un SEACUSE (Sistema d'Extracció Automàtica de Candidats a Unitats de Significació Especialitzada)

Extracción de terminología: elementos para la construcción de un SEACUSE (Sistema de Extracción Automática de Candidatos a Unidades de Significación Especializada) [spanish Traduction]

Universitat Pompeu Fabra, Institut Institut Universitari de Lingüística Aplicada
Dissertation director: M. Teresa Cabré Castellví
Mark: Excel·lent per unanimitat Cum Laude

key words: Terminological Unit, Term automatic extraction, Terminological applications, Automatic extraction systems, Terminology, Term

iniciAbstract

The main objective of this dissertation is to design a model for an Automatic Extraction System for Candidates of Specialized Meaning Units (AESCSMU) based upon different types of linguistic foundations that are adequate to meet the needs of concrete professional activities. To achieve this purpose, the first chapter describes the automatic extraction systems for term candidates (AESTCs). It also analyzes and makes an evaluation of the main AESTCs with the purpose of presenting the state of the issue in this field of knowledge, while showing the characteristics and, above all, the limitations of these systems. The second chapter validates, in the first place, the hypotheses that are going to be taken for granted during the research project about the structural patterns of the multiword units, and, in the second place, tests the main limitations of the AESTCs that are based upon morphosyntactic patterns. These limitations become manifest in two aspects: silence (when the extractor does not detect pertinent units) and noise (when non-pertinent units are presented as if they were pertinent). Data derived from the analysis of silence and noise are studied both on the third and fourth chapters. There, we first analyze the types and causes of silence produced by the AESTCs, and then the types and causes of the noise generated by these systems. The fifth chapter proposes elements and strategies so that automatic extraction systems may reduce silence as well as noise. This way we can accomplish that their results may come closer to the manual identification and delimitation of specialized meaning units. The sixth chapter introduces of the user's point of view, and states the fact that not all the professional activities require the same type and the same number of specialized units from a text. This hypothesis is verified through an experimental test based upon the needs of four different professional activities. Finally, the seventh and last chapter proposes a possible model of an AESCSMU that, besides the strategies presented on chapter five, considers the goals of professionals at the time of presenting the results.

iniciAwards

Accèssit dels Premis Internacionals de Terminologia pera a la investigació aplicada i el desenvolupmanet en el camp de la Terminologia. Associació Europea de Terminologia i INFOTERM (International Information Centre for Terminology), Copenhague, agost de 2001.

iniciRelated Publications

Cabré, M. T.; Estopà, R. ; Vivaldi, J. "Automatic term detection: a review of current systems" En: D. Bourigault, C. Jacquemin, M.-C. L'Homme (eds.) Recent Advances in Computational Terminology. Amsterdam, John Benjamins, Natural Language Processing, 2, 2001, 53-87. ISBN: 90-272-4984-9.

Estopà, R. "Elementos lingüísticos de las unidades terminológicas para su extracción automática" En: Cabré, M. T.; Feliu, J. (eds.) La terminología científico-técnica. Barcelona, Instituto Universitario de Lingüíistica Aplicada; IULATERM, 2001, 67-80, ISBN: 84-477-0744-X

Cabré, M. T.; Estopà, R. "On the units of specialised meaning uses in professional communication". Terminology Science and Research, 2003, 1, Viena, ISSN 0929-9971

Cabré, M. T.; Estopà, R. "El conocimiento especializado y sus unidades de representación: diversidad cognitiva". Sendébar, 2002, 13, 141-153, Granada, ISSN: 1130-5509

Estopà, R.; Valero, T. "Adquisición de conocimiento especializado y unidades de significación especializada en medicina". Panace@ - Boletín de Medicina y Traducción, 2002, 3 (9-10), 72-82 http://www.medtrad.org/panacea ISSN: 1537-1964 Clave: A

Estopà, R. "Extracción de terminología: elementos para la construcción de un extractor". Tradterm, 7, 2002, 225-250, Sao Paulo, ISSN: 0104-693X Clave: A

Estopà, R. "Les unités de signification spécialisées: élargissant l'objet du travail en terminologie". Terminology, 7, 2, 2001, 217-237, Amsterdam-Philadelphia, ISSN : 0929-9971 Referenciado en Linguistic Abstracts, 18, 4,2002, 681, ISSN : 0267-5498

Estopà, R. "Los adjetivos en las unidades terminológicas poliléxicas: un análisis morfosemántico". Organon, 2000, 14, 28/29, 233-246, Porto Alegre, ISSN 0102-6267

Estopà, R.; Vivaldi, J.; Cabré, M.T. "Use of Greek and Latin forms for terms detection". Second International Conference on Language Resoureces and Evaluation, Atenas Any: 2000

Cabré, M.T.; Estopà, R.; Vivaldi, J. "Extraction of monolexical terminological units: requirements analysis". Second International Conference on Language Resoureces and Evaluation Terminology. Atenas, 2000

Estopà, R. "Eficiencia en la extracción automática de terminología" Perspectives: Studies in Traductology, 7, 2, 277-286, 1999, Copenhagen, ISBN: 87-7289-540-3. ISSN: 0997 676X

iniciRelated Research Project

Les unitats terminològiques polilexemàtiques en els lèxics especialitzats (dret i medicina). Directora: M. Teresa Cabré. Tribunal: Mercè Lorente, Toni Badia i M. Teresa Cabré. Defensa: novembre de 1996. [unpublished work]

 

 

Contact email: rosa.estopa@upf.edu
Last updated: 27-03-2006