You are here : Home News and Events News and Events TTC @ GSCL 2011 - September 28-30, Hamburg, Germany  

TTC @ GSCL 2011 - September 28-30, Hamburg, Germany

TTC was presented at the GSCL: German Society for Computational Linguistics and Language Technology conference held on September 28-30, 2011 at the University of Hamburg in Germany with the paper "Terminology extraction and term variation patterns: a study of French and German data" by Marion Weller, Anita Gojun, Ulrich Heid (IMS) and Helena Blancafort (SYLLABS).

Abstract:

The terminology of many technical domains, especially new and evolving ones, is not fully fixed and shows considerable variation. The purpose of the work described in this paper is to capture term variation. For term extraction, we apply hand-crafted POS patterns on tagged corpora, and we use rules to relate morphological and syntactic variants. We discuss some French and German variation patterns, and we present first experimental results from our tools. It is not always easy to distinguish (near) synonyms from variants that have a slightly different meaning from the original term; we discuss ways of operating such a distinction. Our tools are based on POS tagging and an approximation of derivation and compounding; however, we also propose a non-symbolic, statistics-based line of development. We discuss general issues of evaluating variant detection and present a small-scale precision evaluation.