This part aims at publishing TTC deliverables (public), annual reports, scientific papers and other dissemination materials...
TTC poster
TTC midterm poster (PDF).
TTC scientific papers
"TTC TermSuite: A UIMA Application for Multilingual Terminology Extraction from Comparable Corpora", Jerome Rocheteau and Beatrice Daille (LINA) at IJCNLP 2011: the 5th International Joint Conference on Natural Language Processing, November 8-13, 2011, Chiang Mai, Thailand (PDF).
"Simple methods for dealing with term variation and term alignment", Marion Weller, Anita Gojun, Ulrich Heid (IMS), Beatrice Daille, Rima Harastani (LINA) at TIA 2011: the 9th International Conference on Terminology and Artificial Intelligence, November 8-10, 2011, Paris, France (presentation/paper).
"Terminology extraction and term variation patterns: a study of French and German data", Marion Weller (IMS), Helena Blancafort (SYLLABS), Anita Gojun and Ulrich Heid (IMS) at GSCL: German Society for Computational Linguistics and Language Technology, September 28-30, 2011, Universität Hamburg, Germany (PDF).
"Knowledge-Poor Approach to Shallow Parsing: Contribution of Unsupervised Part-of-Speech Induction", Marie Guégan and Claude de Loupy (SYLLABS) at RANLP 2011: Recent Advances in Natural Language Processing, September 12-14, 2011, Hissar, Bulgaria (PDF).
"Babouk: Focused web crawling for corpus compilation and automatic terminology extraction", Clément de Groc (SYLLABS) at IEEE/WIC/ACM 2011: International Conference on Web Intelligence, August 22-27, 2011, Campus Scientifique de la Doua, Lyon, France (PDF).
"Babouk – exploration orientée du web pour la constitution de corpus et terminologies", Clément de Groc, Javier Couto, Helena Blancafort, Claude de Loupy (SYLLABS) at TALN 2011: Traitement Automatique des Langues Naturelles Conference, June 27 – July 1, 2011, Montpellier, France (abstract / poster).
"From Crawled Collections to Comparable Corpora: an Approach based on Automatic Archetype Identification", Richard Forsyth and Serge Sharoff at the 2011 Corpus Linguistics conference: Discourse and Corpus Linguistics, July 20-22, 2011, Birmingham, UK (abstract).
"TTC TermSuite: une chaîne de traitement pour la fouille terminologique multilingue" Béatrice Daille, Christine Jacquin, Laura Monceaux, Emmanuel Morin and Jérome Rocheteau at TALN 2011: Traitement Automatique des Langues Naturelles Conference, June 27 – July 1, 2011, Montpellier, France (PDF).
"Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora" ,Emmanuel Morin and Emmanuel Prochasson (UN, LINA) at the 4th Workshop on Building and Using Comparable Corpora (BUCC 2011), June 24, 2011, Portland, Oregon, USA (PDF).
“The proper place of men and machines in language technology. Processing Russian without any linguistic knowledge”, Serge Sharoff (CTS) and Joakim Nivre (Uppsala University) at the International Conference on Computational Linguistics and Artificial Intelligence Dialog 2011, May 25-29, 2011, Moscow region, Russia (PDF).
"Babouk - Exploration orientée du web pour la constitution de corpus et de terminologies", Clément de Groc (SYLLABS) at Ingénierie des connaissances 2011 (IC 2011): 22es Journées francophones d’Ingénierie des Connaissances, Chambéry, May 16-20, 2011, France (abstract).
"From Terminology Database to Platform for Terminology Services", Andrejs Vasiļjevs, Tatiana Gornostay and Inguna Skadiņa (TILDE) at the CHAT 2011 Workshop on Creation, Harmonization and Application of Terminology resources, May 11, 2011, Riga, Latvia. (PDF)
"Comparability Measurement for Terminology Extraction", Fabien Poulard, Béatrice Daille, Christine Jacquin and Laura Monceaux (UN, LINA), Helena Blancafort (SYLLABS) at the CHAT 2011 Workshop on Creation, Harmonization and Application of Terminology resources, May 11, 2011, Riga, Latvia. (PDF)
"User-centred Views on Terminology Extraction Tools: Usage Scenarios and Integration into MT and CAT Tools", Helena Blancafort (SYLLABS), Ulrich Heid (IMS), Tatiana Gornostay (TILDE), Claude Méchoulam (SOGITEC), Béatrice Daille (UN, LINA), Serge Sharoff (UL, CTS) at the TRALOGY Conference "Translation Careers and Technologies: Convergence Points for the Future", March 3-4, 2011, Paris, France. (PDF)
"Analysing Similarities and Differences between Corpora", Serge Sharoff (UL, CTS) at the 7th Conference “Language Technologies” (Jezikovne Tehnologije), October 14-15, 2010, the Institute “Jožef Stefan”, Ljubljana, Slovenia. (PDF)
"Reaching the User: Targeted Delivery of Federated Content in Multilingual Term Bank: scientific paper", Andrejs Vasiljevs (TILDE), Signe Rirdance (ECDC), Tatiana Gornostay (TILDE) at the Terminology and Knowledge Engineering (TKE) Conference “Presenting Terminology and Knowledge Engineering Resources Online: Models and Challenges”, August 12-13, 2010, Dublin City University, Ireland. (PDF)
"TTC: Terminology Extraction, Translation Tools and Comparable Corpora", Helena Blancafort (SYLLABS), Béatrice Daille (UN, LINA), Ulrich Heid (IMS), Claude Mechoulam (SOGITEC), Serge Sharoff (UL, CTS) at the 14th EURALEX International Congress, July 6-10, 2010, Leeuwarden/Ljouwert, The Netherlands. (PDF)
"Building a French-speaking community around UIMA, gathering research, education and industrial partners, mainly in Natural Language Processing and Speech Recognizing domains", Nicolas Hernandez, Fabien Poulard, Matthieu Vernier, Jérôme Rocheteau (UN, LINA) at "New Challenges for NLP Frameworks" workshop at LREC 2010, May 23 Valletta, Malta. (PDF)
"Terminology Management in Real Use", Tatiana Gornostay (TILDE) at the 5th International Conference “Applied Linguistics in Science and Education”, March 25-26, 2010, Saint-Petersburg, Russian Federation. (PDF)
TTC public deliverables
D2.2 Analysis of typologies to measure intertextuality. Evaluation on several monolingual corpora (March 2011/PDF)
D2.3 Open-source tools for measuring the composition of a corpus within a language and across languages (March 2011/PDF)
D3.3 Definition of the evaluation procedure including parameters and measures, preparation of the data for the evaluation, comprehensive report (December 2010/PDF)
D4.1 Neo-classical MWT detection program for English/French/German (June 2011/PDF)
D5.1 UIMA Type System specification for bilingual term extraction from comparable corpora (December 2010/PDF)
D5.2 UIMA components to integrate existing partners’ tools for term extraction over a given collection (December 2010/PDF)
D5.3 Web demo application to illustrate extraction and alignment operations (October 2011)
TTC Annual public report 2011
TTC Annual public report 2011 has been published (PDF). The document reports on the results the TTC consortium achieved in 2011:
- analysis of typologies to measure intertextuality and evaluation of several monolingual corpora;
- open-source tool for measuring the composition of a corpus within a language and across languages;
- a neo-classical multiword term detection program for English, French, and German, and others;
along with the summary of TTC research, development, and dissemination activities during the second project year list of TTC dissemination activities.
TTC Annual public report 2010
TTC annual public report 2010 has been published (PDF). The document describes TTC main goals:
- using comparable corpora;
- using a minimum of linguistic knowledge for candidate term extraction;
- defining and combining different strategies for term alignment;
- developing an open platform for use with MT and CAT tools;
as well as the overall project strategy, consortium, summary of activities during the first project year, and the project dissemination strategy and activities, including organised events, scientific papers accepted to conferences and free to download from the project website, invited talks, oral and poster; presentations and other dissemination materials mentioning TTC.
TTC Online Survey 2010: Results
Last year TTC partners Syllabs and Tilde conducted a questionnaire-based online survey "Calling Professionals: Help Us to Understand Your Needs!" about terminology and corpora practices to get a better knowledge of user needs and we are pleased to publish the survey results (PDF).
We are VERY grateful to all the respondents for your time and participation, for your input that has been very valuable for us and helped to define the specifications of the tools to be developed during the project.
Thank you very much again!





Releases

