Terminology Structuring Based on Croatian–English Parallel Corpus (CROSBI ID 557930)
Prilog sa skupa u zborniku | sažetak izlaganja sa skupa | međunarodna recenzija
Podaci o odgovornosti
Gašpar, Angelina ; Seljan, Sanja
engleski
Terminology Structuring Based on Croatian–English Parallel Corpus
Terminology bases have been frequently used by translators, terminologists, lexicographers and everyday users. Labour intensive terminological work encompasses the identification and the ranking of term candidates according to their domain representativity. In order to measure it, a sample of Croatian–English parallel corpora is used and a hybrid statistical and linguistic approach has been adopted as in Frantzi et al.(2000), Nakagava et.al.(1998), Daille (1955), Alegria et.al.(2004). While statistical techniques are used for ranking and filtering of n-gram term candidates, linguistic techniques are applied to select terms capturing their morpho-syntactic properties. The automatically created list is then evaluated and followed by the description of morpho-syntactic patterns. Multiword term list obtained through hybrid approach is then evaluated through measures of precision, recall and F-measure and analyzed with suggestions for further improvements. NooJ linguistic englineering tool has been applied in order to meet linguistic needs of this research.
terminology structuring; term candidates; parallel corpus; statistical approach; linguistic approach; evaluation; NooJ.
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
22-22.
2009.
objavljeno
Podaci o matičnoj publikaciji
Challenges of Translation Studies in a Globalized World - Book of Abstracts
Teržan Kopecky, Karmen ; Zupan, Simon
Maribor: Department of Translation Studies, Faculty of Arts Maribor
Podaci o skupu
Challenges of Translation Studies in a Globalized World
predavanje
15.10.2009-17.10.2009
Maribor, Slovenija