Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Evaluating sentence alignment on Croatian-English parallel corpora (CROSBI ID 541351)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Seljan, Sanja ; Agić, Željko ; Tadić, Marko Evaluating sentence alignment on Croatian-English parallel corpora // Proceedings of the 6th International Conference on Formal Approaches to South Slavic and Balkan Languages / Tadić, Marko ; Dimitrova-Vulchanova, Mila ; Koeva, Svetla (ur.). Zagreb: Hrvatsko društvo za jezične tehnologije, 2008. str. 101-108

Podaci o odgovornosti

Seljan, Sanja ; Agić, Željko ; Tadić, Marko

engleski

Evaluating sentence alignment on Croatian-English parallel corpora

This paper describes an experiment in applying sentence alignment methods to Croatian-English parallel corpora and systematically evaluate their performance within the recall, precision and F-measure framework. It is our primary goal to provide an insight and a reference point on sentence alignment accuracy for Croatian-English language pair and also to extend the scope of (Tadić, 2000) – to our knowledge, the first experiment dealing with sentence alignment of Croatian-English parallel corpora – by utilizing newly implemented tools, creating corpora subsets defined by genre and finally by expanding and formalizing its preliminary observations on alignment accuracy. Therefore, in this paper we start off by briefly describing and argumenting sentence alignment paradigms of choice and presenting available language resources, subset of Croatian-English parallel corpus described in (Tadić, 2000) being our primary asset. These descriptions are followed by a formal definition of our testing framework. Results are then discussed in detail and conclusions are stated along with a brief insight on possible future work.

sentence alignment; croatian-english parallel corpora

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

101-108.

2008.

objavljeno

Podaci o matičnoj publikaciji

Proceedings of the 6th International Conference on Formal Approaches to South Slavic and Balkan Languages

Tadić, Marko ; Dimitrova-Vulchanova, Mila ; Koeva, Svetla

Zagreb: Hrvatsko društvo za jezične tehnologije

978-953-55375-0-2

Podaci o skupu

6th International Conference on Formal Approaches to South Slavic and Balkan Languages (FASSBL 2008)

predavanje

25.09.2008-28.09.2008

Dubrovnik, Hrvatska

Povezanost rada

Računarstvo, Informacijske i komunikacijske znanosti, Filologija