crta
Hrvatska znanstvena Sekcija img
bibliografija
3 gif
 Naslovna
 O projektu
 FAQ
 Kontakt
4 gif
Pregledavanje radova
Jednostavno pretraživanje
Napredno pretraživanje
Skupni podaci
Upis novih radova
Upute
Ispravci prijavljenih radova
Ostale bibliografije
Slični projekti
 Bibliografske baze podataka

Pregled bibliografske jedinice broj: 507895

Zbornik radova

Autori: Ljubešić, Nikola; Bago, Petra; Boras, Damir
Naslov: Statistical machine translation of Croatian weather forecast: How much data do we need?
( Statistical machine translation of Croatian weather forecast: How much data do we need? )
Izvornik: Proceedings of the ITI 2010 32nd International Conference on INFORMATION TECHNOLOGY INTERFACES / Luzar-Stiffler, V. (ur.). - Zagreb : University Computing Centre, University of Zagreb , 2010. 91 (ISBN: 978-1-4244-5732-8).
ISSN: 1330-1012
Skup: ITI 2010 32nd International Conference on Information Technology Interfaces
Mjesto i datum: Cavtat / Dubrovnik, Hrvatska, 21.-24.06.2010.
Ključne riječi: statistical machine translation; weather forecast; automatic evaluation; human evaluation
( statistical machine translation; weather forecast; automatic evaluation; human evaluation )
Sažetak:
This research is a first step towards a system for translating Croatian weather forecast into multiple languages. This steps deals with the Croatian-English language pair. The parallel corpus consists of a one-year sample of the weather forecasts for the Adriatic consisting of 7, 893 sentence pairs. Evaluation is performed by best known automatic evaluation measures BLUE, NIST and METEOR, as well as by evaluating manually a sample of 200 translations. In this research we have shown that with a small-sized training set and the state-of-the art Moses system, decoding can be done with 96% accuracy concerning adequacy and fluency. Additional improvement is to be expected by increasing the training set size.
Vrsta sudjelovanja: Predavanje
Vrsta prezentacije u zborniku: Cjeloviti rad (više od 1500 riječi)
Vrsta recenzije: Međunarodna recenzija
Projekt / tema: 130-1301679-1380
Izvorni jezik: eng
Kategorija: Znanstveni
Znanstvena područja:
Informacijske i komunikacijske znanosti
Puni text rada: 507895.ljubesic10-statistical.pdf (tekst priložen 28. Ožu. 2011. u 16:43 sati)
Upisao u CROSBI: nljubesic@ffzg.hr (nljubesic@ffzg.hr), 28. Ožu. 2011. u 16:43 sati



Verzija za printanje   za tiskati


upomoc
foot_4