Parsing Croatian and Serbian by Using Croatian Dependency Treebanks (CROSBI ID 600123)
Conference contribution in proceedings | original scientific paper | international peer review
Responsibility details
Agić, Željko ; Merkler, Danijela ; Berović, Daša
English
We investigate statistical dependency parsing of two closely related languages, Croatian and Serbian. As these two morphologically complex languages of relaxed word order are generally under-resourced -- with the topic of dependency parsing still largely unaddressed, especially for Serbian -- we make use of the two available dependency treebanks of Croatian to produce state-of-the-art parsing models for both languages. We observe parsing accuracy on four test sets from two domains. We give insight into overall parser performance for Croatian and Serbian, impact of preprocessing for lemmas and morphosyntactic tags and influence of selected morphosyntactic features on parsing accuracy.
dependency treebank; dependency parsing; Croatian; Serbian
Contribution details
22-33.
2013.
published
Host publication details
Seattle (WA): Association for Computational Linguistics (ACL)
978-1-937284-97-8
Conference details
Fourth Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2013)
lecture
18.10.2013-21.10.2013
Seattle (WA), United States of America