Data source: CROSBI

Croatian language networks (CROSBI ID 610178)

Conference contribution in proceedings | abstract of a conference presentation

Martinčić-Ipšić, Sanda. Croatian language networks // 2014 Adriatic Conference on Graph Theory and Complexity / Vukičević, Damir (ed.). Split: PMF Split, 2014, pp. 8-9

Responsibility details

Martinčić-Ipšić, Sanda

English

Croatian language networks

Written as well as spoken language can be modeled with complex networks, in which linguistic units (words) are represented by nodes and their linguistic interactions by links. Such representations enable language analysis across different linguistic units, the examination of language evolution, the modeling of language acquisition, and the assessment of text quality. Language networks can be constructed at the word level and at the subword level. Studying network interactions across language levels can reveal previously unavailable structural properties of the Croatian language at the phonological, syllabic, morphological, co-occurrence and syntactic levels. Our research focuses on word and subword co-occurrence networks of Croatian. We first study the structure of Croatian word co-occurrence networks and how their structural properties change when the co-occurrence window size, the corpus size and the removal of stopwords are systematically varied. Below the word level, we constructed syllable networks. The results indicate that Croatian syllable networks exhibit certain properties of small-world networks. Furthermore, we compared Croatian syllable networks with Portuguese and Chinese syllable networks and showed that they have similar properties. The applied goal of this study is to derive, from complex network parameters, a model for assessing the quality of Croatian texts. Such a model could be used to develop software that consistently carries out a desired analysis of a given text, such as assessing the quality of a summary or estimating the quality of a machine translation.
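
The abstract describes word co-occurrence networks whose structure is studied under varying window sizes and stopword removal, and it refers to small-world properties. The following is a minimal illustrative sketch of that idea, not the author's implementation. All function names are hypothetical, and several choices are assumptions made here: the window is taken to be the number of consecutive tokens spanned (so window=2 links only adjacent words), stopwords are dropped before the window is applied, links are undirected and unweighted, and the two measures shown (average clustering coefficient and average shortest-path length) are the quantities commonly used to characterize small-world networks.

```python
from collections import deque
from itertools import combinations


def build_cooccurrence_network(tokens, window=2, stopwords=frozenset()):
    """Return an undirected graph as {word: set of neighbouring words}.

    Assumption: two words are linked when they fall inside the same span of
    `window` consecutive tokens, counted after stopwords are removed.
    """
    words = [t.lower() for t in tokens if t.lower() not in stopwords]
    graph = {w: set() for w in words}
    for i, w in enumerate(words):
        # Look ahead at the next (window - 1) tokens.
        for v in words[i + 1 : i + window]:
            if v != w:  # no self-loops
                graph[w].add(v)
                graph[v].add(w)
    return graph


def average_clustering(graph):
    """Mean local clustering coefficient; nodes with degree < 2 contribute 0."""
    if not graph:
        return 0.0
    total = 0.0
    for nbrs in graph.values():
        k = len(nbrs)
        if k < 2:
            continue
        # Count links that exist among the neighbours of this node.
        links = sum(1 for a, b in combinations(nbrs, 2) if b in graph[a])
        total += 2.0 * links / (k * (k - 1))
    return total / len(graph)


def average_path_length(graph):
    """Mean shortest-path length over all connected ordered node pairs."""
    total, pairs = 0, 0
    for source in graph:
        # Breadth-first search from each node gives unweighted distances.
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in graph[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0


if __name__ == "__main__":
    # Toy token sequence, purely for illustration (not data from the study).
    tokens = ["a", "b", "c", "d"]
    for size in (2, 3):
        g = build_cooccurrence_network(tokens, window=size)
        print(size, average_clustering(g), average_path_length(g))
```

Widening the window adds links between words that are further apart, which tends to raise clustering and shorten paths. A small-world network is usually characterized by high clustering together with short average path length relative to a comparable random graph; that comparison is not included in this sketch.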

complex networks; language networks; natural language processing

Contribution details

8-9.

2014.

published

Host publication details

2014 Adriatic Conference on Graph Theory and Complexity

Vukičević, Damir

Split: PMF Split

Conference details

2014 Adriatic Conference on Graph Theory and Complexity

lecture

25.04.2014-27.04.2014

Split, Croatia

Related research fields

Computing, Information and Communication Sciences