Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Language Networks (CROSBI ID 774884)

Druge vrste radova | ostalo

Martinčić-Ipšić, Sanda Language Networks // HDJT - NLP Kruzok, FER. 2014.

Podaci o odgovornosti

Martinčić-Ipšić, Sanda

engleski

Language Networks

Language can be viewed as a complex network if it is presented as system of interacting linguistic’s units. Network analysis provides mechanisms that can reveal new patterns in a complex structure and can thus be applied to the study of the patterns in language structures. This, in turn, may contribute to a better understanding of the organization and the structure and evolution of a language. In our research we are focused upon the word and sub-word co-occurrence networks of Croatian. Initially, we study the structure of Croatian word co-occurrence networks ; the change of network structure properties by systematically varying the co-occurrence window sizes, the corpus sizes and the removal of stopwords. On the word co-occurrence level we compare the properties of linguistic networks for Croatian, English and Italian languages. We constructed co- occurrence networks from parallel text corpora, consisting of the translations of five books in the three languages. The networks’ measures across the three studied languages differ particularly in the average path length and average clustering coefficient. For the text differentiation we study the linguistic networks from different text types: literature, blogs and shuffled texts. The linguistic networks are constructed from texts as directed and weighted co-occurrence networks of words. The comparison of the networks structure is performed at global level in terms of: average node degree, average shortest path length, diameter, clustering coefficient, density and number of components. Furthermore, we perform analysis on the local level by comparing the rank plots of in and out degree, in and out strength and in and out selectivity. The selectivity-based measure points to the differences between the structure of the networks from different text types. Below the word level we constructed syllable networks. The Croatian syllable networks exhibit small world properties. Additionally, we compared networks form syllables and corresponding words. The results indicate there are some structural differences in their properties.

complex networks; language networks

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

HDJT - NLP Kruzok, FER

2014.

nije evidentirano

objavljeno

Povezanost rada

Računarstvo, Informacijske i komunikacijske znanosti

Poveznice