crta
Hrvatska znanstvena Sekcija img
bibliografija
3 gif
 Naslovna
 O projektu
 FAQ
 Kontakt
4 gif
Pregledavanje radova
Jednostavno pretraživanje
Napredno pretraživanje
Skupni podaci
Upis novih radova
Upute
Ispravci prijavljenih radova
Ostale bibliografije
Slični projekti
 Bibliografske baze podataka

Pregled bibliografske jedinice broj: 697756

Ostalo

Autori: Martinčić-Ipšić, Sanda
Naslov: Language Networks
Izvornik: HDJT - NLP Kruzok, FER
Vrsta: Predavanje
Godina: 2014
Ključne riječi: complex networks; language networks
Sažetak:
Language can be viewed as a complex network if it is presented as system of interacting linguistic’s units. Network analysis provides mechanisms that can reveal new patterns in a complex structure and can thus be applied to the study of the patterns in language structures. This, in turn, may contribute to a better understanding of the organization and the structure and evolution of a language. In our research we are focused upon the word and sub-word co-occurrence networks of Croatian. Initially, we study the structure of Croatian word co-occurrence networks ; the change of network structure properties by systematically varying the co-occurrence window sizes, the corpus sizes and the removal of stopwords. On the word co-occurrence level we compare the properties of linguistic networks for Croatian, English and Italian languages. We constructed co- occurrence networks from parallel text corpora, consisting of the translations of five books in the three languages. The networks’ measures across the three studied languages differ particularly in the average path length and average clustering coefficient. For the text differentiation we study the linguistic networks from different text types: literature, blogs and shuffled texts. The linguistic networks are constructed from texts as directed and weighted co-occurrence networks of words. The comparison of the networks structure is performed at global level in terms of: average node degree, average shortest path length, diameter, clustering coefficient, density and number of components. Furthermore, we perform analysis on the local level by comparing the rank plots of in and out degree, in and out strength and in and out selectivity. The selectivity-based measure points to the differences between the structure of the networks from different text types. Below the word level we constructed syllable networks. The Croatian syllable networks exhibit small world properties. Additionally, we compared networks form syllables and corresponding words. The results indicate there are some structural differences in their properties.
Izvorni jezik: ENG
Znanstvena područja:
Računarstvo,Informacijske i komunikacijske znanosti
URL Internet adrese: http://hnk.ffzg.hr/hdjt/default.html
Upisao u CROSBI: Sanda Martinčić - Ipšić (smarti@inf.uniri.hr), 21. Svi. 2014. u 16:39 sati



Verzija za printanje   za tiskati


upomoc
foot_4