Data source: CROSBI

Croatian Adult Spoken Language Corpus (CROSBI ID 635384)

Conference contribution in proceedings | conference presentation abstract | international peer review

Hržica, Gordana; Kuvač Kraljević, Jelena. Croatian Adult Spoken Language Corpus. 2016.

Responsibility details

Hržica, Gordana ; Kuvač Kraljević, Jelena

English

Croatian Adult Spoken Language Corpus

Spontaneous spoken language corpora provide unique language data that reveal basic components of spoken language: frequency and distributions of words and structures, item-based variations and contextual patterns (Pusch, 2006). With the growing body of research on language impairment (at different ages and in different conditions), the need for specialised corpora has grown. However, no adult spoken language corpus was available for Croatian. Language sampling for spoken language corpora raises many methodological questions, because speech samples vary greatly with factors such as formality, common ground or knowledge, and various socioeconomic factors. Corpus makers tend to control the diversity of their corpora by limiting the number and type of social situations (e.g. Le Corpus de Référence du Français Parlé (CRFP)) or, in the case of large corpora, by carefully annotating metalinguistic factors of speech sampling (e.g. in the British National Corpus (BNC) or the Santa Barbara Corpus of Spoken American English (CSAE)). This paper presents the building of the Croatian Adult Spoken Language Corpus with respect to those issues. The development of the corpus started in 2014. Today it consists of more than 300 000 tokens in 83 language samples. Samples were collected from spontaneous conversations of adult speakers of Croatian, with respect to the dialectal and socio-economic diversity of Croatia. They were prepared according to the TalkBank ground rules for contribution so that they can become publicly accessible in the relevant subcorpora of TalkBank, the largest collection of spoken language corpora (MacWhinney, 2007).
References:
Pusch, C. D. (2006). Corpora of Spoken Discourse. In: Keith Brown (ed.). Encyclopedia of Language & Linguistics. Oxford: Elsevier. 226-230.
MacWhinney, B. (2007). The TalkBank project. In: J. C. Beal, K. P. Corrigan, H. L. Moisl (eds.) Creating and Digitizing Language Corpora: Synchronic Database, Vol. 1. Houndmills: Palgrave-Macmillan.

spoken language corpus; metalinguistic annotation; participant selection; speech segmentation

Contribution details

2016.

published

Host publication details

Conference details

6th International Conference on Foreign Language Teaching and Applied Linguistics (FLTAL)

lecture

12.05.2016-14.05.2016

Sarajevo, Bosnia and Herzegovina

Related works

not recorded