Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Distributions of Automatically Segmented Phonemes in Croatian Speech (CROSBI ID 539746)

Prilog sa skupa u zborniku | sažetak izlaganja sa skupa | međunarodna recenzija

Martinčić–Ipšić, Sanda ; Grzybek, Peter ; Mačutek, Jan ; Matešić, Mihaela Distributions of Automatically Segmented Phonemes in Croatian Speech // 6. Znanstveni skup s međunarodnim sudjelovanjem "Istraživanja govora" / G. Varošanec-Škarić ; D. Horga (ur.). Zagreb, 2007. str. 83-85

Podaci o odgovornosti

Martinčić–Ipšić, Sanda ; Grzybek, Peter ; Mačutek, Jan ; Matešić, Mihaela

engleski

Distributions of Automatically Segmented Phonemes in Croatian Speech

In this paper we describe an automatic segmentation procedure for Croatian speech, which is based on a monophone speech recognition system and on word level transcriptions of speech signals. Since the transcription of the speech files is on the word level, the utterances have to be segmented on the phone level for the training procedures. For the word segmentation and recognition task, we have developed a phonetic dictionary, where a set of phonetic symbols is used to transcribe the words from the Croatian speech database ; the selected phonemes are derived according to SAMPA symbols proposed for Croatian [2]. The phonetic dictionary comprises all words, including all flective word formats, which occur in the Croatian speech corpora and their phonetic transcriptions [8]. The Croatian orthographic-to-phonetic rules are used for automatic conversion of graphemes into phonemes. The initial phone level segmentation of speech is performed using automatic alignment of speech signals and word transcriptions, which is based on hidden Markov monophone models (HMM) [3]. The automatic segmentation is performed using the forced alignment of the spoken utterance and the corresponding transcription using the monophone speech recognizer. The forced alignment assumes that all phones in the utterance are initially equally segmented. The monophone models were trained by iterations of Baum-Welch algorithm [4]. The Viterbi algorithm was used to find the most likely sequence of HMM states [4]. The results of the Viterbi algorithm are automatically determined time intervals of spoken phones in the speech signals. The automatically segmented phones are used as input for the speech recognition and speech synthesis training procedures. Since we use HMMs for acoustical modeling of Croatian speech in the speech recognition as well as in the speech synthesis, the same automatic segmentation procedure was performed and the same automatically segmented phones are used for training of the acoustic models of both systems [7]. Automatic segmentation results are presented for 13 hours of 25 professional speakers’ speech. Indirect measures used for the automatic speech segmentation performance are phoneme recognition correctness and word recognition correctness and accuracy. Additionally, the Croatian phoneme duration was calculated from automatically segmented phones. The data of the calculated duration for 674746 phones were used to test a theoretical model for phoneme duration, based on Altmann’ s [9] findings for vowel duration. A first attempt is made to apply this model to standard Croatian data and extend it to consonants.

automatic speech segmenation; distributions

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

83-85.

2007.

objavljeno

Podaci o matičnoj publikaciji

6. Znanstveni skup s međunarodnim sudjelovanjem "Istraživanja govora"

G. Varošanec-Škarić ; D. Horga

Zagreb:

978-953-7067-82-3

Podaci o skupu

6. Znanstveni skup s međunarodnim sudjelovanjem "Istraživanja govora"

predavanje

06.12.2007-08.12.2007

Zagreb, Hrvatska

Povezanost rada

Informacijske i komunikacijske znanosti