crta
Hrvatska znanstvena Sekcija img
bibliografija
3 gif
 Naslovna
 O projektu
 FAQ
 Kontakt
4 gif
Pregledavanje radova
Jednostavno pretraživanje
Napredno pretraživanje
Skupni podaci
Upis novih radova
Upute
Ispravci prijavljenih radova
Ostale bibliografije
Slični projekti
 Bibliografske baze podataka

Pregled bibliografske jedinice broj: 439599

Zbornik radova

Autori: Bago, Petra; Boras, Damir; Ljubešić, Nikola
Naslov: First Steps Toward Developing a System for Terminology Extraction
( First Steps Toward Developing a System for Terminology Extraction )
Izvornik: INFuture2009: Digital Resources and Knowledge Sharing / Stančić, Hrvoje ; Seljan, Sanja ; Bawden, David ; Lasić-Lazić, Jadranka ; Slavić, Aida (ur.). - Zagreb : Department of Information Sciences, Faculty of Humanities and Social Sciences, University of Zagreb , 2009. 197-206 (ISBN: 978-953-175-355-5).
Skup: 2nd International Conference “The Future of Information Sciences: INFuture2009 – Digital Resources and Knowledge Sharing”
Mjesto i datum: Zagreb, Hrvatska, 4-6.11.2009.
Ključne riječi: terminology extraction; data sample; log-likelihood ratio test
( terminology extraction; data sample; log-likelihood ratio test )
Sažetak:
The aim of this paper is to describe first steps in developing a system for terminology extraction. First a data sample is built from synopses of doctoral theses at the Faculty of Humanities and Social Sciences, University of Zagreb, accepted in the period from 2004 to 2009 written mostly in Croatian language. Data sample consists of 420 documents and 338, 706 tokens. A small sample was manually tagged for terminology to be used in an initial experiment. The approach for terminology extraction is knowledge-driven and consists of differential analysis of reference and domain-specific corpora. Specific method used is log-likelihood ratio test. Experiment deals with different reference corpora and linguistic pre-processing. First results are promising. Further research guidelines are discussed.
Vrsta sudjelovanja: Predavanje
Vrsta prezentacije u zborniku: Cjeloviti rad (više od 1500 riječi)
Vrsta recenzije: Međunarodna recenzija
Projekt / tema: 130-1301679-1380
Izvorni jezik: eng
Kategorija: Znanstveni
Znanstvena područja:
Informacijske i komunikacijske znanosti
Upisao u CROSBI: pbago@ffzg.hr (pbago@ffzg.hr), 9. Sij. 2010. u 20:26 sati



Verzija za printanje   za tiskati


upomoc
foot_4