Automatic Keyphrase Extraction from Croatian Newspaper Articles (CROSBI ID 556322)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Ahel, Renee ; Dalbelo Bašić, Bojana ; Šnajder, Jan
engleski
Automatic Keyphrase Extraction from Croatian Newspaper Articles
Keyphrases provide a way to summarize documents and enable cross-category retrieval. The paper describes a robust system for automatic keyphrase extraction from newspaper articles in Croatian language. Keyphrase candidates are generated based on linguistic and statistical features, and naïve Bayes classifier is used to select the best keyphrases among the candidates. A prediction model is built using training documents with human-assigned keyphrases. System performance is measured on a corpus of newspaper articles, by comparing the automatically extracted keyphrases with those assigned by professional indexers. In absence of comparable results, we consider our results to be of modest performance.
keyphrase extraction; naïve Bayes classifier; Croatian language
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
207-218.
2009.
objavljeno
Podaci o matičnoj publikaciji
The Future of Information Sciences, Digital Resources and Knowledge Sharing
Stančić, Hrvoje ; Selja, Sanja ; Bawden, David ; Lasić-Lazić, Jadranka ; Slavić, Aida
978-953-175-305-0
Podaci o skupu
2nd International Conference The Future of Information Sciences INFuture2009
predavanje
04.11.2009-06.11.2009
Zagreb, Hrvatska