Node Selectivity as a Measure for Graph-Based Keyword Extraction in Croatian News (CROSBI ID 617186)
Unpublished conference participation | unpublished conference contribution | international peer review
Responsibility data
Beliga, Slobodan ; Martinčić-Ipšić, Sanda
English
Node Selectivity as a Measure for Graph-Based Keyword Extraction in Croatian News
In this paper, we introduce selectivity-based keyword extraction as a new unsupervised method for graph-based keyword extraction. The node selectivity measure is defined as the average weight distribution on the links of a single node and is used in the keyword candidate extraction procedure. In particular, we propose extracting three-word keyword sequences and show that the obtained results compare favourably with previously published results. Experiments were conducted on a dataset of Croatian news articles with keywords annotated by human experts. The selectivity-based keyword extraction method achieved an average F2 score of 25.32% on isolated documents and an F2 score of 42.07% on a document collection. The proposed method is derived solely from statistical and structural information, which is reflected in the topological properties of the text network. Furthermore, comparative results indicate that our simple graph-based method provides results that are comparable with more complex supervised and unsupervised methods, as well as with human annotators.
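As an illustration of the measure named in the abstract, the following is a minimal sketch rather than the authors' implementation. It assumes that "average weight distribution on the links of a single node" can be read as node strength (the sum of link weights) divided by node degree (the number of links), and that the text network is an undirected co-occurrence graph over adjacent words. The function names `build_cooccurrence_graph`, `node_selectivity`, and `f_beta` are hypothetical. The F-beta helper uses the standard definition, with beta = 2 giving the F2 score reported above.

```python
from collections import defaultdict


def build_cooccurrence_graph(tokens):
    """Build an undirected weighted graph from a token sequence.

    Assumption: two words are linked when they appear next to each other,
    and the link weight counts how often that happens.
    """
    weights = defaultdict(int)
    for left, right in zip(tokens, tokens[1:]):
        if left != right:  # skip self-loops from repeated words
            weights[frozenset((left, right))] += 1
    return dict(weights)


def node_selectivity(weights):
    """Return strength / degree for every node (assumed reading of selectivity)."""
    strength = defaultdict(int)
    degree = defaultdict(int)
    for edge, weight in weights.items():
        for node in edge:
            strength[node] += weight  # sum of weights on the node's links
            degree[node] += 1         # number of distinct links
    return {node: strength[node] / degree[node] for node in degree}


def f_beta(precision, recall, beta=2.0):
    """Standard F-beta score; beta=2 weights recall more heavily than precision."""
    if precision == 0 and recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


if __name__ == "__main__":
    tokens = "a b a b c".split()
    graph = build_cooccurrence_graph(tokens)
    ranked = sorted(node_selectivity(graph).items(), key=lambda kv: -kv[1])
    print(ranked)
    print(f_beta(precision=1.0, recall=0.5))
```

In this toy input, the word that always co-occurs with the same neighbour gets the highest score, which is the kind of node a selectivity-based ranking would pick as a keyword candidate. A directed variant would compute the same ratio separately over incoming and outgoing links.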
keyword extraction; keyword candidate; keyword ranking; keyword expansion; node selectivity; Croatian news; complex network
not recorded
not recorded
not recorded
not recorded
not recorded
not recorded
Contribution data
not recorded
not recorded
Conference data
6th International Conference on Information Technologies and Information Society (ITIS2014)
lecture
05.11.2014-07.11.2014
Šmarješke Toplice, Slovenia