Flexible Length Phrases in Document Classification (CROSBI ID 526808)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Radošević, Danijel ; Dobša, Jasminka ; Mladenić, Dunja
engleski
Flexible Length Phrases in Document Classification
In this paper we investigate possibility of using phrases of flexible length in classification of textual documents as an extension to classic bag of words document representation where documents are represented using single words as index terms. The investigation is conducted on collection of articles from Večernji list. It is shown that usage of flexible length phrases improves precision of automatic document classification and there are indications that such approach could be used for genre classification.
documents classification; bag of words representation; flexible length phrases
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
457-462.
2006.
objavljeno
Podaci o matičnoj publikaciji
Proceedings of 29th International Conference of Information Technology Interfaces, ITI 2007
Dobrić Vesna
Zagreb: Universtity Computing Centre - SRCE
953-7138-05-4
1330-1012
Podaci o skupu
ITI 2006, Cavtat/Dubrovnik, June 19-22, 2006, Sveučilište u Zagrebu i Sveučilišni računalni centar (SRCE), 2006.
predavanje
19.06.2006-22.06.2006
Cavtat, Hrvatska; Dubrovnik, Hrvatska