Genre Document Classification Using Flexible Length Phrares (CROSBI ID 524158)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Radošević, Danijel ; Dobša, Jasminka ; Mladenić, Dunja ; Stapić, Zlatko ; Novak, Miroslav
engleski
Genre Document Classification Using Flexible Length Phrares
In this paper we investigate possibility of using phrases of flexible length in genre classification of textual documents as an extension to classic bag of words document representation where documents are represented using single words as features. The investigation is conducted on collection of articles from document data base collected from three different sources representing different genres: newspaper reports, abstracts of scientific articles and legal documents. The investigation includes comparison between classification results obtained by using classic bag of words representation and results obtained by using bag of words extended by flexible length phrases.
flexible length phrases; bag of words representation; genre classification
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
23-28-x.
2006.
nije evidentirano
objavljeno
Podaci o matičnoj publikaciji
Proceedings of 17th International Conference on Information and Intelligent Systems
Aurer, Boris ; Bača, Miroslav
Varaždin: Fakultet organizacije i informatike Sveučilišta u Zagrebu
Podaci o skupu
17th International Conference on Information and Intelligent Systems IIS 2006
predavanje
20.09.2006-22.09.2006
Varaždin, Hrvatska