Identification of persons and business subjects in text documents based on lexical analysis and scoring system (CROSBI ID 549565)
Conference paper in proceedings | original scientific paper | international peer review
Responsibility details
Lončar, Goran ; Bogunović, Nikola
English
The number of text documents and news items published on the Internet every day is growing rapidly, making it very difficult to find useful information effectively. The paper presents a system that identifies persons and business subjects in newly published text documents and matches them with persons and businesses previously stored in a database. The implemented system employs lexical analysis and a scoring algorithm to tag the input documents with the subjects' IDs from the database, enabling easy and effective search. Consequently, only the search object and its surrounding context are displayed to the end user. The system is currently in successful use in a web portal.
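The abstract does not spell out the lexical analysis or the scoring rules, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a hypothetical in-memory `SUBJECTS` table standing in for the database, a hypothetical scoring rule (each matched name variant contributes its word count), and a hypothetical threshold of 2 for tagging a document.

```python
import re

# Hypothetical stand-in for the database: subject id -> known name variants.
SUBJECTS = {
    101: ["Ivan Horvat", "Horvat"],
    202: ["Adria Trade", "Adria"],
}

def find_mentions(text, subjects):
    """Return (subject_id, start, end, weight) for every name variant found.

    Assumed weighting: a variant's weight is its number of words, so a
    full name counts for more than a lone surname or short name.
    """
    mentions = []
    for subject_id, variants in subjects.items():
        for variant in variants:
            pattern = r"\b" + re.escape(variant) + r"\b"
            weight = len(variant.split())
            for m in re.finditer(pattern, text, flags=re.IGNORECASE):
                mentions.append((subject_id, m.start(), m.end(), weight))
    return mentions

def tag_document(text, subjects, threshold=2):
    """Sum the weights per subject and keep subjects scoring >= threshold."""
    scores = {}
    for subject_id, _start, _end, weight in find_mentions(text, subjects):
        scores[subject_id] = scores.get(subject_id, 0) + weight
    return {sid: score for sid, score in scores.items() if score >= threshold}

def contexts(text, subjects, subject_id, window=30):
    """Return the text surrounding each mention of one subject."""
    return [
        text[max(0, start - window):end + window]
        for sid, start, end, _weight in find_mentions(text, subjects)
        if sid == subject_id
    ]

if __name__ == "__main__":
    doc = "Ivan Horvat joined Adria Trade. Horvat said the deal was done."
    print(tag_document(doc, SUBJECTS))
    print(contexts(doc, SUBJECTS, 202, window=10))
```

In this sketch a single ambiguous word such as "Adria" on its own stays below the threshold, which illustrates why a score, rather than a bare string match, might be used to decide whether a document is tagged with a subject's ID.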
data mining; information retrieval; lexical analysis
Contribution details
35-38.
2009.
published
Parent publication details
MIPRO 2009, Proceedings Vol. III, CTS & CIS
Bogunović, Nikola ; Ribarić, Slobodan
Rijeka: Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO
978-953-233-045-8
Conference details
MIPRO 2009
lecture
25.05.2009-29.05.2009
Opatija, Croatia