Data source: CROSBI

Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval (CROSBI ID 145730)

Journal article | original scientific paper | international peer review

Skobeltsyn, Gleb ; Luu, Toan ; Podnar Žarko, Ivana ; Rajman, Martin ; Aberer, Karl Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval // Future generation computer systems, 25 (2009), 1; 89-99. doi: 10.1016/j.future.2008.03.006

Authorship

Skobeltsyn, Gleb ; Luu, Toan ; Podnar Žarko, Ivana ; Rajman, Martin ; Aberer, Karl

English

Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval

In this paper, we present a query-driven indexing/retrieval strategy for efficient full text retrieval from large document collections distributed within a structured P2P network. Our indexing strategy is based on two important properties: (1) the generated distributed index stores posting lists for carefully chosen indexing term combinations that are frequently present in user queries, and (2) the posting lists containing too many document references are truncated to a bounded number of their top-ranked elements. These two properties guarantee acceptable latency and bandwidth requirements, essentially because the number of indexing term combinations remains scalable and the posting lists transmitted during retrieval never exceed a constant size. A novel index update mechanism efficiently handles adding of new documents to the document collection. Thus, the generated distributed index corresponds to a constantly evolving query-driven indexing structure that efficiently follows current information needs of the users and changes in the document collection. We show that the size of the index and the generated indexing/retrieval traffic remains manageable even for Web-size document collections at the price of a marginal loss in precision for rare queries. Our theoretical analysis and experimental results provide convincing evidence about the feasibility of the query-driven indexing strategy for large scale P2P text retrieval.
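The two properties named in the abstract can be illustrated with a small single-process sketch. This is not the authors' implementation: a plain dictionary stands in for the distributed index, term frequency stands in for the ranking function, and the names `popular_keys`, `build_index`, `MAX_POSTINGS`, `MIN_QUERY_FREQ`, and `MAX_KEY_SIZE` are hypothetical, chosen only for illustration.

```python
from collections import Counter
from itertools import combinations

# All three constants are assumptions for this sketch, not values from the paper.
MAX_POSTINGS = 3      # bound on the length of any stored posting list
MIN_QUERY_FREQ = 2    # a term combination must appear in this many queries
MAX_KEY_SIZE = 2      # largest number of terms in one indexing key


def popular_keys(query_log):
    """Return the term combinations that occur frequently in the query log."""
    counts = Counter()
    for query in query_log:
        terms = sorted(set(query.lower().split()))
        for size in range(1, MAX_KEY_SIZE + 1):
            for combo in combinations(terms, size):
                counts[combo] += 1
    return {key for key, n in counts.items() if n >= MIN_QUERY_FREQ}


def build_index(documents, keys):
    """Map each key to its top-ranked postings, truncated to MAX_POSTINGS."""
    index = {}
    for key in keys:
        postings = []
        for doc_id, text in documents.items():
            words = text.lower().split()
            if all(term in words for term in key):
                # Assumed ranking: summed term frequency of the key's terms.
                score = sum(words.count(term) for term in key)
                postings.append((score, doc_id))
        # Highest score first; ties broken by document id for determinism.
        postings.sort(key=lambda p: (-p[0], p[1]))
        index[key] = postings[:MAX_POSTINGS]
    return index
```

In a structured P2P network each key would presumably be hashed to the peer responsible for it. The sketch keeps only the part that matters here: keys come from the query log rather than from the documents, and no posting list grows beyond a fixed bound.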

P2P; DHT; IR; Text retrieval; P2PIR; Scalability; Query-driven indexing; Distributed index; Index updates

Publication details

25 (1)

2009

89-99

published

0167-739X

10.1016/j.future.2008.03.006

Related research fields

Electrical engineering, Computing, Information and communication sciences
