Similarity based approach to protein domain architecture prediction (CROSBI ID 493458)
Neobjavljeno sudjelovanje sa skupa | neobjavljeni prilog sa skupa | međunarodna recenzija
Podaci o odgovornosti
Vlahoviček, Kristian ; Kajan, Laszlo ; Pongor, Sandor
engleski
Similarity based approach to protein domain architecture prediction
Increasing amount of primary biological information originating from genome sequencing projects calls for new approaches to large-scale classification and annotation methods. We present a method based on sequence similarity that can be applied to both functional characterization of whole proteins as well as prediction of domain architecture. The method consists of building an exemplar-based database and preprocessing it, by running a database vs. database comparison, to yield threshold values of biologically significant similarities [1-3]. The annotation of domains is then carried out by comparing an unknown query sequence against the database and processing the search output using the predetermined thresholds. The method performance evaluation shows overall prediction success rate of 90% on a set of 140 000 protein domains divided in 2000 domain groups, each containing 3-7000 members, with median specificity and sensitivity per group of 98% and 93%, respectively. The ease of implementation, prediction speed and method robustness make it an interesting candidate for large-scale annotation projects, as it involves minimal manual intervention in both training and prediction. The database of annotated protein domains - SBASE, and the domain architecture prediction system are available via the www interface (figure 1) at http://www.icgeb.org/sbase.
Protein domain; Domain architecture; Domain prediction; database
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
nije evidentirano
nije evidentirano
Podaci o skupu
European Conferrence on Computational Biology
poster
06.10.2002-09.10.2002
Saarbrücken, Njemačka