Data source: CROSBI

Algorithms for Layout Phase of De Novo Genome Assembly (CROSBI ID 656856)

Conference contribution in proceedings | extended abstract of a conference presentation | international peer review

Vaser, Robert ; Šikić, Mile Algorithms for Layout Phase of De Novo Genome Assembly // Second International Workshop on Data Science. 2017. pp. 86-87

Responsibility details

Vaser, Robert ; Šikić, Mile

English

Algorithms for Layout Phase of De Novo Genome Assembly

DNA sequencing and assembly are among the crucial parts of biological and medical research. Third generation sequencing technologies have facilitated more contiguous assemblies thanks to the increase in read fragment lengths. Although the accuracy of such fragments is much lower than that of predecessor technologies, graph-based algorithms, most notably those following the overlap-layout-consensus paradigm, are able to assemble small to medium-sized genomes even without error correction. Here we present Rala, a standalone layout model intended for assembly of raw reads produced by third generation sequencing platforms. It consists of two parts: fragment preprocessing inspired by the assembler HINGE [1], and assembly graph simplifications as described in the assembler Miniasm [2]. In preprocessing, pairwise overlaps between fragments are used to generate coverage graphs, which make it possible to distinguish between fragments. Fragments whose coverage graphs have sharp dips or peaks are chimeric, meaning they consist of two distinct parts of the genome, and are removed from the fragment set. Hills in coverage graphs indicate repetitive genomic regions and show whether a fragment bridges those regions. Overlaps between fragments that do not bridge repeats are removed as well. Afterwards, the assembly graph is built and simplified with transitive reduction, trimming, bubble popping and a heuristic which untangles leftover junctions in the graph. The whole implementation is publicly available at https://github.com/rvaser/rala under the MIT licence. As a side result, we show that the percentage of chimeric reads produced by either the Pacific Biosciences or Oxford Nanopore Technologies platforms is correlated with the fragment length.
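
The coverage-graph idea from the abstract can be illustrated with a minimal Python sketch. It is not Rala's implementation: the function names `coverage_profile` and `looks_chimeric`, the half-open interval convention, and the `ratio` threshold of 0.5 are all assumptions made for illustration, and only the "sharp dip" case is covered.

```python
from typing import List, Tuple


def coverage_profile(read_length: int, overlaps: List[Tuple[int, int]]) -> List[int]:
    """Per-base coverage of one read.

    `overlaps` holds half-open intervals [begin, end) on the read, one per
    pairwise overlap with another read (hypothetical input format).
    """
    diff = [0] * (read_length + 1)
    for begin, end in overlaps:
        begin = max(0, begin)
        end = min(read_length, end)
        if begin < end:
            diff[begin] += 1  # coverage rises where an overlap starts
            diff[end] -= 1    # and falls where it ends
    coverage = []
    depth = 0
    for delta in diff[:read_length]:
        depth += delta
        coverage.append(depth)
    return coverage


def looks_chimeric(coverage: List[int], ratio: float = 0.5) -> bool:
    """Flag a sharp dip: a base whose coverage is below `ratio` times the
    smaller of the highest coverage seen on its left and on its right.
    The threshold is an illustrative assumption, not a published value.
    """
    n = len(coverage)
    left_max = [0] * n   # highest coverage strictly left of each base
    running = 0
    for i in range(n):
        left_max[i] = running
        running = max(running, coverage[i])
    right_max = [0] * n  # highest coverage strictly right of each base
    running = 0
    for i in range(n - 1, -1, -1):
        right_max[i] = running
        running = max(running, coverage[i])
    return any(
        coverage[i] < ratio * min(left_max[i], right_max[i]) for i in range(n)
    )


# Two groups of overlaps that barely meet in the middle of the read.
suspect = coverage_profile(10, [(0, 4), (0, 4), (0, 5), (5, 10), (6, 10), (6, 10)])
# Overlaps that span the read without a gap in support.
regular = coverage_profile(10, [(0, 7), (2, 10), (0, 10)])
print(suspect, looks_chimeric(suspect))
print(regular, looks_chimeric(regular))
```

A read flagged this way would be a candidate for removal from the fragment set before the assembly graph is built. A real tool would likely work on smoothed coverage and tune the threshold per dataset.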

de novo assembly, layout phase

Contribution details

86-87.

2017.

published

Host publication details

Second International Workshop on Data Science

Conference details

Second International Workshop on Data Science

poster

30.11.2017-30.11.2017

Zagreb, Croatia

Research areas

Biology, Computer Science