presented by: xia li

12

SeqMap: mapping massive amount of oligonucleotides to the genome Hui Jiang et al. Bioinformatics (2008) 24: 2395-2396 The GNUMAP algorithm: unbiased probabilistic mapping of oligonucleotides from next- generation sequencing Nathan Clement et al. Bioinformatics (2010) 26: 38-45 Presented by: Xia Li

Upload: alaina

Post on 20-Feb-2016

42 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

DESCRIPTION

SeqMap : mapping massive amount of oligonucleotides to the genome Hui Jiang et al. Bioinformatics (2008) 24: 2395-2396 The GNUMAP algorithm: unbiased probabilistic mapping of oligonucleotides from next-generation sequencing Nathan Clement et al. Bioinformatics (2010) 26: 38-45 . - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Presented by: Xia Li

SeqMap: mapping massive amount of oligonucleotides to the genome

Hui Jiang et al. Bioinformatics (2008) 24: 2395-2396

The GNUMAP algorithm: unbiased probabilistic mapping of oligonucleotides

from next-generation sequencing Nathan Clement et al. Bioinformatics (2010) 26: 38-45

Presented by: Xia Li

Page 2: Presented by: Xia Li

Short-read mapping softwareSoftware Technique ReferenceGNUMAP Hashing refs + base quality +

repeated regions Clement et al., 2010

Novoalign Hashing refs Novocraft, unpublishedSOAP Hashing refs Li et al., 2008SeqMap Hashing reads Jiang et al., 2008RMAP Hashing reads + read quality Smith et al., 2008Eland Hashing reads Cox, unpublishedBowtie BWT Langmead et al., 2009

Slider lexicographically sorting + base quality Malhis et al., 2009

Page 3: Presented by: Xia Li

SeqMap

• Motivation– Hashing genome usually needs large memory (e.g.

SOAP needs 14GB memory when mapping to the human genome)

– Allow more substitutions and insertion/deletion

Page 4: Presented by: Xia Li

SeqMap

• Pigeonhole principle– Spaced seed alignment– ELAND, SOAP, RMAP

• Hash reads• Insertion/deletion:

2/4 combinations with1/2 shifted one nucleotideto its left or right

Short Read

Short read look up table (indexed by 2 parts)

Split into 4 parts

All combinations of 2/4 parts

Reference GenomeImage credit: J. Ruan

Page 5: Presented by: Xia Li

Experiment & Result

Page 6: Presented by: Xia Li

Experiment & Result

• Deal with more substitutions and insertion/deletion

Randomly generate a DNA sequence of a length of 1Mb, add 100Kb random substitutions, N’s and insertion/deletions

Page 7: Presented by: Xia Li

GNUMAP

• Motivation– Base uncertainty

• Such as nearly equal or low probabilities to A, C, G or T• Filter low quality reads [RMAP] -> discard up to half of the

reads (Harismendy et al., 2009)– Repeated regions in the genome

• Discard them -> loss of up to half of the data (Harismendy et al., 2009)

• Record one -> unequal mapping to some of the repeat regions

• Record all -> each location having 3 times the correct score

Page 8: Presented by: Xia Li

GNUMAP

• Flow-chart

Page 9: Presented by: Xia Li

Probabilistic Needleman-Wunsch

Page 10: Presented by: Xia Li

Alignment Score

ACTGAACCATACGGGTACTGAACCATGAA

AACCAT

GGGTACAACCATTAC

Read from sequencer

GGGTACAACCAT

Read is added to both repeat regions proportionally to their match qualityweighted by its # of occurrences in the genome

Slide credit: N. Clement

Page 11: Presented by: Xia Li

Experiment & Result

Page 12: Presented by: Xia Li

Comments

• SeqMap– Pos: dealing with more

substations/insertion/deletion– Cons: memory consuming, not fast

• GNUMAP– Pos: consider base quality and repeated regions ->

generate more useful information and achieves best performance (~15% increase)

– Cos: memory consuming, slow, more noise

1 High-dimensional Similarity Join Presented by Yang Xia Wongsodihardjo, Hariyanto Wang Hao

ZHENLONG LI LI CURRICULUM VITAE 5 / 14 Last updated on 8/15/2017 10. Liu K., Nebert D., Huang Q., Xia J., Li Z., 2013. Cloud-enabling GEOSS clearinghouse. In Yang C., Huang

Multi-layer Orthogonal Codebook for Image Classification Presented by Xia Li

Li Wang, Yaozong Gao, Feng Shi, Gang Li, Dinggang Shen Presented by Li Wang 09-18-2014

Presented by Li-xia Gao 2010.07.18

Research Article Data mining of cellular automata’s ...geosimulation.cn/Papers/IJGIS2004_CA_MiningRules.pdf · Data mining of cellular automata’s transition rules XIA LI School

Silent Spring Rachel Carson Presented by Li Qiaohui

EECS 110 Projects 1&2 Juan Li – Project 1 Ning Xia – Project 2

ISSN 1007-9327 (print) ISSN 2219-2840 (online) World Journal … · 2017. 9. 30. · Li M, Chen L, Liu LM, Li YL, Li BA, Li B, Mao YL, Xia LF, Wang T, Liu YN, Li Z, Guo TS Case Control

Presented by Qian Xia Wuxi Taibo Experimental School

icRS Cities 2019 - Sustainabilityicrsconf.com/images/icrs/icRS_2019_program.pdfUtsav Shashvatt Singh, Navdeep; Deep, Shehnaz; Bhardwaj, Anjani Xia, Xin Linhao Li; Wengui Li; Guangcheng

Bug Isolation via Remote Program Sampling Ben Liblit, Alex Aiken, Alice X.Zheng, Michael I.Jordan Presented by: Xia Cheng

Liste der Formeln akutell - Dr. Noyer Abotheken TCM · PDF fileF92 Ban Xia Liu Jun Zi Tang Ban Xia Liu Jun Zi ... F188 Cheng Yang Li Lao Tang ¡¢ e ... F218 Cinnamon & Angelica Formula

BA 493 Summer (Xia) Li Adam Burlison BA 493 Summer (Xia) Li Adam Burlison

Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University

Study of Actual State of Wireless Technology in Residential Environments Mo Sha Peng Li Jing Xia

· colegiul national "dinicu golescu" c,împul ung muscel nr. 5387 /13.09.2016 clasa xia xia xia xia xia xib xib xib xib -1300 data 12.09.2016 13.09.2016

Semantic Web Presented by Xia Li. 2 Outline Introduction Examples Semantic Web technologies Applications Concerns

Speaker: Li-xia Gao Supervisor: Jufang He Department of Rehabilitation Scienc, Hong Kong Polytechnic University 06/12/2010

Development of a Particle Flow Algorithms (PFA) at Argonne Presented by Lei Xia ANL - HEP

Chapter 4 How to use cloud computing? Kai Liu, Qunying Huang, Jizhe Xia, Zhenlong Li,

Microtubule dynamics: Caps, catastrophes, and coupled hydrolysis Presented by XIA,Fan

Oct 14, 2014 Lirong Xia Recommender systems acknowledgment: Li Zhang, UCSC

Research of Mixed Teaching Design Technology 201… · Research of Mixed Teaching Design Technology . Jun Li, Xu Liang, Xia Li, Xiumei Li . Zibo Normal College, Zibo, Shandong, 255130

XIA: Efficient Support for Evolvable Internetworking · XIA: Efﬁcient Support for Evolvable Internetworking Dongsu Han Ashok Anand† Fahad Dogar Boyan Li Hyeontaek Lim Michel Machado

Zhongxing Ming Dan Li Chumei Xia Mingwei Xu 2014-2-5 1 Tsinghua University

Measuring Sovereign Contagion in Europe Presented by Jingjing XIA Caporin, Pelizzon, Ravazzolo, and Rigobon (2013)

Calcium Oscillation in the Pollen Tube Growth Presented by: XIA,Fan03050130

Web Transfer Latency Study Presented by Ye Xia WebTP Presentation, Aug. 28, 2000 Paper Presented: Paul Barford and Mark Crovella, “Critical Path Analysis

Retrieval Multimedia Data from Disks Presented by Yuni Xia

Presented by H. Li 1 J.L. Chen 1 , J.G. Li 1 , Z.X. Li 2 [email protected]

Xiangtai Li, Xia Li, Ansheng You, Li Zhang, Guangliang

Guo Feng, CIE Yuyang (Adrian) Xia, Intel Tianyi Gao, Baidu ... · Yuyang (Adrian) Xia, Intel Tianyi Gao, Baidu Enso Li (李典林), Tencent Stanley Liu (刘水旺), Alibaba Harmonization

ISSN 1007-9327 (print) ISSN 2219-2840 (online) World Journal of - … · quantification system with Roche CAP/CTM system Li M, Chen L, Liu LM, Li YL, Li BA, Li B, Mao YL, Xia LF,