Levenshtein distance and siRNA

Upload: teva

Post on 15-Feb-2016


DESCRIPTION

Levenshtein distance and siRNA: an algorithm for determining functional siRNA. Short-interfering RNA interferes with mRNA and inhibits specific proteins from being produced. How proteins are made: transcription (DNA → RNA), then translation (mRNA → protein).

TRANSCRIPT

Slide 1

Levenshtein distance and siRNA

An algorithm for determining functional siRNA

What is siRNA?

Short-interfering RNA
Interferes with mRNA
Inhibits specific proteins from being produced

http://fig.cox.miami.edu/~cmallery/255/255hist/mcb4.1.dogma.jpg
http://www.nature.com/news/2003/030616/full/030616-12.html

How proteins are made

Transcription: DNA → RNA
Translation: mRNA → protein
Protein!

Some proteins we would like to suppress
Example: caffeine genes knocked out in coffee plants.


The Problem

Which strings of siRNA effectively silence genes?
Too many to test every single one
Tried combinatorics
Results: about 25% of all 20-nt strings fit the ideal properties of functional siRNA
BUT this still amounts to about 274,877,907,000 strings (a quarter of the 4^20 possible sequences)
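The count on this slide can be checked with a few lines of arithmetic. The sketch below assumes a four-letter alphabet (A, C, G, U) and a strand length of 20; the names sequence_count, total_20nt and candidates_20nt are illustrative and do not come from the slides.

```cpp
#include <cstdint>

// Number of distinct strings of the given length over {A, C, G, U}: 4^length.
constexpr std::uint64_t sequence_count(unsigned length) {
    return length == 0 ? 1 : 4 * sequence_count(length - 1);
}

// All 20-nt strings, and the roughly 25% quoted as fitting the ideal properties.
constexpr std::uint64_t total_20nt = sequence_count(20);
constexpr std::uint64_t candidates_20nt = total_20nt / 4;

// Compile-time checks of the arithmetic.
static_assert(total_20nt == 1099511627776ULL, "4^20 = 2^40");
static_assert(candidates_20nt == 274877906944ULL, "one quarter of 4^20 = 2^38");
```

The exact quarter is 274,877,906,944, which the slide rounds to 274,877,907,000.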



Levenshtein Distance

Calculates a distance between two strings by comparing character n of string 1 with character n of string 2.
The distance is the minimum number of substitutions, insertions, and deletions required to transform one string into the other.

1. Vert JP, Foveau N, Lajaunie C, Vandenbrouck Y: An accurate and interpretable model for siRNA efficacy prediction. BMC Bioinformatics. 2006, 7:520.
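As a point of reference, the unweighted Levenshtein distance can be sketched in C++ with the usual dynamic-programming recurrence. This is a textbook version, not the code from the presentation. The function name levenshtein is illustrative.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Classic Levenshtein distance: unit cost for each substitution,
// insertion, and deletion. Only two rows of the DP table are kept.
std::size_t levenshtein(const std::string& a, const std::string& b) {
    std::vector<std::size_t> prev(b.size() + 1), curr(b.size() + 1);

    // Turning the empty prefix of a into the first j characters of b takes j insertions.
    for (std::size_t j = 0; j <= b.size(); ++j) prev[j] = j;

    for (std::size_t i = 1; i <= a.size(); ++i) {
        curr[0] = i;  // turning the first i characters of a into "" takes i deletions
        for (std::size_t j = 1; j <= b.size(); ++j) {
            const std::size_t substitution = prev[j - 1] + (a[i - 1] == b[j - 1] ? 0 : 1);
            const std::size_t deletion = prev[j] + 1;
            const std::size_t insertion = curr[j - 1] + 1;
            curr[j] = std::min({substitution, deletion, insertion});
        }
        std::swap(prev, curr);
    }
    return prev[b.size()];
}
```

For two 20-nt strands that differ at a single position, this returns 1 regardless of where the mismatch sits or which nucleotides are involved.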

Modifications

Used weights from Vert's paper [1]
Each substitution no longer increments the distance by a uniform amount

The substitution cost depends on:
  Position of nucleotide substitution
  Type of substitution

UCCAUAGUAGAACGUUCGGU
1. Position of nucleotide
2. Type of nucleotide substitution

Algorithm

C++ implementation
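A minimal sketch of how a substitution cost that depends on position and type could be wired into the distance is given below. It is not the presenter's implementation, and every number in it is a placeholder. The transition/transversion costs in type_weight, the flat gap_cost, and the caller-supplied position_weight table are all hypothetical stand-ins for the weights taken from Vert et al. [1].

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical type weight: transitions (A<->G, C<->U) cost less than
// transversions. Placeholder values, NOT the weights from Vert et al.
double type_weight(char from, char to) {
    if (from == to) return 0.0;
    auto is_purine = [](char c) { return c == 'A' || c == 'G'; };
    return is_purine(from) == is_purine(to) ? 0.5 : 1.0;
}

// Weighted edit distance. position_weight[i] scales a substitution at
// 0-based position i of a; positions past the end of the table use 1.0.
// Insertions and deletions keep a flat cost (gap_cost), an assumption.
double weighted_distance(const std::string& a, const std::string& b,
                         const std::vector<double>& position_weight,
                         double gap_cost = 1.0) {
    auto pos_w = [&](std::size_t i) {
        return i < position_weight.size() ? position_weight[i] : 1.0;
    };

    std::vector<double> prev(b.size() + 1), curr(b.size() + 1);
    for (std::size_t j = 0; j <= b.size(); ++j) prev[j] = j * gap_cost;

    for (std::size_t i = 1; i <= a.size(); ++i) {
        curr[0] = i * gap_cost;
        for (std::size_t j = 1; j <= b.size(); ++j) {
            const double substitution =
                prev[j - 1] + pos_w(i - 1) * type_weight(a[i - 1], b[j - 1]);
            curr[j] = std::min({substitution, prev[j] + gap_cost, curr[j - 1] + gap_cost});
        }
        std::swap(prev, curr);
    }
    return prev[b.size()];
}
```

With these placeholder values, an A-to-G mismatch at a position weighted 2.0 contributes 1.0, while a U-to-C mismatch at a position weighted 0.5 contributes 0.25. Because a deletion plus an insertion can replace any substitution, a single mismatch never costs more than twice gap_cost; raising gap_cost for equal-length strands makes the measure behave like a position-weighted mismatch count.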

Data

Data downloaded from siRecords [2]
Used only data for siRNA targeting HEK (human embryonic kidney) mRNAs
Four levels of efficacy:
  4 = Very High
  3 = High
  2 = Medium
  1 = Low

Modified algorithm

2. http://sirecords.umn.edu/siRecords/download_data.php

Results

Issues with the algorithm

Vert's weight data was collected from both murine and human sources

Future Work

Incorporate thermodynamic data from Vert into the algorithm for additional accuracy

Acknowledgements