team conoscenza bioinformatics tan jian wei ~ tan fengnan

11
Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

Upload: eric-flowers

Post on 03-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

Team ConoscenzaBioinformatics

Tan Jian Wei ~ Tan Fengnan

Page 2: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

Presentation FlowBackgroundProblemSolutionTechnical DifficultiesMilestonesQuestions

Page 3: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

Background

Technical Difficulties Milestones Questions

Background Problem Solution

Human Genome Project◦Began 1990 – Ended at 2004◦Mapped out all of the Human

Genome Sequences◦20,000 – 25,000 “working” gene◦3.3 Billion base pairs

Adenine, Thymine, Guanine, CytosineThe base pairs will form proteins or

hormones after a process known as “Transcription”

Page 4: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

Background Part 2

Technical Difficulties Milestones Questions

Background Problem Solution

Why do we want to do that?◦We don’t actually know what this particular

gene does.• Useful gene denotes a functionality or a trait• By comparing with isolated protein, we can

find out where is exactly these genes are and does.• Example: Insulin

Insulin -> isolated from human Do a base pair match against a genome database

of a plant Find out whether the plant can be candidate to

produce a insulin substitute

Page 5: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

Background Part 3

Technical Difficulties Milestones Questions

Background Problem Solution

How do we do this?◦United States National Center For

Bioinformatics Information◦BLAST (Basic Local Alignment

Sequencing Tool)

Page 6: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

Background Part 4

Technical Difficulties Milestones Questions

Background Problem Solution

AACGTTTCCAGTCCAAATAGCTAGGC===--=== =-===-==-====== AACCGTTC TACAATTACCTAGGC

Hits(+1): 18Misses (-2): 5Gaps (existence -2, extension -1):

1 Length: 3Score = 18 * 1 + 5 * (-2) – 2 – 2 =

6

Page 7: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

ProblemHow do we process the

information here?How do researchers make sense

out of it? E.g. 7000 recordsIs there a better way to use the

information here in a more aggregated context?

Can this information be shared among other users?

Technical Difficulties Milestones Questions

Background Problem Solution

Page 8: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

SolutionTo present the data in a way that

is relevant to the bio informatics context

Visualizing the data in a very intuitive way

Technical Difficulties Milestones Questions

Background Problem Solution

Page 9: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

Technical DifficultiesDeploying ex serverUnderstanding what the terms in

the data meanTransforming data into a useful

format for analysis◦Have to first analyze researchers’

difficulty

Technical Difficulties Milestones Questions

Background Problem Solution

Page 10: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

MilestonesWiki write up and proposalTesting current NCBI systemData cleaningBuilding Panopticon solutionDeploying EX Server

Technical Difficulties Milestones Questions

Background Problem Solution

Page 11: Team Conoscenza Bioinformatics Tan Jian Wei ~ Tan Fengnan

Questions

Technical Difficulties Milestones Questions

Background Problem Solution