ol(j.-, 5 t3 q l( fd. contentsodin.ces.edu.co/contenidos_web/41028584.pdf · genetics of...
TRANSCRIPT
~ (ro, Ol(J.-,
5 t3q L(
l Fd.
Contents
Abbreviations
Preface
Supplementary learning aids
Before we start - Intelligent use of the Internet
PART ONE: The Basics
Chapter 1 DNA structure and gene expression
1.11.1.1L1.2
Building blocks and chemical bonds in DNA, RNA and polypeptides
DNA, RNA and polypeptides are large polymers defined by a linear sequence of simple repeating unitsCovalent bonds confer stability; weaker noncovalent bonds facilitate intermolecular associationsand stabilize structure
DNA structure and replicationThe structure ofDNA is an antiparallel double helix
Box 1.1 Examples of the importance of hydrogen bonding in nudeic acids and proteins
DNA replication is semi-conservative and synthesis ofDNA strands is semi-discontinuous
The DNA replication machinery in mammalian cells is complex
Box 1.2 Major dasses of proteins used in the DNA replication machinery
Viral genomes are frequendy maintained by RNA replication rather than DNA replication
RNA transcription and gene expression
The f]ow of gene tic information in cells is almost exclusively one way: DNA-RNA- proteinOnly a small fraction of the DNA in complex organism~ is expressed to give a protein or RNAproduct
During transcription genetic information in some DNA segments (genes) specifies RNA
Cis-acting regulatory elements and trans-acting transcription factors are required in eukaryoticgene expression
Tissue-specific gene expression involves selective activation of specific genes
RNA processing
RNA splicing removes nonessential RNA sequences from the primary transcriptSpecialized nucleotides are added to the 5' and 3' ends of most RNA polymerase 11transcripts
Translation, post-translational processing and protein structure
During translation mRNA is decoded on ribosomes to specifY the synthesis of polypeptidesThe genetic code is degenerate and not quite a universal codePost-translational modifications include chemical modifications of some amino acids and
polypeptide cleavage
Protein secretion and intracellular export is controlled by specific localization signals or bychemical modifications
Protein structure is highly varied and not easily predicted from the amino acid sequence
1.21.2.1
1.2.21.2.3.
1.2.4
1.31.3.11.3.2
1.3.3
1.3.4
1.3.5
1.4L4.11.4.2
1.51.5.11.5.21.5.3
1.5.4
Chapter 2 Chromosome structure and function
1.5.5
2.1
2.22.2.1
Ploidy and the cell cyde
Structure and function of chromosomes
Packaging of DNA into chromosomes requires multiple hierarchies of DNA folding
XXl1l
XXVl1
xxviii
XXIX
1
3
44
6
88
10
1010
12
13
1313
1516
1719
191922
232325
26
2829
33
34
3435
vi I CONTENTS
Chapter 3
2.2.22.2.3
2.2.42.2.52.2.6
2.32.3.12.3.22.3.3
2.42.4.1
Individual chromosomes occupy nonoverlapping territories in an interphase nucleusChromosomes as functioning organelles: the pivotal role of the centromere
Box 2.1 The mitotic spindle and its components
Chromosomes as functioning organelles: origins of replicationChromosomes as functioning organelles: the telomeresHeterochromatin and euchromatin
Mitosis and meiosis are the two types of ceO divisionMitosis is the normal form of cell division
Meiosis is a specialized form of cell division giving rise to sperm and egg cellsX-Y pairing and the pseudoautosomal regions
Studying human chromosomesMitotic chromosomes can be seen in any dividing cell, but meiotic chromosomes are hard to study inhumans
2.4.2
Box 2.2 Chromosome banding
Molecular cytogenetics: chromosome FISH
Box 2.3 Human chromosome nomenclature
2.4.3
2.52.5.12.5.2
Chromosome painting, molecular karyotyping and comparative genome hybridization
Chromosome abnormalities
Types of chromosomal abnormalityNumerical chromosoma] abnorma]ities involve gain or ]oss of complete chromosomes
Box 2.4 Nomenclature of chromosome abnormalities
Structural chromosomal abnormalities result from misrepair of chromosome breaks or from
malfunction of the recombination system
Apparently normal chromosomal complements may be pathogenic if they have the wrongparental origin
2.5.3
2.5.4
Cells and development
3.13.1.13.1.23.1.3
The structure and diversity of ceOs
Prokaryotes and eukaryotes represent the fundamental division of cellular life formsCell size and shape can vary enormously, but rates of ditTusion fix some upper limit~In multicellular organisms, there is a fundamental distinction between somatic cells and the germ line
3536
37
3839
40
40404144
44
44
48
48
49
49
51
5152
53
54
57
59
60606161
Box 3.1 IntraceOular organization of animal cells 62
Box 3.2 The cytoskeleton: the key to ceO movement and cell shape and a major frameworkfor intracellular transport 64
3.1.43.1.5
3.23.2.13.2.2
In multicellular organisms, no two cells carry exactly the same DNA sequenceCells from multicellular organisms can be studied in situ or in culture
CeO interactions
Communication between cells involves the perception of signaling molecules by specific receptors
Activated receptors initiate signal transduction pathways that may involve enzyme cascades or secondmessengers, and result in the activation or inhibition of transcription factorsThe organization of cells to form tissues requires cell adhesionThe extracellular matrix provides a scatTold for all tissues in the body and is also an important
source of signals that control cell behavior
An overview of development
The specialization of ceOs during developmentCell specialization involves an irreversible series of hierarchica] decisions
Box 3.3 Animal models of development
The choice between alternative fates may depend on lineage or position
Box 3.4 Twinning in human enibryos
3.2.33.2.4
3.3
3.43.4.1.
3.4.2
64
65
6666
6770
70
71
7272
73
73
74
CONTENTS I Vil
3.4.3 Stem cells are se!f-renewing progenitor cells
Box 3.5 Where our tissues come from - the developmental hierarchy in mammals
Box 3.6 The diversity of human cells
3.4.4.3.4.5.3.4.6
3.53.5.13.5.23.5.33.63.6.1
A variety of tissue stem cells are known to exist but much remains to be learned about them
Embryonic stem (ES) cells have the potential to form any tissueThe differentiation potential of tissue stem cells is controversial
Pattern formation in development
Emergence of the body plan is dependent on axis specification and polarizationHomeotic mutations reveal the molecular basis of positional identityPattern formation often depends on signal gradientsMorphogenesis
Morphogenesis can be driven by changes in cell shape and size
Box 3.7 Polarizing the mammalian embryo - signals and gene products
Major morphogenetic changes in the embryo result from differential cell affinityCell proliferation and programmed cell death (apoptosis) are important morphogenetic mechanisms
Early human development: fertilization to gastrulation
Fertilizatíon activates the egg and brings together the nuclei of sperm and egg to form a uniqueindividual
Cleavage partitions the zygote into many smaller cells
Only a small percentage of the cells in the early mammalian embryo gives rise to the mature organismlmplantation
Gastrulation is a dynamic process whereby cells of the epiblast give rise to the three germ layers
Box 3.8 Extra-embryonic membranes and the placenta
Box 3.9 Sex determination: genes and the environment in development
Neural development
The nervous system deve!ops after the ectoderm is induced to differentiate by the underlyingmesoderm
Pattern formation in the neural tube involves the coordinated expression of genes along two axesNeuronal differentiation involves the combinatorial activity of transcription factors
Conservation of developmental pathways
Many human diseases are caused by the failure of normal developmental processesDevelopmental processes are highly conserved at both the single gene leve! and the level ofcomplete pathways
3.6.23.6.3
3.73.7.1
3.7.23.7.33.7.43.7.5
3.83.8.1
3.8.23.8.3
3.93.9.13.9.2
Chapter 4 Genes in pedigrees and populations
4.1
4.24.2.14.2.2
Monogenic versus multifactorial inheritance
Mendelian pedigree patterns
Dominance and recessiveness are properties of characters, not genesThere are five basic Mendelian pedigree patterns
Box 4.1 Characteristics of the Mendelian patterns ofinheritance
The mode of inheritance can rarely be defined unambiguously in a single pedigreeOne gene-one enzyme does not imply one gene-one syndromeMitochondrial inheritance gives a recognizable matrilineal pedigree pattern
Box 4.2 The complementation test to discover whether two recessive characters aredetermined by allelic genes
Complications to the basic Mendelian pedigree patternsCommon recessive conditions can give a pseudo-dominant pedigree patternFailure of a dominant condition to manifest is called nonpenetranceMany conditions show variable expression
For imprinted genes, expression depends on parental originMale lethality may complicate X-linked pedigreesNew mutations often complica te pedigree interpretation, and can lead to mosaicism
4.2.34.2.44.2.5
4.34.3.14.3.24.3.34.3.44.3.54.3.6
74
75
76
777879
798080808181
82
8285
86
8687888888
89
93
94
949495
9797
98
101
102
102102102
104
104105106
106
106106106107109109109
viii I CONTENTS
4.44.4.14.4.2
Genetics of multifactorial characters: the polygenic-threshold theorySome historyPolygenic theory of quantitative traits
Box 4.3 Two common misconceptions about regression to the mean
Box 4.4 Partitioning of variance
Polygenic theory of discontinuous characters
Counseling in non-Mendelian conditions uses empiric risks
Factors affecting gene frequencies
There can be a simple relation between gene frequencies and genotype frequenciesGenotype frequencies can be used (with caution) to calculate mutation rates
111111
112
114
115
115116
117
117
117
Box 4.5 Hardy-Weinberg equilibrium genotype frequencies for aBele frequenciesp(Al) and q (A2) 117
Box 4.6 The Hardy- Weinberg distribution can be used (with caution) to calculate carrier
frequencies and simple risks for counseling 118
Box 4.7 Mutation-selection equilibrium 118
4.5.3 Heterozygote advamage can be much more important than recurrent mutation for determiningthe frequency of a recessive disease.
4.4.34.4.4
4.5
4.5.14.5.2.
Box 4.8 Selection in favor of heterozygotes for CF
Chapter 5 Amplifying DNA: PCR and cell-based DNA cloning
5.1
5.25.2.1
The importance of DNA cloning
PCR: basic features and applicationsPrincipies of baÚc PCR and reverse transcriptase (RT) PCR
Box 5.1 A glossary ofPCR methods
PCR has two major limitations: short sizes and low yields of productsGeneral applications of PCR
Some PCR reactions are designed to permit multiple amplification products and to amplii)' previouslyuncharacterized sequences
Principies of cell-based DNA cloning
An overview of cell-based DNA doning
Box 5.2 Restriction endonucleases and modification-restriction systems
118
119
121
122
123123
124
125127
128
129129
129
5.3.2 Restriction endonudeases enable the target DNA to be cut into manageable pieces which canbe joined to similarly cut vector molecules 130
5.3.3 lntroducing recombinant DNA into recipient cells provides a method for fractionating a complexstarting DNA population 133
5.3.4 DNA libraries are a comprehensive set of DNA dones representing a complex starting DNApopulation 133
5.3.5 Recombinant screening is often achieved by insertional inactivation of a marker gene 135
5.2.25.2.35.2.4
5.35.3.1
5.45.4.1
Box 5.3 Nonsense suppressor mutations
Box 5.4 The importance of sequence tagged sites (STSs)
Cloning systems for amplif)ring different sized fragmentsStandard plasmid vectors provide a simple way of doning small DNA fragments in bacterial (andsimple eukaryotic) cells
Lambda and cosmid vectors provide an efficient means of doning moderately large DNAfragments in bacteria] cells
Large DNA fragments can be doned in bacteria] cells using vectors based on bacteriophage Pland F factor plasmids
Yeast artificial chromosomes (YACs) enable doning of megabase fragments
Cloning systems for producing single-stranded and mutagenized DNA
5.4.2
5.4.3
5.4.4
5.5
138
138
138
139
140
142143
144
CONTENTS IIX
5.5.1 Single-stranded DNA for use in DNA sequencing is obtained using M13 or phagemid vectors orby linear PCR amplification
Oligonucleotide mismatch muragenesis can create a predetermined single nucleotide change inany cloned gene
PCR-based mutagenesis includes coupling of desired sequences or chemical groups to a targetsequence and site-specific mutagenesis
Cloning systems designed to express genes
Large amounes of protein can be produced by expression cloning in bacterial cel!sPhage display is a form of expression cloning in which proteins are expressed on bacterial cel! surfacesEukaryotic gene expression is carried out with greater fidelity in eukaryotic cel! lines
Box 5.5 Transferring genes into cultured animal cells
5.5.2
5.5.3
5.6
5.6.15.6.25.6.3
Chapter 6 Nucleic acid hybridization: principies and applications
6.16.1.16.1.2
Preparation of nucleic acid probes
Nucleic acids can conveniently be labeled in vitro by incorporation of modified nucleotidesNucleic acids can be labeled by isotopic and nonisotopic methods
Box 6.1 Principies of autoradiography
6.2
6.2.1Principies of nucleic acid hybridization 161
Nucleic acid hybridization is a method for identifYing closely related moleculcs within two nucleic acidpopulations 161
Box 6.2 Fluorescence labeling and detection systems 164
The kinetics of DNA reassociation are defined by the product of DNA concentration and time (Co,) 164
Box 6.3 A glossary of nucleic acid hybridization 166
A wide variety of nucleic acid hybridization assays can be used 167
Nucleic acid hybridization assays using cloned DNA probes to screen uncloned nucleicacid populations 168
Dot-blot hybridization, a rapid screening method, often employs al!ele-specific oligonucleotide probes 168
6.2.2
6.2.3
6.3
6.3.1
6.3.2
Box 6.4 Standard and reverse nucleic acid hybridization assays
Southern and Northern blot hybridizations detect nucleic acids that have been size-fractionated by gelelectrophoresis
Pulsed field gel electrophoresis extends Southern hybridization to include detection of very largeDNA molecules
In in situ hybridization probes are hybridized to denatured DNA of a chromosome preparation orRNS of a tissue section fixed on a glass slide
Hybridization assays using cloned target DNA and microarrays
Colony blot and plaque lift hybridization are methods for screening separated bacterial colo niesor plaques
Gridded high density arrays of transformed cel! clones or DNA clones has greatly increased theefficiency of DNA library screening
DNA microarray technology has enormously extended the power of nucleic acid hybridization
6.3.3
6.3.4
6.46.4.1
6.4.2
6.4.3
Chapter 7 Analyzing DNA and gene structure. variation and expression
7.17.1.1
Sequencing and genotyping DNA
Standard DNA sequencing involves enzymatic DNA synehesis using base-specific dideoxynucleotidechain terminators
7.1.27.1.3
Box 7.1 Producing single-stranded DNA sequencing templates
Automated DNA sequencing and microarray-based re-sequencing
Basic genotyping of restriction site polymorphisms and variable number of tandem repeatpolymorphisms
Identifying gene s in cloned DNA and establishing their structure7.2
144
146
146
147147
150150
152
155
156156157
159
169
169
171
172
174
174
175175
181
182
182
182
183
183
186
x I CONTENTS
PART TWO:
Box 7.2 Common dasses of DNA polymorphism which are amenable to simple genotypingmethods 187
7.2.17.2.27.2.37.2.4
7.37.3.1
Exon trapping identifies expressed sequences by using an artificial RNA splicing assay
cDNA selection identifies expressed sequences in genomic dones by heteroduplex formationAchieving full-Iength cDNA sequences: overlapping done sets, and RACE-PCR amplificationMapping transcription start sites and defining exon-intron boundaries
Studying gene expressionPrincipies of expression screening
Box 7.3 Database homology searching
Hybridization-based gene expression analyses: from single gene analyses to whole genome expressionscreemng
PCR-based gene expression analyses: RT -PCR and mRNA differential displayProtein expression screens typically use highly specific antibodies
Box 7.4 Obtaining antibodies
Autofluorescent protein tags provided a powerful way of tracking subcellular localization of proteins
7.3.2
7.3.37.3.4
7.3.5
Chapter 8 Genome projects and model organisms
The human genome and its relationship to other genomes
8.18.1.18.1.2
The ground-breaking importance of geno me projectsGenome projects prepared the way for systematic studies of the Universe within
The medical and scientific benefits of the genome projects are expected to be enormous
Box 8.1 A genomics glossary
Background and organization of the Human Genome ProjectDNA polymorphisms and new DNA doning technologies paved the way for sequencing our genomeThe Human Genome Project was mainly conducted in large genome centers with high-throughputsequencing capacity
How the human genome was mapped and sequencedThe first useful human genetic maps were based on microsatellite markers
Box 8.2 Human gene and DNA segment nomendature
Box 8.3 Major milestones in mapping and sequencing the human genome
8.28.2.18.2.2
8.38.3.1
8.3.2 The first high resolution physical maps of the human genome were based on done contigs and STSlandmarks 213
Box 8.4 Hybrid cell mapping 215
8.3.3 The final stage of the Human Genome Project was crucially dependent on BAC/PAC done contigs 217
Box 8.5 Physical mapping by building done contigs 218
8.3.4 The first high density human gene maps were based on EST markers 220
8.3.5
Box 8.6 Co-operation, competition and controversy in the genome projects
The draft human genome sequence suggested 30000-35000 human genes, but getting a precisetotal is difficult
The final stages of the Human Genome Project: gene annotation and gene ontologyAnalyses of human genome sequence variation are important for anthropological andmedical research
Without proper safeguards, the Human Genome Project could lead to discrimination againstcarriers of disease genes and to a resurgence of eugenics
Genome projects for model organismsThere is a huge diversity of prokaryotic genome projects
The S.cerevisiaegenome project was the first of many successfulprotist genome projects
Box 8.7 Model unicellular organisms
The Caenorhabditiselegansgenome project was the first animal genome project to be completed
8.3.68.3.7
8.3.8
8.48.4.18.4.2
8.4.3
187188188189
190190
192
193197198
200
202
205
207
208208208
209
210210
210
212212
212
213
220
221224
225
225
226226226
227
228
CONTENTS I xi
8.4.4 Metazoan genome projects are mostly focusing on models of development and disease 229
Box 8.8 Model multicellular animals for understanding development, disease and genefunction 230
Chapter 9 Organization of the human genome
9.19.1.1
9.1.2
General organization of the human genomeAn overview of the human genome
The mitochondrial geno me consists of a small circular DNA duplex which is densely packedwith genetic information
Box 9.1 Genome copy number variation in human cells
Box 9.2 The limited autonomy of the mitochondrial genome
The nuclear genome consists of 24 different DNA molecules corresponding to the 24 different humanchromosomes
The human genome contains about 30000-35 000 unevenly distributed genes but precise numbersare uncertain
9.1.3
9.1.4
9.2
9.2.1
Box 9.3 DNA methylation and CpG islands
Organization, distribution and function of human RNA genesA total of about 1200 human genes encode rRNA or tRNA and are mostly organized intolarge gene clusters
Small nuclear RNA and small nucleolar RNA are encoded by mostly dispersed, moderately large genefamilies
9.2.2
9.2.3
Box 9.4 Anticodon specificity of eukaryotic cytoplasmic tRNAs
MicroRNAs and other novel regulatory RNAs are challenging preconceptions on the extent of RNAfunction
9.39.3.19.3.2
Organization, distribution and function of human polypeptide-encoding genesHuman genes show enormous variation in size and internal organizationFunctionally similar genes are occasionally clustered in the human genome, but are more oftendispersed over different chromosomes
Box 9.5 Human genome and human gene statistics
Overlapping genes, genes-within-genes and polycistronic transcription units are occasionally foundin the human genome
Polypeptide-encoding gene families can be classified according to the degree and extent of sequencerelatedness in family members
Genes in human gene families may be organized into small clusters, widely dispersed or both
Pseudogenes, truncated gene copies and gene fragments are commonly found in multigene familiesHuman proteome classification has begun but the precise functions of many human proteins remainuncertain
9.3.3
9.3.4
9.3.59.3.69.3.7
9.49.4.1
Tandemly repeated noncoding DNA
Satellite DNA consists of very long arrays of tandem repeats and can be separated from bulk DNAby density gradient centrifugationMinisatellite DNA is composed of moderately sized arrays of tandem repeats and is often locatedat or close to telomeres
Microsatellite DNA consists of short arrays of simple tandem repeats and is dispersed throughoutthe human genome
Interspersed repetitive noncoding DNA
Transposon-derived repeats make up > 40% of the human genome and mostly arose through RNAintermediates
Some human UNE 1 elements are actively transposing and enable transposition of SINES, processedpseudogenes and retrogenes
Alu repeats occur more than once every 3 kb in the human genome and may be subject topositive selection
9.4.2
9.4.3
9.59.5.1
9.5.2
9.5.3
239
240240
241
242
243
244
245
246
247
247
249
249
250
253253
254
255
256
257259262
265
265
265
267
268
268
268
270
270
Xli I CONTENTS
Chapter 10 Human gene expression 275
10.1 An overview of gene expression in human ceUs 276
Box 10.1 Spatial and temporal restriction of gene expression in mammalian cells 276
10.2 Control of gene expression by binding of trans-acting protein factors to cis-acting regulatorysequences in DNA and RNA 277
10.2.1 Hisrone modificarion and chromatin remodeling facilirare access ro chromarin by DNA-binding facrors 27810.2.2 Ubiquirous rranscriprion facrors are required for rranscriprion by RNA polymcrases l and 1Il 279
10.2.3 Transcriprion by RNA polymerase II requires complex sers of cis-acring regularory sequences andrissue-specific rranscriprion facrors 280
10.2.4. Transcriprion facrors conrain conserved srrucrural motifs rhar permir DNA binding 282
Box 10.2 Classes of cis-acting sequence elements involved in regulating transcription ofpolypeptide-encoding genes 283
10.2.5
10.2.6
10.310.3.110.3.210.3.3
lOA
10.4.1
10.4.2
10.4.3
10.510.5.110.5.210.5.310.5.4
10.5.5
10.5.6
10.610.6.1
10.6.2
10.6.3
A variery of mechanisms permir rranscriptional regulation of gene expression in response roexrernal srimuli
Translarional control of gene expression can involve recognirion of UTR regularory sequences byRNA-binding proreins
AIternative transcription and processing of individual genesThe use of alternarive promorers can generare rissue-specific isoforms
Human genes are prone ro alrernarive splicing and alrernative polyadenylationRNA ediring is a rare form of processing whereby base-specific changes are inrroduced into RNA
Box 10.3 AIternative splicing can alter the functional properties of a protein
285
288
291291292293
293
Differential gene expression: origins through asymmetry and perpetuation throughepigenetic mechanisms such as DNA methylation 294
Selecrive gene expression in cells of mammalian embryos mosr likely develops in response ro shorrrange cell-cell signaling evenrs 295
DNA merhylarion is an imporrant epigeneric facror in perperuaring gene repression in verrebrare cells 295Animal DNA merhylation may provide defense against transposons as well as regulating geneexpression 297
Long range control of gene expression and imprinting 298Chromarin structure may exert long-range control over gene expression 298
Expression of individual genes in gene clusters may be co-ordinared by a common locus conrrol region 299Some human genes show selective expression of only one of the rwo parental alleles 300Genomic imprinting involves ditTerences in rhe expression of alleles according to parent of origin 301
Box lOA Mechanisms resulting in monoallelic expression from biaUelic genes in human cells 302
Box 10.5 The nonequivalence of the maternal and paternal genomes 302
The mechanism of genomic imprinring is unclear but a key component appears ro be DNAmethylation 303
X chromosome inactivation in mammals involves very long range cis-acting repression of geneexpression 305
The unique organization and expression of Ig and TCR genes 306DNA rearrangements in B and T cells generate cell-specific exons encoding Ig and TCR variablere~ns ~8
Heavy chain class switching involves joining of a single VD) exon ro alternative constant regiontranscription unirs 309
The monospecificiry of Igs and TCRs is due to allelic and light chain exclusion 310
Chapter 11 Instability of the human genome: mutation and DNA repair 315
11.1 An overview of mutation, polymorphism, and DNA repair
11.2 Simple mutations
11.2.1 Mutations due to errors in DNA replication and repair are frequenr
Box 11.1 Classes of genetic polymorphisms and sequence variation
316
316316
317
11.311.3.111.3.2
11.3.3
11.411.4.111.4.211.4.3
11.4.4
11.511.5.1
11.5.2
11.5.3
11.5.411.5.511.5.6
11.611.6.1
11.6.2
11.6.3
CONTENTS I
The frequency of individual base substitutions is nonrandom according to substitution class
The frequency and spectrum of mutations in coding DNA differs from that in noncoding DNA
Box 11.2 Mechanisms that affect the population frequency of aUeles
The location ofbase substitutions in coding DNA is nonrandom
Box 11.3 Classes of single base substitution in polypeptide-encoding DNA
11.2.5 Substitution rates vary considerably between different genes and between different gene components11.2.6 The substitution rate can vary in differcnt chromosomal regions and in different lineages
11.2.211.2.3
11.2.4
Box 11.4 Sex differences in mutation rate and the question of male-driven evolution
Genetic mechanisms which result in sequence exchanges between repeatsReplication slippage can cause VNTR polymorphism at short tandem repeats (microsatellites)
Large units of tandemly repcated DNA are prone to insertion/ deletion as a result of unequalcrossover or unequal sister chromatid exchangesGene conversion events may be rclatively freql1cnt in tandemly repetitive DNA
Pathogenic mutationsThere is a high deleterious mutation rate in hominids
The mitochondrial genome is a hotspot for pathogenic mutationsMost splicing mutations alter a conserved sequence nceded for normal splicing, but some occurin sequences not normally required for splicingMutations that introduce a premature termination codon often result in unstable mRNA butother outcomes are possible
The pathogenic potential of repeated sequences
Slipped strand mispairing of short tandem repeats predisposes to pathogenic deletions andframeshifting insertions
Unstable expansion of short tandem repeats can cause a variery of diseases but the mutationalmechanism is not well understood
Tandemly repeated and clustered gene families may be prone to pathogenic unequal crossover andgene conversion-like events
lnterspersed repeats often predispose to large deletions and duplications
Pathogenic inversions can be produced by intrachromatid recombination between inverted repeatsDNA seql1ence transposition is not uncommon and can cause disease
DNA repair
DNA repair usually involves cutting out and resynthesizing a whole area ofDNA surrounding thedamage
DNA repair systems share components and processes with the transcription and recombinationmachinery
Hypersensitivity to agents that damage DNA is often the result of an impaired cellular responseto DNA damage, rather than defective DNA repair
Chapter 12 Our place in the tree of life
12,112.1.1
12.1.2
12.1.312.1.412.1.5
Evolution ofgene structure and duplicated genes
Spliceosomal introns probably originated from group 11introns and first appeared in early eukaryoticcells
Complex genes can evolve by intragenic duplication, often as a result of exon duplication
Box 12.1 Intron groups
Exon shuffiing can bring together new combinations of protein domains
Gene duplication has played a crucially important role in the evolution of multicellular organismsThe globin superfamily has evolved by a process of gene duplications, gene conversions, and genelosslinactivation
Box 12.2 Symmetrical exons and intron phases
Box 12.3 Gene duplication mechanisms and paralogy
12.1.6 Retrotransposition can permit exon shuffiing and is an important contributor to gene evolution
X1l1
318318
319
320
321
322323
326
329329
329329
331332333
334
336
337
337
337
339340342343
344
345
345
347
351
352
352352
353
353354
354
355
357
360
XIV I CONTENTS
12.212.2.1
12.2.212.2.312.2.4
12.2.5
12.2.6
12.2.7
12.2.8
12.312.3.112.3.2
12.3.312.3.4
12.412.4.112.4.2
Evolution of chromosomes and genomesThe mitochondrial genome may have originated following endocytosis of a prokaryotic cell by a
eukaryotic cell precursor
Box 12.4 The universal tree of life and horizontal gene transfer
Reduced selection pressure caused the mitochondrial genetic code to divergeThe evolution of vertebrate genomes may have involved whole genome duplicationThere have be en numerous major chromosome rearrangements during the evolution of mammalian
genomesSegmental duplication in primate lineages and the evolutionary instability of pericentromeric andsubtelomeric sequencesThe human X and Y chromosomes exhibit substantial regions of sequence homology, including
common pseudoautosomal regionsHuman sex chromosomes evolved from autosomes and diverged due to periodic regional
suppression of recombinationSex chromosome differentiation results in progressive Y chromosome degeneration and Xchromosome inactivation
Molecular phylogenetics and comparative genomicsMolecular phylogenetics uses sequence alignments to construct evolutionary trees
New computer programs align large scale and whole genome sequences, aiding evolutionary analysesand identification of conserved sequencesGene number is generally proportional to biological complexityThe extent of progressive protein specialization is being revealed by proteome comparisons
What makes us human?What makes us different from mice?
What makes us different from our nearcst relatives, the great apes?
Box 12.5 A glossary of common metazoan phylogenetic groups and terms
12.5 Evolution of human populations12.5.1 Genetic evidence has suggested a recent origin of modern humans from African populations12.5.2 Human genetic diversity is low and is mostly due to variation within populations rather than
between them
Box 12.6 Coalescence analyses
PART THREE: Mapping and identifying disease genes and mutations
Chapter 13 Genetic mapping of Mendelian characters
13.113.1.113.1.213.1.313.1.413.1.5
13.213.2.113.2.2
Recombinants and nonrecombinants
The recombination fraction is a measure of genetic distanceRecombination fractions do not exceed 0.5 however great the physical distance
Mapping functions define the relationship between recombination fraction and genetic distanceChiasma counts and total map lengthPhysical vs. genetic maps: the distribution of recombinants
Genetic markers
Mapping human disease genes requires genetic markersThe heterozygosity or polymorphism information content measure how informative a marker is
Box 13.1 The development of human genetic markers
13.2.3. DNA polymorphisms are the basis of all current genetic markers
Box 13.2 Informative and uninformative meioses
13.313.3.1
13.3.2
Two-point mappingScoring recombinants in human pedigrees is not always simple
Computerized lod score analysis is the best way to analyze complex pedigrees for linkagebetween Mendelian characters
Box 13.3 Calculation of lod scores for the families in Figure 13.6
361
361
362
363363
364
366
367
368
371
372372
374375376
377378381
383
385385
387
389
395
397
398398398399399400
402402402
403
403
404
404404
405
406
13.3.313.3.4
13.413.4.1
13.4.2
13.4.3
13.513.5.113.5.2
13.613.6.113.6.213.6.313.6.413.6.5
CONTENTS I xv
Lod scores of +3 and -2 are the criteria for linkage and exdusion (for a single test)For whole genome searches a genome-wide threshold of significance must be used
Multipoint mapping is more efficient than two-point mappingMultipoint linkage can loca te a disease locus on a framework of markersMarker framework maps: the CEPH families
Box 13.4 Bayesian calculation of linkage threshold
Multipoint disease-marker mapping
Fine-mapping using extended pedigrees and ancestral haplotypesAutozygosity mapping can map recessive conditions efficiently in extended inbred families
Identifying shared ancestral segments allowed high-resolution mapping of the loci for cysticfibrosis and Nijmegen breakage syndrome
Standard lod score analysis is not without problemsErrors in genotyping and misdiagnoses can generate spurious recombinantsComputational difficulties limit the pedigrees that can be analyzedLocus heterogeneity is always a pitfall in human gene mappingMeiotic mapping has limited resolution
Characters whose inheritance is not Mendelian are not amenable to mapping by the methodsdescribed in this chapter
Chapter 14 IdentifYing human disease genes
14.1
14.214.2.114.2.214.2.3
14.314.3.114.3.214.3.314.3.4
14.3.5
14.414.4.1
PrincipIes and strategies in identifYing disease genes
Position-independent strategies for identifYing disease genes
Identifying a disease gene through knowing the protein productIdentifying the disease gene through an animal model
Identification of a disease gene using position-independent DNA sequence knowledge
Positional doning
The first step is to define the candidate region as tightly as possibleA contig of dones must be established across the candidate regionA transcript map defines all genes within the candidate regionGenes from the candidate region must be prioritized for mutation testing
406406
407407407
407
408
408408
409
411411
412413413
413
415
416
416416418418
418419419420421
Box 14.1 Transcript mapping: laboratory methods that supplement database analysis foridentifYing expressed sequences within genomic dones 421
The special relevance of mouse mutants
Use of chromosomal abnormalities
Patients with a balanced chromosomal abnormality and an unexplained phenotype are interesting
Box 14.2 Mapping mouse genes
422
423423
423
14.4.2 Patients with two Mendelian conditions, or a Mendelian condition plus mental retardation, may have achromosomal deletion 425
14.5
14.5.1
14.5.2
14.6
14.6.114.6.214.6.314.6.414.6.5
Box 14.3 Pointers to the presence of chromosome abnormalities
Box 14.4 Position effects - a pitfall in disease gene identification
Confirming a candidate geneMutation screening to confirm a candidate gene
Box 14.5 CGH for detecting submicroscopic chromosomal imbalances
Once a candidate gene is confirmed, the next step is to understand its function
Eight examples iIIustrate various ways disease genes have been identified
Direct identification of a gene through a chromosome abnormality: Sotos syndromePure transcript mapping:Treacher Collins syndrome
Large-scale sequencing and search for homologs: branchio-oto-renal syndromePositional candidates defined by function: rhodopsin and fibrillin
A positional candidate identified through comparison of the human and mouse maps: PAX3and Waardenburg syndrome
426
427
428428
428
429
429429430430
431
431
xvi I CONTENTS
14.6.614.6.714.6.S
Inference from function i/1 vilr,,: Fanconi anemia
Inference from function i/1vivo: myosin 15 and DFNB3 deafness
Inference from the exprcssion pattern: otoferlin
15.1
Chapter 15 Mapping and identifying genes conferring susceptibility to complex diseases
Deciding whether a non-Mendelian character is genetic: the role of family, twin andadoption studiesThe A value is a measure of familial clusteringThe importance of shared family environmentTwin studies sutTer from many limitations
Adoption studies: the gold standard for disentangling genetic and environmental factors
Segregation analysis allows analysis of characters that are anywhere on the spectrumbetween purely Mendelian and purely polygenicBias of ascertainment is often a problem with family data: the example of autosomal recessiveconditions
Complex segregation analysis is a general method for estimating the most likely mix of geneticfactors in pooled family data
15.3 Linkage analysis of complex characters15.3.1 Standard lod score analysis is usually inappropriate for non-Mcndelian characters
Box 15.1 Correcting the segregation ratio
Non-paramctric linkage analysis does not require a genetic model
Shared segment analysis in families: atTected sib pair and atTected pcdigree member analysisThresholds of significance are an important consideration in analysis of complex diseases
Association studies and linkage disequilibriumWhy associations happen
Association is in principIe quite distinct from linkage, but whcre the family and the populationmerge, linkage and association merge
Box 15.2 Measures of linkage disequilibrium
Many studies show islands of linkage disequilibrium separated by recombination hotspotsDcsign of association studiesBox 15.3 The transmission disequilibrium test (TDT) to determine whether marker allele
M1 is associated with a disease
15.4.5 Linkage and association: complcmentary techniques
15.1.115.1.215.1.315.1.4
15.2
15.2.1
15.2.2
15.3.215.3.315.3.4
15.4
15.4.115.4.2
15.4.315.4.4
15.5 Identifying the susceptibility alleles
15.615.6.1
15.6.215.6.3
15.6.4
15.6.5
15.6.615.6.715.6.H
15.7
15.7.115.7.2
431431431
435
436436436436437
437
43H
43H
439439
439
440441442
442442
443
443
444445
446
447
447
Box 15.4 Sample sizes needed to find a disease susceptibility locus by a whole genomescan using either atTected sib pairs (ASP) or the transmission disequilibrium test (TDT) 447
Eight examples iIIustrate the varying success of genetic dissection of complex di seas esBreast cancer: identifying a Mendelian subset has led to important medical advances, but do es notcxplain the causes of the common sporadic diseaseHirschsprung disease: an oligogenic diseaseAlzheimer diseasc: genetic factors are important both in the common late-onset form and in the rare
Mendelian early-onset forms, but they are ditTerent genes, acting in ditTerent waysType 1 diabetes mellitus: still the geneticist's nightmare?
Ethics Box 1 Alzheimer disease, ApoE testing and discrimination
Type 2 diabetes: two susceptibility factors, one so common as to be undetectable by linkage; theother very complex and in certain populations onlyIntlammatory bowcl disease: a clear-cut susceptibility gene identifiedSchizophrenia: the special problems of psychiatric or behavioral disordersObesity: genetíc analysis of a quantitative trait
Overview' andsummaryWhy is it so difficult?
If it all works out and we idcntify susceptibility alleles-then what?
448
44H
450
450451
452
453455455456
457457457
CONTENTS I xvii
Chapter 16 Molecular pathology
16.1
16.216.316.3.1
16.3.316.3.4
16.416.4.1
16.4.216.4.3
16.4.4
16.516.5.116.5.216.5.3
Introduction
The convenient nomenclature of d and 1! alleles hides a vast diversity of DNA sequencesA first classification of mutations is into 10ss of function vs. gain of function mutationsFor molecular pathology, the important thing is not the sequence of a mutant allele but its etTect
Box 16.1 The main classes of mutation
Box 16.2 Nomenclature for describing sequence changes
16.3.2 Loss of function is likely when point mutations in a gene produce the same pathological changeas dcletions
Box 16.3 A nomenclature for describing the effect of an allele
Gain of function is likely when only a specific mutation in a gene produces a given pathologyDeciding whether a DNA sequence change is pathogenic can be difficult
Loss of function mutations
Many ditTerent changes to a gene can cause loss of function
Box 16.4 Hemoglobinopathies
Box 16.5 Guidelines for assessing the significance of a DNA sequence change
In haploinsufficiency a 50% reduction in the level of gene function causes an abnormal phenotypeMutations in proteins that work as dimers or multimers sometimes produce dominant negativeetTects
Epigenetic modification can abolish gene function even without a DNA sequence change
Gain of function mutations
Acquisition of a novel function is rare in inherited disease but common in cancer
Overexpression may be pathogenicQualitative changes in a gene product can cause gain of function
461
462462462462
462
463
463
463
464465
465465
465
466
467
469469
469469470471
16.6 Molecular pathology: from gene to disease 471
16.6.1 For loss of function mutations the phenotypic etTect depends on the residuallevcl of gene function 471
Box 16.6 Molecular pathology ofPrader-Willi and Angelman syndromes 472
16.6.216.6.316.6.416.6.516.6.6
16.716.7.116.7.216.7.3
16.7.4
16.816.8.116.8.2
Loss of function and gain of function mutations in the same gene will cause ditTerent diseasesVariability within families is evidence of modifier genes or chance etTectsUnstable expanding repeats - a novel cause of disease
Protein aggregation is a common pathogenic mechanism in gain of function diseases
For mitochondrial mutations, heteroplasmy and instability complicate the relationship betweengenotype and phenotype
Molecular pathology: from disease to geneThe gene underlying a disease may not be the obvious oneLocus heterogeneity is the rule rather than the exception
Mutations in ditTerent l11embers of a gene family can produce a series of related or overlappingsyndromes
Clinical and molecular classifications are alternative tools for thinking about diseases, and each isvalid in its own sphere
Molecular pathology of chromosomal disorders
Microdcletion syndromes bridge the gap between single gene and chrol11osol11alsyndrol11es
The major etTects of chromosomal aneuploidies may be caused by dosage imbalances in a fewidentifiable genes
Chapter 17 Cancer genetics
17.1
17.2
17.317.3.1
Introduction
The evolution of cancer-;~-J~' UNr~ER§mÁDCES
,¡¡;j'1l11 ,,-;:-_. -. 1 ..,.~ vil \~omprv;iÚS(J "0;. la 1::..w.:(/encwOncogenes
The history of oncogenesBIBLIOTECAFUNDADORES
474475476478
478
478479479
479
480
480480
483
487
488
488
489489
XVIll I CONTENTS
Box 17.1 Two ways of making a series of successive mutations more likely
17.3.2 The functions of oncogenes17.3.3 Activation of proto-oncogenes
17.4
17.4.117.4.217.4.3
17.517.5.117.5.217.5.317.5.4
17.617.6.1
17.717.7.117.7.2
17.8
Tumor suppressor genes
The retinoblastoma paradigm
Loss of heterozygosity (LoH) screening is widely used for trying to identify TS gene locationsTumor suppressor genes are often silenced epigenetically by methylation
Stability of the genomeChromosomal instabilityDNA repair defects and DNA-level instabilityHereditary nonpolyposis colon cancer and microsatellite instabilityp53 and apoptosis
Control of the cell cycleThe Gl-S checkpoint
Integrating the data: pathways and capabilitiesPathways in colorectal cancer
A successful tumor must acquire six specific capabilities
What use is all this knowledge?
Chapter 18 Genetic testing in individuals and populations
18.1
18.2
18.318.3.118.3.218.3.318.3.418.3.518.3.6
18.418.4.1
18.4.218.4.318.4.4.
18.518.5.1
18.5.218.5.3
18.5.4
18.618.6.1
18.6.2
18.6.3
18.718.7.118.7.218.7.3
Introduction
The choice of material to test: DNA, RNA or protein
Scanning a gene for mutations
Methods based on sequencing
Methods based on detecting mismatches or heteroduplexesMethods based on single-strand conformation analysisMethods based on translation: the protein truncation testMethods for detecting deletionsMethods for detecting DNA methylation patterns
Testing for a specified sequence changeMany simple methods are available for genotyping a specified variant
Box 18.1 Multiplex amplifiable probe hybridization (MAPH)
Methods for high-throughput genotypingGenetic testing for triplet repeat diseases
Geographical origin is an important consideration for some tests
Gene trackingGene tracking involves three logical steps
Box 8.2 1\vo methods for high-throughput genotyping
Recombination sets a fundamentallimit on the accuracy of gene trackingCalculating risks in gene tracking
Box 18.3 The logic of gene tracking
The special problems of Duchenne muscular dystrophy
Population screeningAcceptable screening programs must fit certain criteria
Box 18.4 Use of Bayes' theorem for combining probabilities
Specificity and sensitivity measure the technical performance of a screening testOrganization of a genetic screening program
DNA profiling can be used for identifYing individuals and determining relationships
A variety of ditferent DNA polymorphisms have been used for profilingDNA profiling can be used to determine the zygosity of twinsDNA profiling can be used to disprove or establish paternity
489
490490
492492497497
497497499499
500
501501
502502503
504
509
510
510
511511511512513513514
515516
518
519519521
521521
524
524524
527
528
529529
529
530531
532532534534
CONTENTS I XIX
18.7.4 DNA profiling is a powerful tool for forensic investigations 535
Box 18.5 The Prosecutor's FaUacy 535
PART FOUR: New horizons: into the 21st century
Chapter 19 Beyond the genome project: functional genomics, proteomics and bioinformatics
19.119.1.1
19.1.2
19.1.3
19.1.4
19.219.2.119.2.219.2.319.2.419.2.5
19.319.3.1
19.3.2
19.3.3
19.3.4
19.3.5
19.419.4.119.4.2
19.4.3
19.4.419.4.5.
19.4.6.
19.4.719.4.8
19.5
An overview of functional genomicsThe information obtained from the structural phase of the Human Genome Project is of limiteduse without functional annotation
The functions of individual genes can be described at the biochemical, cellular and whole-organismlevels
Functional relationships among genes must be studied at the levels of the transcriptome andproteome
Box 19.1 The function of glucokinase
High-throughput analysis techniques and bioinformatics are the enabling technologies offunctional genomics
Functional annotation by sequence comparison
Tentative gene functions can be assigned by sequence comparisonConsensus search methods can extend the number of homologous relationships identifiedSimilarities and ditTerences between genomes indica te conserved and functionally important sequencesComparative genomics can be exploited to identifY and characterize human disease genesA stubborn minority of genes resist functional annotation by homology searching
Global mRNA profiling (transcriptomics)Transcriptome analysis reveals how changes in patterns of gene expression coordinate the biochemicalactivities of the cell in health and disease
Direct sequence sampling is a statistical method for determining the relative abundances ofditTerent transcripts
Box 19.2 Sequence sampling techniques for the global analysis of gene expression
DNA microarrays use multiplex hybridization assays to measure the abundances of thousands oftranscripts simultaneously
The analysis of DNA array data involves the creation of a distance matrix and the clustering of relateddatapoints using reiterative algorithms
DNA arrays have been used to study global gene expression in human celllines, tissue biopsies andanimal disease models
Proteomics
Proteomics encompasses the analysis of protein expression, protein structure and protein interactionsExpression proteomics has tlourished through the combination of two major technology platforms: two-dimensional gel electrophoresis (2DGE) and mass spectrometry
Box 19.3 Protein chips
Box 19.4 Mass spectrometry in proteomics
Expression proteomics has been used to study changes in the proteome associated with diseaseand toxicity
Protein structures provide important functional informationThere are many ditTerent ways to study individual protein interactions
Box 19.5 Determination of protein structures
High throughput interaction screening using library-based methods
Box 19.6 Structural classification of proteins
The challenge of interaction proteomics is to assemble a functional interaction map of the cell
lnformation abol1l protein interactions with smallligands can improve our understanding ofbiomolecular processes and provides a rational basis for the design of drugs
Summary
537
539
540
540
540
540
541
541
541541543543544545
545
545
546
547
548
550
552
553553
554
554
557
558559562
563
564
567
571
572
572
xx I CONTENTS
Chapter 20 Genetic manipulation of cells and animals
20.1
20.220.2.1
20.2.2
20.2.320.2.4
20.2.520.2.620.2.7
An overview of gene transfer technology
Principies of gene transfer
Gene transfer can be used to introduce new, functional DNA sequences into cultured animalcells either transiently or stably
The production of transgenic animals requires stable gene transfer to the germ line
Box 20.1 Methods of gene transfer to animal cells in culture
Box 20.2 Selectable markers for animal cells
Box 20.3 Isolation and manipulation of mammalian embryonic stem cells
The control of transgene expression is an important consideration in any gene transfer experimentGene transfer can also be used to produce defined mutations and disrupt the expression ofendogenous genes
Gene targeting allows the production of animals carrying defined mutations in every cellSite-specific recombination allows conditional gene inactivation and chromosome engineeringTransgenic strategies can be used to inhibit endogenous gene function
20.3 Using gene transfer to study gene expression and function
20.3.1 Gene expression and regulation can be investigated using reporter genes
575
576
576
576577
578
579
582
584
586588589591
594594
Box 20.4 Reporter genes for animal cells 595
20.3.2 Gene function can be investigated by generating loss-of-function and gain-of-function mutations andphenocopies 595
20.3.3 The large scale analysis of gene function by insertional mutagenesis and systematic RNA interferenceare cornerstones of functional genomics 597
20.420.4.120.4.2
20.4.3
20.4.4
20.4.520.4.6
Box 20.5 Sophisticated vectors used for insertional mutagenesis 599
Creating disease models using gene transfer and gene targeting technology 599Modeling disease pathogenesis and drug treatment in cell culture 599
It may be difficult to identify animal disease models generated spontaneously or induced by randommutagenesis 600
Mice have been widely used as animal models of human disease largely because specific mutations can becreated at a predetermined locus 602
Loss-of-function mutations can be modeled by gene targeting, and gain-of-function mutations by theexpression of dominant mutant genes 602
Box 20.6 The potential of animals for modeling human disease 603
1ncreasing attention is being focused on the use of transgenic animals to model complex disorders 604Mouse models of human disease may be difficult to construct because of a variety of human/ mousedifferences 605
Chapter 21 New approaches to treating disease
21.1
21.2
21.3
21.3.121.3.221.3.321.3.4
21.4
21.521.5.1
Treatment of genetic disease is not the same as genetic treatment of disease
Treatment of genetic disease
Using genetic knowledge to improve existing treatments and develop new versions ofconventional treatments
Pharmacogenetics promises to increase the effectiveness of drugs and reduce dangerous side effectsDrug companies have invested heavily in genomics to try to identify new drug targetsCell-based treatments promise to transform the potential of transplantationRecombinant proteins and vaccines
Ethics Box 1 The ethics of human cloning
Principies of gene therapy
Methods for inserting and expressing a gene in a target cell or tissue
Genes can be transferred to the recipient cells in the laboratory (ex vivo) or within the patient'sbody (in vivo)
609
610
610
610
610
611
611
613
614
616
616
616
CONTENTS I xxi
21.5.2 Constructs may be designed to integrate into the host ceIl chromosomes or to remain as episomes 616
Ethics Box 2 Germ line versus soma tic gene therapy 617
Box 21.1 1995 NIH Panel report on gene therapy (Orkin-Motulsky report) 619
Ethics Box 3 Designer babies 619
21.5.321.5.4
21.621.6.121.6.221.6.321.6.4
21.721.7.121.7.221.7.321.7.421.7.5
Glossary
Disease index
Index
Viruses are the most commonly used vectors for gene therapy
Nonviral vector systems avoid many of the safety problems of recombinant viruses, but gene transferrates are generaIly low
Methods for repairing or inactivating a pathogenic gene in a ceIl or tissueRepairing a mutant aIlele by homologous recombinationInhibition of translation by antisense oligonucleotidesSelective destruction or repair of mRNA by a ribozymeSelective inhibition of the mutant aIlcle by RNA interference (RNAi)
Some examples of attempts at human gene therapy
The first definite success: a cure for X-linked severe combined immunodeficiencyAttempts at gene therapy for cystic fibrosisAttempts at gene therapy for Duchenne muscular dystrophyGene therapy for cancerGene therapy for infectious disease: HIV
619
622
624624624
625625
625626626627628628
631
645
647