fundamentals of genomic selection

34
Fundamentals of Genomic Selection Jack Dekkers Animal Breeding & Genetics Department of Animal Science Iowa State University

Upload: others

Post on 22-Jun-2022

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Fundamentals of Genomic Selection

Fundamentalsof

Genomic Selection

Jack DekkersAnimal Breeding & Genetics

Department of Animal ScienceIowa State University

Page 2: Fundamentals of Genomic Selection

Past and CurrentSelection Strategies

Genes Phenotype

Environment

selection

EstimatedBreeding

Value

Phenotypeof relatives

Black box of

Quantitative genetics

h2

Page 3: Fundamentals of Genomic Selection

CSU 2006 – Departmental Seminar (3) Cole 20042006

This approach has been very successfulfor many traits

US Holsteins - Milk production

Holstein birth year

Bree

ding

val

ue (

kg)

Phenotypic base = 11,638 kg

-3500-3000-2500-2000-1500-1000-500

0500

1000

1960 1970 1980 1990 2000

sires cows

AIPL

Page 4: Fundamentals of Genomic Selection

. . . . . not for others . . . .

US Holsteins – Daughter Pregnancy Rate

-2

-1

0

1

2

3

4

5

1960 1970 1980 1990 2000

Holstein birth year

Bree

ding

val

ue (

%)

Phenotypic base = 21.53%

sires

cows

AIPL

Page 5: Fundamentals of Genomic Selection

produce~80

daughters

and has important limitationsE.g. Need to select Bulls by Progeny Test

Progeny-testevaluation

Superior progeny tested bull

XEmbryoTransfer

Which is best??

5 years and

$$$$ later

Limitations:-Long generation intervals-High cost of progeny test-Difficult to improve

low heritable traits(fertility, disease resistance)

Page 6: Fundamentals of Genomic Selection

‘70 – ‘00: Promise of Molecular Genetics

QQ

Qq

qq

GG

GA

AA

Find major genesor

markers linked to QTL

and use these for Marker-Assisted

Selection

Meanweight (kg)

105

100

95Effe

ct ‘G

’ alle

le =

+5

= ef

fect

of #

G a

llele

s

Page 7: Fundamentals of Genomic Selection

Use of MAS to enhance Selection

Genes

Major genesMarkers

QTL

Phenotypicdata

Molecular data

Marker-Assisted Selection

Molec. genetics

• Expressed in both sexes• Expressed at early age• Requires less phenotypic data

Page 8: Fundamentals of Genomic Selection

Potential gains from MAS in livestockMeuwissen & Goddard, 1996 (GSE)

QTL with 1/3 of genetic variance haplotype-marked

h2=.27

Phenotyped before selectionPhenotyped after selection

Sex-limited traitCarcass trait

1 2 35

1

2

3

4

6462

55

3938 37

31

21

38

30

25

159

54

20

10

20

30

40

50

60

70

Generation

MAS is most beneficial for

‘difficult’ traits

Trait characteristic

Page 9: Fundamentals of Genomic Selection

3 types of marker loci for MASFunctional mutations

- known genesQ

q

Direct markers

LD-markers

LE-markers

- in pop.-wide Linkage Disequilibriumwith QTL

Marker-QTL linkage phase ~consistent

across population

Q

Q

G

G

Q

q

G

A

q

q

A

A

- used on a within-family basis- in pop.-wide Linkage Equil. with QTL

Sire 1 Sire 2 Sire 3 Sire 4Q

q

G

A

q

Q

G

A

Q

Q

G

A

q

q

G

A

Marker-QTL linkage phase NOT consistent across families

Page 10: Fundamentals of Genomic Selection

Many markers and QTL have been reported but few have been utilized

Examples of gene tests in commercial

breedingD = dairy cattleB = beef cattleC = poultryP = pigsS = sheep

Dekkers, 2004, J.Anim.Sci

Trait Direct marker LD marker LE marker BLAD (D) Congenital

defects Citrulinaemia (D,B) DUMPS (D) CVM (D) Maple syrup urine (D,B) Mannosidosis (D,B) RYR (P) RYR (P) Appearance CKIT (P) Polled (B) MC1R/MSHR (P,B,D) MGF (B) Milk quality κ-Casein (D) β-lactoglobulin (D) FMO3 (D) Meat quality RYR (P) RYR (P) RN/PRKAG3 (P) RN/PRKAG3 (P) A-FABP/FABP4 (P) H-FABP/FABP3 (P)

CAST (P, B) >15 PICmarqTM (P) THYR (B) Leptin (B) Feed intake MC4R (P) Disease Prp (S) B blood group (C) F18 (P) K88 (P) Reproduction Booroola (S) Booroola (S) Inverdale(S) ESR (P) Hanna (S) PRLR (P) RBP4 (P)

MC4R (P) CAST (P) QTL (P) Growth & composition IGF-2 (P) IGF-2 (P) Myostatin (B) QTL (B) Callipyge (S) Carwell (S)

DGAT (D) PRL (D) QTL (D) Milk yield & composition GRH (D) κ-Casein (D)

Trait Direct markers LD markers LE markers

Page 11: Fundamentals of Genomic Selection

Reasons for limited use of MAS in livestock

• # markers available is limited• Markers only explain limited % of genetic variance

• Only QTL with moderate – large effects detected• Genotyping costs• Marker/QTL effects are not consistent /

not transferable to commercial breeding populations• ‘Beavis’ effect – effects of ‘significant’ markers

tend to be overestimated• Marker effects were estimated within families

or in experimental crosses • Interactions of marker/QTL effects with genetic

background and / or environment• Inconsistent marker-QTL LD across populations

Page 12: Fundamentals of Genomic Selection

High-through-putSNP genotyping

Since 2000: A Revolution in Molecular Technology

NOW AVAILABLE:Illumina Bovine 50k Beadchip

50,000 DNA tests for <$250

Genomic Selection

International Swine GenomeSequencing Consortium

Nature 20042.8 million SNPs

SingleNucleotidePolymorphisms

+ discovery of manySingleNucleotide SNPsPolymorphisms

AAGCCTTGATAATT

AAGCCTTGCTAATT

maternalpaternal

Page 13: Fundamentals of Genomic Selection

How to use high-density SNP data?

Statistical Analysisto detect QTL / estimate SNP effects

Use only significant SNPs

for MAS

Genotype large # ofIndividuals

for large numbers of SNPs+ collect their phenotypes

Allows detecting more LD markersbut still suffers from only using

significant markers• Small effects are missed• Beavis effect

Page 14: Fundamentals of Genomic Selection

Solution: Genomic selectionGenetic Evaluation using high-density SNPs

• All SNPs are fitted simultaneously, i.e. 50,000 vs. 1 at a time• SNP effects are fitted as random vs. fixed effects

• enables all SNPs to be fitted simultaneously• shrinks SNP effect estimates to 0 depending on evidence from data

Meuwissen et al. 2001 Genetics

Estimates of SNP effects βk

yi = µ + Σ βk gik + eiSNP k

Implemented using a variety ofBayesian methods (Bayes-A, -B, -C)Or by using genomic vs. pedigreerelationships in animal model BLUP (GBLUP)

^ Use to estimate breeding value of new

animals based on genotypes alone

Genomic EBV = Σ βk gik^

Page 15: Fundamentals of Genomic Selection

Example Genomic EBV based on 3 SNPs

with estimated effects (β for # A alleles) of:+10 for SNP 1+ 5 for SNP 2–10 for SNP 3

SNP 1

SNP 2

SNP 3

Individual Genotype Value Genotype Value Genotype Value

Genomic Breeding

Value 1 AA 10 AA 5 AA -10 5 2 AA 10 AA 5 BB 10 25 3 AB 0 BB -5 AB 0 -5 4 AB 0 BB -5 AA -10 -15 5 BB -10 AA 5 AB 0 -5

^

Page 16: Fundamentals of Genomic Selection

Estimate marker effectsGenotype

for >50,000 SNPs

Phenotype

Genotypefor >50,000

SNPs

Predict BV from marker genotypes at

early age

Genotypefor >50,000

SNPs

Training data

Predict BV from marker genotypes at

early age

Select at young age

Genomic selectionGenetic Evaluation using high-density SNPs

Meuwissen et al. 2001

Page 17: Fundamentals of Genomic Selection

Genomic EBV have greater reliability for young bulls and heifers than Parent Average EBV

E.g. for Young Holstein Bulls (VanRaden and Tooker, 2009 USDA-AIPL)

ftp://aipl.arsusda.gov/pub/outgoing/GenomicReliability0608.doc

TraitGain over parent average

reliability (~39%)Net merit + 23Milk yield + 32Fat yield + 36Protein yield + 28Productive life + 33Dtr. Pregancy rate + 20

Page 18: Fundamentals of Genomic Selection

How will Genomic Selectionchange Dairy Cattle Breeding?

• AI Studs will market young bulls / bull teamsselected on Genomic EBV

• These young bulls will be from ET flushes of heifers contract-

mated to young bullsselected on Genomic EBV

X• Need for progeny-testing will decrease

Page 19: Fundamentals of Genomic Selection

Resulting shorter generation intervalscan increase genetic gains

US Holsteins - Milk production

Holstein birth year

Bree

ding

val

ue (

kg)

Phenotypic base = 11,638 kg

-3500-3000-2500-2000-1500-1000-500

0500

1000

1960 1970 1980 1990 2000

sires cows

??

Page 20: Fundamentals of Genomic Selection

Greater Reliabilities for Functional Traitscan reverse declines in health, fertility

US Holsteins – Daughter Pregancy Rate

-2

-1

0

1

2

3

4

5

1960 1970 1980 1990 2000

Holstein birth year

Bree

ding

val

ue (

%)

Phenotypic base = 21.53%

sires

cows

??

Page 21: Fundamentals of Genomic Selection

Semensam-ples

Costs to Identify Superior Bulls could decline

5 yrs&

$$$$$$later

Superior genome-tested young bull

X

Which is best??

Superior progeny-tested bull

Illumina Bovine 50k Beadchip

DNAsam-ples

< 6 mo&$$

later

EmbryoTransfer

Page 22: Fundamentals of Genomic Selection

Genomic Selection reduces the need to obtain phenotypes on selection candidates or their close

relatives to obtain accurate EBV

Instead, marker effects can be estimated on the target individuals in the target environment

E.g. Genomic Selection for Commercial Crossbred Performance (Dekkers, 2007, J.Anim.Sci.)

Main Premise of Genomic Selection

Page 23: Fundamentals of Genomic Selection

Current Pyramid Selection Programs(e.g. poultry and pigs)

Sireline

Crossbreds in Production herds

Multiplier

Damline

Multiplier

Fieldenvironment

High healthenvironment

G x E

Selection is primarily on phenotypes obtained on pure lines in high-biosecurity environments.

But target is improvement of crossbreds in field environment.NUCLEUS

herds

Page 24: Fundamentals of Genomic Selection

Selection for Performance in Field‘Traditional’ Breeding Solution:

Sireline

Crossbreds in Production herds

Multiplier

Purebreddata

Collect phenotypes on relatives in field

Fiel

d d

ata

on

rela

tives

Requirements / limitations:- Costly logistics – Pedigree-based

phenotyping in field- Longer generation intervals- Higher rates of inbreeding

- use of family vs. own phenotype

Combined Crossbred-Purebred Selection

Page 25: Fundamentals of Genomic Selection

Selection for Performance in FieldMolecular Breeding Solution:

Sireline

Crossbreds in Production herds

Multiplier

Genomic selection based on SNP effects estimated in field

Genotypes

SNP effectestimates

GenomicEBV

Advantages:- Does not require pedigree-

based phenotypes- Opportunities to reduce

generation intervals- Opportunities to select for

traits not observed in nucleus (disease, mortality)

GenotypePhenotype

GenomicSelection

Page 26: Fundamentals of Genomic Selection

Summary / ConclusionsGenomic Selection offers unique opportunities

to enhance breeding programsby removing limitations on when, where, and on whom

phenotypes are recorded• Not requiring phenotypes on individual/close relatives• Selection on marker effects estimated on target individuals (crossbreds)

and in target environment (field)

• With opportunities to• reduce generation intervals• reduce inbreeding

• Select for ‘difficult’ traits • Address GxE

• Genomic selection of pure-bredsfor field performance of crossbreds

Simulation results are very promisingInitial empirical results are encouraging

Page 27: Fundamentals of Genomic Selection

Acknowledgements

Rohan FernandoDorian Garrick

Iowa State University

Paul VanRaden and colleaguesUSDA-AIPL

Page 28: Fundamentals of Genomic Selection

AB&G SummerShort Courses

Use of High-Density SNP Genotyping for Genetic Improvement of Livestock

June 1-10, 2009Dekkers, Fernando, Garrick

Page 29: Fundamentals of Genomic Selection

Line 1 Line 2

F1 F1x

x

F2F2 x

F3F3 x MAS..

QTL detectionEstimation of marker effects

y = marker genotype + BV + e

MAS

Marker-assisted composite line development

10-20 cM marker

distance sufficient

Pyiasatian, Fernando, Dekkers. 2007. Genomic selection for marker-

assisted improvement in line crosses. TAG. 115: 665-674.

Page 30: Fundamentals of Genomic Selection

QTL detection, MA-Evaluation, and Selection

• Select on jk

QTL

iijki uMS ˆ*

1+∑

=

β

yjk = generationk + ∑=

++QTL

ijkjkijki euMS

1*β • Model:

# marker allelesfrom line 1 in interval i

Significant intervals

Page 31: Fundamentals of Genomic Selection

Extra response (%) over BLUPDetected QTL explain 18% of gen.var.

20 cM

-60

-40

-20

0

20

40

60

80

F3 F4 F5 F6 F7 F8 F9 F10 F11Generation

CR

(%ve

rsus

BLU

P)

BLUP

Presenter
Presentation Notes
1) This graph shows the effects of genetic models and thresholds on cumm.response (CR) and cumm. discounted response (CDR). 2) The x-axis represents generation, starts from F3 and the y-axis represents the CR (as percentage of superiority to BLUP). 3) With the threshold = 0.05, the analysis model including marker info and polygenic effects gave higher CR than models without� polygenic effects. 4) Using estimates for marker scores gave higher CR, meaning that regression estimates represent QTL effects better than the number� of marker alleles. In all the following graphs, also make the lines thicker and get rid of the grid lines, except the line at 0. Done.
Page 32: Fundamentals of Genomic Selection

• Select on ∑=

90

1*

iijki MSβ

yjk = generationk +∑=

+90

1*

ijkijki eMSβ • Model:

# marker allelesfrom line 1 in interval i

βi fixed orrandom

All intervals

Genomic selection

(Meuwissen et al. ‘01)

Marker-Assisted Evaluation and Selection

Page 33: Fundamentals of Genomic Selection

50% -ve QTL Effects, 20 cM

-50

0

50

100

150

200

F3 F4 F5 F6 F7 F8 F9 F10 F11Generation

Cum

ulat

ativ

e Re

spon

se

(% o

ver B

LUP)

R11F11Known QTL

BLUP

BLUP

Sign.QTL

Page 34: Fundamentals of Genomic Selection

0.40

0.50

0.60

0.70

0.80

0.90

1.00

0 0.24 0.74 1.22 1.71 2.32Average QTL effect

Fav

ora

ble

QT

L f

req

uen

cy

0.40

0.50

0.60

0.70

0.80

0.90

1.00

0 0.24 0.74 1.22 1.71 2.32Average QTL effect

Favo

rab

le Q

TL

fre

qu

en

cy R11

F11

Known QTL

BLUP

F5

F11