joachim frank

49
Joachim Frank 1. The Quest for High(er) Resolution 2. SNR and SSNR Estimates for EM Noise Processes HHMI, Health Research, Inc., Wadsworth Center ting April 1, 2008: Department of Biochemistry and Molecular Biophy and Department of Biology, Columbia University

Upload: talon

Post on 05-Jan-2016

33 views

Category:

Documents


1 download

DESCRIPTION

Joachim Frank. 1. The Quest for High(er) Resolution 2. SNR and SSNR Estimates for EM Noise Processes. HHMI, Health Research, Inc., Wadsworth Center Starting April 1, 2008: Department of Biochemistry and Molecular Biophysics - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Joachim Frank

Joachim Frank

1. The Quest for High(er) Resolution

2. SNR and SSNR Estimates for EM Noise Processes

HHMI, Health Research, Inc., Wadsworth CenterStarting April 1, 2008: Department of Biochemistry and Molecular Biophysics and Department of Biology, Columbia University

Page 2: Joachim Frank

Before going into the subject matter, a few introductory slides on single-particle reconstruction.

(“Single particle” here means “single, unattached” – the number of particles or molecules is normally huge)

For details on this technique, see J. Frank, Three-Dimensional Electron Microscopy of Macromolecular Assemblies, Oxford University Press 2006

Page 3: Joachim Frank

This is the kind of data we are dealing with in “single-particle reconstruction”

Page 4: Joachim Frank

Once a reference is established, orientation determination can be done by 3D projection matching

Page 5: Joachim Frank

Projection matching normally starts with 83 projections on the half-sphere

Page 6: Joachim Frank

Angular Refinement, by Iterative 3D Projection Matching

Subsequently, angular refinement is achieved by iteration, with smaller and smaller angular increments.

Page 7: Joachim Frank

(refinement by defocus groups)

Page 8: Joachim Frank

Part of the Bill Baxter’s ctfgroup interface: grouping by defocus

Defocus groups are used for expediency, but they need to be narrow enough so as to avoid blurring of high-res information.

Page 9: Joachim Frank

Reconstruction by Defocus Groups

At the very end, the reconstructions from all the defocus groups are merged in a CTF correction step (by Wiener Filtering)

Page 10: Joachim Frank

Gabashvili et al. (2000) Cell

Cryo-EM 11.5 Å resolution; 73,000 particles

The processing method described is now standard. It was used to obtain this reconstruction of the E. coli ribosome

Page 11: Joachim Frank

Math Problem, in a Nutshell, at Current Resolutions (~10 Å)

Computation of 3D density map:

100,000 x (200 x 200) measurements with SNR ~0.1 – 0.4

# of particles # of pixels

(projections)

200 x 200 x 200 + 5 x 100,000 unknowns

# of voxels # of parameters (in 3D density map) (shifts, angles)

What happens if we want to go to 3 Å resolution?

Page 12: Joachim Frank

Resolution measures & criteria -- Fourier ring/shell correlation

k = spatial frequency

Δk = ring width or shell thickness

*1 2

[ , ]

2 2 1/ 21 2

[ , ]

| (k) (k) |

( , )[ | (k) | | (k) | ]

k k

k k

Re F F

FSC k kF F

Page 13: Joachim Frank

Conservative; SNR =1

R. Henderson, X-ray

Too optimistic

Resolution criteria based on FSC

Page 14: Joachim Frank

Extrapolation to 3 Å --Millions of particles?

We’re facing an obstacle!

Page 15: Joachim Frank

What can be controlled/improved experimentally?

versus

What can be controlled/improved by image processing with a given dataset?

(or: how do we want to spend our time?)

Page 16: Joachim Frank

Processing by defocus groups, or variations in height

These envelope functions show how the final resolution is affected by the width of the defocus range, or by variations in height of particle within the ice

Page 17: Joachim Frank

Question of Window Size

Reconstruction of uncorrected data is done under the assumption that all information is present. But part of the information lies outside the particle, because of the finite width

of the point spread function!

Radius of point spread function has to be accommodated in the window.

Window size = particle diameter + diameter of the point-spread function = point-spread function

associated with the CTF

Page 18: Joachim Frank

Exploration of reconstruction strategy“High-resolution project”

Use small dataset (50,000) to optimize processing, with the idea to switch to larger dataset (130,000)

Parameters of image processing:• Sampling (strategy to switch from coarse to fine, to save CPU)• Window size (to avoid CTF effects)• Angular spacing (how to reduce)• Amplitude correction in each step of refinement vs. at the very end

Final parameters emerging from this exploration: angular step 0.5 degrees, angular search range 2 degrees7 iterations of refinement: 920 hours on a 48-node clusterRegular window size OKSampling (decimation) can be switched mid-way from coarse to fine

Page 19: Joachim Frank

Resolution measurement issues

• Apply soft mask to reconstruction to get true resolution!• Evidence for dependence of resolution R vs. log(N)• Is lin-log dependence general?• Is it allowed to extrapolate from half to full dataset?• Verification of molecular detail: phosphorus atoms show

up as bumps

Page 20: Joachim Frank

“Clutter”

outside

Page 21: Joachim Frank
Page 22: Joachim Frank

To be replaced – first two curves are incorrect! Other curves OK.

Page 23: Joachim Frank

Final reconstructiion, at 7.5 A:

Page 24: Joachim Frank

Progression of resolution, from the beginning to the end of the project.

Page 25: Joachim Frank

E. coli 70S•aa-tRNA•EF-Tu•GDP•kir at 7.5 Å

EF-Tu

130,000 particles 7.5 Å (FSC=0.5)

Page 26: Joachim Frank
Page 27: Joachim Frank

APE

EF-Tu

Page 28: Joachim Frank

Protein S2

X-ray

missing helix

Except for a missing helix at the periphery, protein S2 is exactly as it appears in the X-ray structure.

Page 29: Joachim Frank

Extrapolation of FSC resolution to full set

65,000130,000

Resolution is a linear function of Log(Number of particles) -- Ribosome.

Page 30: Joachim Frank

GroEL (Stagg et al.)

Resolution is a linear function of Log(Number of particles) -- GroEL.

Page 31: Joachim Frank

6.7 Å

Page 32: Joachim Frank

6.7 Å (LeBarron et al., in prep.) 10 Å (Valle et al., NSB 2003)

Page 33: Joachim Frank

Cutoff Cryo-EM X-ray Cryo-EM X-ray

6.7 Å

5.0 Å

6.7 Å

5.0 ÅPhosphorus atomsrecognized as bumps!

Page 34: Joachim Frank

Definition of EF-Tu domains (stereo)

Page 35: Joachim Frank

• SNR and SSNR estimation

• Why?

• We need better noise models for phantoms, to fine-tune classification

• William Baxter

Page 36: Joachim Frank

Given: two images representing independent realizations of a signal embedded in noise. Align the two images, then compute the cross-correlation coefficient. Estimate the SNR α from the cross-correlation coefficient by

α = ρ/(1-ρ) ρ: α:

0 0 1 ∞ 0.5 1

(Frank and Al-Ali, Nature 1975)

Page 37: Joachim Frank

• Compare the images of two molecules that were identified, by 3D projection matching, as having the same orientation.

• These images differ by (i) structural noise, (ii) shot noise, and (iii) digitization noise.

• How to disentangle the three different noise portions?

Page 38: Joachim Frank

• Structural noise:

the irreproducible portion of the object.

(i) Conformational changes

(ii) Superimposed structure of substrate (ice, thin carbon)

Page 39: Joachim Frank

• Experimental design:

• Three experiments:(i) take images of two molecules deemed in the same

orientation (as determined by projection matching)

(ii) take two images of the same field

(iii) digitize the same image of a molecule twice

Page 40: Joachim Frank

These are the kinds of images we are using in the SNR estimation

Page 41: Joachim Frank

Projection matching: two particles falling into the same orientation bin represent the same molecule structure plus “structural noise”

Using projection matching to a template, we can find molecules that nominally have the same orientation, and can bye compared in the SNR estimation.

Page 42: Joachim Frank

SNR estimation by double experiments

Molecules insame view

Same fieldimaged 2x

Same micrographdigitized 2x

Page 43: Joachim Frank

Formula for the case of two noise subprocesses: obtaining SNR of subprocess #I from measured SNRs of complete process (αcomp) and subprocess #2 (αsub2):

1αsub1 = --------------------------------

(1+1/ αcomp )/(1+1/αsub2)-1

Page 44: Joachim Frank

Results of Estimation:

• SNR for digitization noise: 27 [for ZI microdensitometer -- will be different for different scanners]

• SNR for shot noise: 0.09• [Is in the range measured before, e.g. 0.1 (Fu et al. JSB 2007)]

• Note: for CCD recording, digitization and shot noise cannot be separated.

• SNR for structural noise: ~2 [Reasonable, since the “noise” contains all irreproducible

components: conformational differences, thin carbon]

Page 45: Joachim Frank

SSNR Estimates, using the same formula in Fourier space along rings

double scan

double exposure

double structure

The curves show how the SNR is distributed as a function of spatial frequency

Page 46: Joachim Frank

SSNR for “true” film-recorded Shot Noise (averaged over 4 defoci)

Page 47: Joachim Frank

SSNR for “true” Structural Noise (averaged over 3 defoci)

Page 48: Joachim Frank

Outlook

• Atomic resolution soon in reach. How soon is soon? Asymmetric structures much more tough than those with symmetry.

• Efficient classification is key!• Extrapolation of resolution might be possible in

general – does lin-log rule always hold?• Develop strategy with subsets is a good idea!

Page 49: Joachim Frank

People in my group who have contributed to this research:

• William Baxter (SNR)• Robert A. Grassucci (HR)• Jamie LeBarron (HR)• Haixiao Gao (HR)• Tapu Shaikh (HR)• Jayati Sengupta (HR)

• Funding: HHMI, NIH, NCRR

HR = high-resolution project SNR = SNR and SSNR estimation