Title: Measuring gene expression in individual live cells
1Measuring gene expression in individual live
cells
- David A. Ball
- Synthetic Biology Group
- July 14, 2009
2Overview
- Background
- What is gene expression?
- Why do we need to measure proteins in single
cells? - Application to Yeast cell cycle
- Current Measurement System
- Hardware
- Software
- Future Measurements
- New instrument Cyto?IQ
2
3Gene expression
- Transcription
- Genes in DNA copied to messenger RNA
- Translation
- Ribosome converts mRNA codons (3 bases) into
Amino Acids - Amino Acids form protein
3
4Gene expression
- Necessary to characterize both steps
- mRNA (1-10/cell)
- Protein (10-10,000/cell)
5Saccharomyces cerevisiae (budding yeast)
- Well-studied organism
- Genome completely sequenced
- Asymmetrical division
- Population doubles in 100 minutes
- Cell cycle is similar to higher Eukaryotes
5
6Budding yeast cell cycle
7Classic protein measurements
- Synchronize cells
- chemical blocks cells in specific cell cycle
phase - release block
- take measurements of population at regular
intervals - chemical may interfere with process of interest
- Measurements Western Blot
- Kill lots of cells
- Extract all proteins, separate by electrophoresis
- Transfer to membrane, stain with
fluorescent-labeled antibody - Results in mean protein concentration
7
8Cell cycle model
- gt30 equations
- 100 parameters
- Predicts behavior of 100 mutants
Chen et al., Mol Biol Cell 2004
8
9Life is noisy...
- Many sources of variation in cells
- Fluctuation of molecular interactions
- Fluctuations of the cell size
- Distribution of molecules among cells
Need to observe gene expression in individual
cells
Elowitz, et al, Science 2002
9
10Green Fluorescent Protein
- Responsible for bioluminescence of Aequorea
victoria - 1962 Purified GFP
- 1992 DNA Sequence
- 1996 Crystal structure
11The many colors of GFP
- Mutations in DNA sequence cause shifts in color
- Blue, Cyan, Green, Yellow
- Reef Coral produces red fluorescent protein
- Red, Orange, Yellow
- Currently 100 flavors
11
12GFP in other organisms
13Current workflow Hardware
- Computer-controlled microscope
- Point-visiting
- Auto-focus
- Motorized filters shutters
- CCD camera
- Collect 2 images/time-point
- Bright-field
- Fluorescence
- 5 GB of images
13
14Movies
GAL1
CLN2
15Current workflow Software
- Identify individual cells (segmentation)
- Track fluorescence characteristics over time
- Average
- Localized concentration
- Analysis of fluorescence time-series
15
16Software Image processing
- Cell Identification
- Flood-fill
- Locate plateaus
- Erode
- Label Cells
- Dilate
- Cells mapped between frames by calculation of
intersectionunion - Cell-body pixels transferred to fluorescence
images for quantifying FP expression over time
17Example GAL1-YFP
- GAL1 part of the galactose metabolism pathway
- Cells grown overnight in glucose
- Transferred to galactose
- Observed for 8 hours
Track expression in gt400 cells
18Cell cycle regulators
- Obtained GFP-fusions of 17 regulators each in
separate cell-line (Invitrogen) - For each
- Observed 20 regions (500 cells)
- Collected Phase-Contrast Fluorescence Images
- Temporal Resolution 4 min
19Budding yeast cell cycle
20Regulating proteins Conventional wisdom
21Cell cycle regulator CLN2
600 cells
22Cell cycle regulator CLB2
23Cell cycle regulator NET1
24Cell cycle regulator CDC14
25Cells without GFP
26Further analysis Noise reduction
- Signal measurements generally have noise
- Instrument
- System that is measured (cells)
- Need to extract signal from noise
- Regression (curve-fitting)
- Filtering
26
27Sine fitting
- Fit data to a sine function with variable
amplitude, period, and phase - Also include a cubic polynomial to account for
trend
28Data filtering
- Use Fourier analysis to extract frequency
components of an oscillating signal - Find noise floor (N), remove components below (2N)
28
29Cell cycle regulator CLN2
30Cell cycle regulator CLB2
31Cell cycle regulator NET1
32Cell cycle regulator CDC14
33Cells without GFP
Sine fit
34Quantifying localized concentration
- Find pixel with peak intensity
- Check if neighboring pixels are above threshold
- Iterate until no more pixels found above
threshold or cell boundary is hit
35Localization
CDC14
CLN2
CLB2
NET1
36Bud-Bud fluorescence
37Regression Bud-Bud
Phase
CLB2
CLN2
NET1
38Limitations of current workflow
- 2 days between start of experiment output of
image processing - Limited to the use of 3 fluorescent proteins
- Limited number of cells
- Measure fluorescence rather than of proteins
38
39Cyto?IQ
40Features of Cyto?IQ
- Process images in real-time
- Returns statistical model
- Spectral imaging to allow for study of many
proteins simultaneously - Count Protein numbers
- Count mRNA
40
41Features of Cyto?IQ
- Process images in real-time
- Returns statistical model
- Spectral imaging to allow for study of many
proteins simultaneously - Count Protein numbers
- Count mRNA
41
42Spectral Imaging
- Traditional fluorescence imaging use filters to
selectively excite/collect light from FPs - Measure entire spectrum of fluorescence
- Unmix the contributions of each FP
43Spectral Imaging
44Features of Cyto?IQ
- Process images in real-time
- Returns statistical model
- Spectral imaging to allow for study of many
proteins simultaneously - Count Protein numbers
- Count mRNA
44
45Stroboscopic Imaging
- Generally, photos captured with constant light,
and opening/closing shutter - Moving objects produce blur due to finite shutter
speed - To reduce blur
- decrease shutter time
- keep shutter open, flash light
46Single-molecule counting
- Proteins randomly move through cell (D 10
m2/s) - Movement causes blur
- Excite fluorescence with intense, short laser
pulse
47Features of Cyto?IQ
- Process images in real-time
- Returns statistical model
- Spectral imaging to allow for study of many
proteins simultaneously - Count Protein numbers
- Count mRNA
47
48Quantifying mRNA
- Probes designed to bind mRNA
- Series of 4 probes, each with 5 fluorescent tags
- Downsides
- Requires dead cells
- Each set of probes 1000
Zenklusen, et al, Nat Struct Mol Biol, 2008
49Quantifying mRNA
- MS2 mRNA binding protein
- Express MS2-GFP fusion
- Target mRNA must be modified to include binding
motif
Bertrand, et al, Mol Cell, 1998
49
50Cyto?IQ Goals
- Count and track gt10 proteins and mRNAs over time
- Observe gt1000 cells
- Return statistical model to the user
51Summary
- To fully understand gene expression, we must
measure transcription and translation in
individual cells - Able to extract amplitude, period, and phase of
the oscillations of cell-cycle proteins - Current techniques have limitations such as
number of proteins we can observe - Future work will aim at counting expression of up
to 10 genes simultaneously
51
52Acknowledgments
- Synthetic Biology Group
- Jean Peccoud
- Julie Marchand
- Michael Czar
- Patrick Cai
- Matthew Lux
- Sarah Zheng
- Fred Cross
- (Rockefeller University)
John Tyson Kathy Chen William Baumann
NIH Grant 5R01GM078989-02
53Composite movies
- CCD records in grayscale
- Sequentially collect bright-field image and
fluorescence image - Combine 2 images into the R, G, and B channels
53
54Spectral Detection with NanoDrop 3300
- Collected spectra from 4 highly overlapping
fluorescent proteins - EGFP, acGFP, vYFP, and Citrine
- Made mixtures with known combinations, and
unmixed the spectra
Fluorescent Protein Spectra in Living Cells
55Spectral Unmixing
Input Spectrum
acGFP Spectrum
Citrine Spectrum
D
B
C
A
vYFP Spectrum
EGFP Spectrum
- Have m equations (spectral data points),
and n unknowns (coefficients) - If the reference spectra are known, then we can
solve for the coefficients (A,B,C) that produce
the input spectra - Currently using a least-squares minimization
56Preliminary Results
- Mixtures of cells with different fluorescent
proteins
Mix 1
Mix 2
Mix 3
Mix 4
Mix 5