Title: Biology Primer
1Biology Primer
- Basic principles
- Organisms/Cells as Basic Units
- Biochemical Components of Cells
- Genetic processes
- Fundamental Molecular Genetics
BIO520 Bioinformatics Jim Lund
2Phylogeny
3Prokaryotes
4Gram-negative Prokaryotes
5Eukaryotes
What cell compartments are present? What cell
processes are different between prokaryotes and
eukaryotes? What protein families are unique
to eukaryotes? How is gene annotation
different?
6Central Dogma of Molecular Biology
DNA
RNA
Protein
8DNA
- G,A,T,C
- 5' to 3'
- antiparallel double helix
- Base pairs: G-C, A-T
5'-GATTC-3'
3'-CTAAG-5'
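The pairing rules on this slide can be sketched in a few lines of Python. This is a minimal illustration rather than part of the original slides; the names `complement` and `reverse_complement` are made up for the example.

```python
# Watson-Crick pairing: G pairs with C, A pairs with T.
PAIRING = str.maketrans("GATC", "CTAG")

def complement(strand: str) -> str:
    """Return the paired bases in the same left-to-right order (i.e. 3' to 5')."""
    return strand.translate(PAIRING)

def reverse_complement(strand: str) -> str:
    """Return the opposite strand written in the usual 5' to 3' direction."""
    return complement(strand)[::-1]

top = "GATTC"                    # the 5' to 3' strand shown on the slide
print(complement(top))           # CTAAG: bottom strand, read 3' to 5'
print(reverse_complement(top))   # GAATC: the same bottom strand, read 5' to 3'
```

Because the two strands are antiparallel, the bottom strand is usually reported as the reverse complement, not the plain complement.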
9RNA
- Single-stranded polymer
- G,A,C,U
- rRNA (90%)
- tRNA (5-10%)
- mRNA (1-5%)
- informational
- many kinds
- snRNA, siRNA, miRNA, snoRNA
10Other Molecules
- Lipids
- Carbohydrates
- Small molecules
- Precursors, metabolites
Proteins
11Polypeptides
- Polymers of 20 amino acids
- amide linkages
- 100-1000 residues typical
- amino (N) terminus to carboxyl (C) terminus
- encoded by mRNAs
- modified (e.g., phosphate, methyl groups)
Protein = 1 or more polypeptides
12Protein structure
- Primary
- AA sequence
- Secondary
- Alpha helix, Beta sheet
- Tertiary
- 3D structure
- Quaternary
- 3D structure of 2 or more protein chains bound
together.
13JOE AVERAGE Polypeptide
- 350 amino acids
- Rare vs common aas
- Amino acids
- Average MW ≈ 110 Da
Know your aa's
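As a rough worked example, the two numbers on this slide (350 residues, about 110 Da per residue) give an approximate mass for the "Joe Average" polypeptide. The function name `estimate_mass_kda` is hypothetical, and the estimate ignores amino acid composition and modifications.

```python
AVERAGE_RESIDUE_MASS_DA = 110  # average amino acid residue mass from the slide

def estimate_mass_kda(n_residues: int) -> float:
    """Approximate polypeptide mass in kilodaltons from its length alone."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000

print(estimate_mass_kda(350))  # 38.5 (kDa) for the 350-residue example
```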
14Protein lengths
Zhang et al., 2000
15Protein lengths
Zhang et al., 2000
16Proteins
- Enzymes
- Receptors
- Transporters
- Structural components
- Regulatory factors
17Translation Reading Frames
AUGGUACCACC...
Frame 1
MetValProPro..
AUGGUACCACC...
Frame 2
TrpTyrHis....
AUGGUACCACC...
Frame 3
GlyThrThr...
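The three forward reading frames above can be reproduced with a short sketch. It assumes the standard genetic code and builds the codon table from the common compact U, C, A, G ordering; the helper name `translate_frame` is made up for this example. Incomplete trailing codons are dropped, so frame 1 stops after the last full codon.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in U, C, A, G order.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}

def translate_frame(rna: str, frame: int) -> list[str]:
    """Translate complete codons of `rna` starting at frame 1, 2, or 3."""
    start = frame - 1
    codons = [rna[i:i + 3] for i in range(start, len(rna) - 2, 3)]
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codons]

rna = "AUGGUACCACC"
for frame in (1, 2, 3):
    print(f"Frame {frame}:", "".join(translate_frame(rna, frame)))
```

A full six-frame translation would repeat this on the reverse complement of the sequence.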
18Genetic Code
19Long ORFs are RARE
$P(\text{ORF of } n \text{ codons}) = (61/64)^n$
$P(20) = (61/64)^{20} = 0.38$
$P(100) = 0.008$
$P(200) \approx 10^{-4}$
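The probabilities on this slide follow from treating each codon as an independent random draw in which 61 of the 64 codons are not stop codons. A minimal sketch under that assumption (uniform, independent codons, which real genomes only approximate); the function name `p_open` is hypothetical:

```python
SENSE_CODONS = 61   # codons that encode an amino acid
ALL_CODONS = 64     # 61 sense codons + 3 stop codons

def p_open(n_codons: int) -> float:
    """Chance that n random codons in a row contain no stop codon."""
    return (SENSE_CODONS / ALL_CODONS) ** n_codons

for n in (20, 100, 200):
    print(n, p_open(n))
```

The rapid decay is why a long open reading frame in random sequence is unlikely and is therefore useful evidence for a real gene.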
20Prokaryotic Gene Expression
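[Diagram: an operon drawn as Promoter, Cistron 1, Cistron 2, ..., Cistron N, Terminator. RNA polymerase transcribes it into a single mRNA (5' to 3') carrying cistrons 1 to N; each cistron is translated into its own polypeptide (N to C terminus).]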
21Eukaryotic Gene Expression
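[Diagram: a gene with Enhancer, Promoter, Transcribed Region (Exon1, Intron1, Exon2), and Terminator. RNA Polymerase II transcribes the primary transcript (5' to 3'), which is capped (7mG), spliced, and cleaved/polyadenylated (An). The mature mRNA is transported and translated into a polypeptide (N to C terminus).]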
22Joe Average Gene
- Encodes 350 amino acid product
- Eukaryote: 10 kbp
- Exons and introns, typically 5-10
- Enhancer/promoter region
- Prokaryote: 1.5 kbp
- 1 cistron (part of operon)
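A quick calculation shows how much of each "Joe Average" gene is actually protein-coding. This sketch assumes the 10 kbp and 1.5 kbp figures on the slide are the eukaryotic and prokaryotic gene sizes, respectively, and it ignores the stop codon; the function names are made up for the example.

```python
def coding_length_bp(n_amino_acids: int) -> int:
    """Base pairs needed to encode a polypeptide (3 bp per amino acid)."""
    return 3 * n_amino_acids

def coding_fraction(n_amino_acids: int, gene_length_bp: int) -> float:
    """Fraction of the gene occupied by coding sequence."""
    return coding_length_bp(n_amino_acids) / gene_length_bp

print(coding_length_bp(350))          # 1050 bp of coding sequence
print(coding_fraction(350, 10_000))   # about 0.1 for the 10 kbp gene (assumed eukaryotic)
print(coding_fraction(350, 1_500))    # 0.7 for the 1.5 kbp gene (assumed prokaryotic)
```

Under these assumptions, most of the eukaryotic gene is introns and regulatory sequence, while most of the prokaryotic gene is coding.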
23Genome elements
- Genes
- Special regions
- Origins of replication
- Telomeres
- Intergenic Regions
- Enhancers
- Chromosomal attachment sites
- Origins of replication
- Junk??
24Genome Structure
25E. coli genome
[Scale bar: 10 kb]
26H. sapiens genome
[Scale bar: 50 kb]
27Genome Size and Gene Number in Prokaryotes
LINEAR
28Genome Size and Gene Number in Eukaryotes
29Joe Average Genome
- Human
- 3,000,000 kbp
- 20,000 genes (as many as the worm C. elegans)
- 10% of the genome is genes
- E. coli
- 5,000 kbp
- 4,200 genes
- 90% of the genome is genes!
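The contrast between the two genomes is easier to see as gene density. The sketch below uses only the genome sizes and gene counts from this slide; the dictionary layout and function names are assumptions made for the example.

```python
# Genome size (kbp) and gene count as given on the slide.
GENOMES = {
    "Human": {"size_kbp": 3_000_000, "genes": 20_000},
    "E. coli": {"size_kbp": 5_000, "genes": 4_200},
}

def kbp_per_gene(size_kbp: int, genes: int) -> float:
    """Average amount of genome available per gene."""
    return size_kbp / genes

def genes_per_mbp(size_kbp: int, genes: int) -> float:
    """Average number of genes per megabase pair."""
    return genes / (size_kbp / 1000)

for name, g in GENOMES.items():
    spacing = kbp_per_gene(g["size_kbp"], g["genes"])
    density = genes_per_mbp(g["size_kbp"], g["genes"])
    print(f"{name}: {spacing:.2f} kbp per gene, {density:.1f} genes per Mbp")
```

With these numbers the human genome has one gene per 150 kbp on average, while E. coli has one roughly every 1.2 kbp.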
30Commonly used molecular biology techniques
31DNA Cloning
32The cloning process
- Source of Nucleic Acid
- Cloning Vector
- Host
- Selection/Screen
33Polymerase Chain Reaction (PCR)
- Template (DNA to be amplified)
- Primer Sequences
- 18-27 base pairs (bps)
- Conditions
- Annealing temperature for primers (55-65 °C)
- Extension time (1-3 minutes)
- Cycles (20-35).
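To see why 20 to 35 cycles is enough, here is a minimal sketch of ideal PCR amplification. It assumes every template is copied in every cycle (perfect doubling); real reactions fall short of this and eventually plateau. The `efficiency` parameter and the function name are hypothetical, introduced only for illustration.

```python
def copies_after(cycles: int, initial_copies: int = 1, efficiency: float = 1.0) -> float:
    """Template copies after PCR, where efficiency=1.0 means perfect doubling."""
    return initial_copies * (1 + efficiency) ** cycles

for cycles in (20, 35):
    print(cycles, f"{copies_after(cycles):.3g}")
```

Under ideal doubling, 20 cycles give about a million copies per starting molecule and 35 cycles give tens of billions.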
34PCR
35DNA Sequencing
36DNA-DNA hybridization
Southern blotting
37DNA-DNA hybridization
Dot Blot/Gene Chip
38Transcript Analyses
- Genomic transcript profiling
DeRisi, Iyer, Brown Science, 1998
Northern blot, DNA microarray
39Questions addressed by protein analysis techniques
- Identity/Amount
- modifications
- Structure
- 3D
- Detailed information about binding to ligands and
other proteins
- Location
- Function
- Protein-protein interaction
- Enzymatic activity
40Protein Expression
Western blot, 2D PAGE (separation by charge and size)
41Protein structure
42Location of a protein in a cell
43Investigating gene function
- Biochemistry
- Variation
- Sequence
- Expression
- Where, when, expression level?
- Knockout
- Overexpression
- Homology to well-studied proteins.
44DNA mutation
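[Diagram: a eukaryotic gene with Enhancer, Promoter, Transcribed Region (Exon1, Intron1, Exon2), and Terminator, with a mutation marked X. Labels on the gene: "Amount" and "Nature". The capped (7mG), polyadenylated (An) mRNA is made in the nucleus and moves to the cytoplasm, where the polypeptide (N to C terminus) is produced.]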
45Variation-Population
- Most variation unimportant to function
- SNPs (Single Nucleotide Polymorphisms)
- Some variation has functional importance
- E.g., globin gene mutants
- Thalassemia (e.g., HBA1)
- Sickle Cell Anemia (HBB)
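Whether a single-nucleotide change matters often comes down to what it does to the codon. The sketch below classifies a codon change using the standard genetic code; the function name `classify_snp` is made up for this example. The sickle cell example (GAG to GTG, glutamate to valine, in the beta-globin gene) comes from general knowledge, not from the slide itself.

```python
from itertools import product

# Standard genetic code on the DNA alphabet, codons in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def classify_snp(ref_codon: str, alt_codon: str) -> str:
    """Label a codon change as synonymous, nonsense, or missense."""
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    if ref_aa == alt_aa:
        return "synonymous"   # same amino acid, usually no effect on the protein
    if alt_aa == "*":
        return "nonsense"     # new stop codon truncates the protein
    return "missense"         # different amino acid

print(classify_snp("GAG", "GAA"))  # Glu -> Glu
print(classify_snp("GAG", "GTG"))  # Glu -> Val, the sickle cell change
print(classify_snp("GAG", "TAG"))  # Glu -> stop
```

This only covers changes inside coding sequence; variants in promoters, enhancers, or splice sites need other kinds of analysis.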
46Phylogenetic Variation
- Evolutionary changes in DNA/protein sequence.
- Variation depends on distance
- Orthologs, paralogs
- Variants function in context.
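One simple way to quantify "variation depends on distance" is the fraction of differing positions between two aligned sequences (often called the p-distance). This is a sketch of that one measure, not necessarily the method used in the course; the sequences below are invented for illustration, and the function name is hypothetical.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Fraction of aligned positions at which two equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    differences = sum(a != b for a, b in zip(seq_a, seq_b))
    return differences / len(seq_a)

# Hypothetical aligned fragments from three species.
reference = "GATTCAGGCT"
close_relative = "GATTCAGGTT"    # 1 difference
distant_relative = "GATACAGGTT"  # 2 differences

print(p_distance(reference, close_relative))    # 0.1
print(p_distance(reference, distant_relative))  # 0.2
```

More distantly related sequences tend to show larger values, although raw counts underestimate the true number of changes once sites have mutated more than once.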
47Induced/Directed Mutation: Experimental
- Often GROSS mutations
- May not reflect in any way the population or
phylogenetic variation
- Critical to functional study