General Introduction to the Genome - PowerPoint PPT Presentation

1 / 76
About This Presentation
Title:

General Introduction to the Genome

Description:

General Introduction to the Genome Transcriptome The entire mRNA content of a cell. Proteome The collection of functioning proteins synthesized by a living cell. – PowerPoint PPT presentation

Number of Views:108
Avg rating:3.0/5.0
Slides: 77
Provided by: fad66
Category:

less

Transcript and Presenter's Notes

Title: General Introduction to the Genome


1
General Introduction to the Genome
2
An Outlines
  • Molecular Biology Major Events
  • DNA, RNA
  • Protein Synthesis(Transcription Translation)
  • Genome Anatomy
  • Bioinformatics
  • Genomics Signal Processing

3
  • Molecular Biology Major Events
  • DNA, RNA
  • Protein Synthesis(Transcription Translation)
  • Genome Anatomy
  • Bioinformatics
  • Genomics Signal Processing

4
Molecular Biology Major Events
1869
Johann Friedrich
DNA Discovery
5
Molecular Biology Major Events
Edward Tatum
6
The Central Dogma
Target
Book
Book shelves
Nucleus
7
What is Life made of?
8
Eukaryotes vs Prokaryotes
DNA
DNA
9
Prokaryotes Eukaryotes
Single cell Single or multi cell
No nucleus Nucleus
No organelles Organelles
One piece of circular DNA Chromosomes
No mRNA post transcriptional modification Exons/Introns splicing
10
The Cell Chemical Composition
  • 70 Water
  • 7 Small molecules
  • Salts
  • Amino acids? (Protein)
  • Nucleotides? (DNA, RAN)
  • 23 macromolecules
  • Proteins
  • Polysaccharides
  • Lipids

11
The Cell The 3 Critical Molecules
Form enzymes Form bodys components
Hold Genetic information
Transfer Information Synthesize Protein
12
  • Molecular Biology Major Events
  • DNA, RNA
  • Protein Synthesis(Transcription Translation)
  • Genome Anatomy
  • Bioinformatics
  • Genomics Signal Processing

13
DNA the Nucleotide
14
DNA Nitrogenous base
Purines
Pyrimidines
15
DNA Polymerization reaction
5 P
3OH
5
3
16
DNA hydrogn bounds
No of base pairs Genome Size HG 3200 Mbp (Mb)
17
Sugar- Phosphate Back bone
DNA Watson - Crick Model 1951
18
DNA Watson - Crick Model
Sugar- Phosphate Back bone
No of base pairs Genome Size HG 3200 Mbp (Mb)
19
RNA versus DNA
G, A ,C,T
G, A ,C,U
20
Protein structure
  • 1902 - Emil Hermann Fischer wins Nobel prize
    showed amino acids are linked and form proteins

21
Amino acid Basic unit of protein
Different side chains, R, determine the
properties of 20 amino acids.
Amino group
Carboxylic acid group
An amino acid
22
(No Transcript)
23
Protein structure
  • Primary structure
  • Secondary structure
  • Super-secondary structure
  • Tertiary structure
  • Quaternary structure

24
Protein Structure Predication Problem
Protein sequence
Protein 3D structure
Protein Function
25
The Central DogmaGenes is proteins blueprint,
Genome
DNA
Gene
Protein
26
  • Molecular Biology Major Events
  • DNA, RNA
  • Protein Synthesis(Transcription Translation)
  • Genome Anatomy
  • Bioinformatics
  • Genomics Signal Processing

27
Protein Synthesis DNA, RNA, and the Flow of
Information
Replication
Translation
Transcription
28
Protein Synthesis Gene Expression
29
mRNA
Gene 3
1
Gene 2
2
3
Gene 1
Splicing
30
Alternative Splicing
Pre-mRNA
mRNA
Gene 3
1
Gene 2
3
2
Gene 1
31
m-RNA Editing
Pre-mRNA
mRNA
Gene 3
1
Gene 2
2
3
Gene 1
32
(No Transcript)
33
Translation
Pre-mRNA
Start Codon
mRNA
Gene 3
AUGAUAAC UA G
Gene 2
CV
Gene 1
Stop Codon
34
Protein Synthesis The Genetic Code
Start
Stop
35
Gene Regulation
1
Gene 1
2
3
R Gene 1
Regulatory protein
36
Gene Regulation
We have a little knowledge about regulatory
mechanisms
Regulatory protein Gene 1
Gene 2
Gene 1
Regulatory protein Gene 2
37
What a big Genome Size?
  • The 12 font size enables approximately 60
    nucleotides of DNA sequence to be written in a
    line 10 cm in length.
  • Genome size total number of nucleotide base
    pairs.
  • typically in millions of base pairs, or megabases
    abbreviated Mb or Mbp)

38
  • Molecular Biology Major Events
  • DNA, RNA
  • Protein Synthesis(Transcription Translation)
  • Genome Anatomy
  • Bioinformatics
  • Genomics Signal Processing

39
the human genome sequence would stretch for 5000
km, the distance from Montreal to London, Los
Angeles to Panama, Tokyo to Calcutta, Cape Town
to Addis Ababa, or Auckland to Perth
The sequence would fill about 3000 books the size
of book 600 pages size.
40
Genome size of organism are different
41
Genome size is not good indicator for genes number
42
  • Space is saved in the genomes of less complex
    organisms because the genes are more closely
    packed together.

43
C-value paradox
  • Correlation between the complexity of an
    organism and the size of its genome was looked on
    as a bit of a puzzle.

44
Genome Anatomy
45
Human Genome Anatomy
Human genome? Nuclear genome
? Mitochondrial genome
46
Human Mitochondrial Genome Anatomy
  • it is much smaller than the nuclear genome(17
    kB), and it contains just 37 genes.
  • 13 code proteins and 24 specify non-coding RNA.
  • do not contain intron.
  • is typical of the mitochondrial genomes of other
    animals

47
(No Transcript)
48
Nuclear Human Genome Anatomy
62
49
Nuclear Human Genome Anatomy Protein Coding Genes
50
Nuclear Human Genome Anatomy Protein Coding Genes
five exons, separated by four introns.
average exons nine exons per gene
51
Two gene segments (V28 and V29-1)
52
Nuclear Human Genome Anatomy pseudogene
Non functional genes
53
Nuclear Human Genome Anatomy genome-wide repeat

54
Nuclear Human Genome Anatomy genome-wide
repeat
  • Tandemly repeated DNA
  • Minisatellite DNA
  • Microsatellite DNA
  • Interspersed genome-wide repeats
  • SINE
  • LINES
  • LTR
  • DNA transposons

55
Nuclear Human Genome Anatomy genome-wide repeat
Minisatellite DNA
  • we are familiar with because of its association
    with structural features of chromosomes.
  • Telomeric DNA, which in humans comprises hundreds
    of copies of the motif 5'-TTAGGG-3'.

..
TTAGGG
TTAGGG
TTAGGG
..
AATCCC
AATCCC
AATCCC
56
The content of the human nuclear genome
genome-wide repeat Microsatellite DNA
  • microsatellites with a CA repeat, such as
  • make up 0.25 of the genome, 8 Mb in all.
  • Single base-pair repeats such as
  • make up another 0.15.

57
Nuclear Human Genome Anatomy genome-wide repeat
Interspersed repeat
58
Gene Classification Gene function
  • This system has the advantage that the fairly
    broad functional categories used in can be
    further subdivided to produce a hierarchy of
    increasingly specific functional descriptions for
    smaller and smaller sets of genes.
  • The weakness
  • functions have not yet been
    assigned to many eukaryotic genes.

59
Gene Classification Gene function
  • The gene catalog couldnt tell us why we are
    human?
  • it may still not be possible simply from genome
    comparisons with the chimpanzee genome to
    determine what makes us human

60
Gene Classification Gene function
  • The major categories of protein coding genes
    represent the most studied areas of cell biology,
    which means that many of the relevant genes can
    be recognized because their protein products are
    known.
  • Genes whose products have not yet been identified
    are more likely to be involved in the less well
    studied areas of cellular activity.

61
Gene classification Protein Domain
  • A more powerful method is to base the
    classification not on the functions of genes but
    on the structures of the proteins that they
    specify.
  • A protein molecule is constructed from a series
    of domains, each of which has a particular
    biochemical function.

62
Gene classification Protein Domain
63
  • Molecular Biology Major Events
  • DNA, RNA
  • Protein Synthesis(Transcription Translation)
  • Genome Anatomy
  • Bioinformatics
  • Genomics Signal Processing

64
What is Bioinformatics?
  • Integration of computational and biological
    methods
  • to convert biological information into general
    theories.

aatgcatgcggctatgctaatgcatgcggctatgctaagctgggatccg
atgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggat
ccgatgactatgctaagctgggatccgatgacaatgcatgcggctatgct
aatgaatggtcttgggatttaccttggaatgctaagctgggatccgatga
caatgcatgcggctatgctaatgaatggtcttgggatttaccttggaata
tgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcg
gctatgctaatgcatgcggctatgcaagctgggatccgatgactatgcta
agctgcggctatgctaatgcatgcggctatgctaagctgggatccgatga
caatgcatgcggctatgctaatgcatgcggctatgcaagctgggatcctg
cggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgg
gatccgatgacaatgcatgcggctatgctaatgaatggtcttgggattta
ccttggaatatgctaatgcatgcggctatgctaagctgggaatgcatgcg
gctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgca
tgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgc
taatgcatgcggctatgctaagctcatgcggctatgctaagctgggaatg
catgcggctatgctaagctgggatccgatgacaatgcatgcggctatgct
aatgcatgcggctatgcaagctgggatccgatgactatgctaagctgcgg
ctatgctaatgcatgcggctatgctaagctcggctatgctaatgaatggt
cttgggatttaccttggaatgctaagctgggatccgatgacaatgcatgc
ggctatgctaatgaatggtcttgggatttaccttggaatatgctaatgca
tgcggctatgctaagctgggaatgcatgcggctatgctaagctgggatcc
gatgacaatgcatgcggctatgctaatgcatgcggctatgcaagctggga
tccgatgactatgctaagctgcggctatgctaatgcatgcggctatgcta
agctcatgcgg
65
Data structures Software engineering (C,
C,PERL)
Cell structure Genome, genes DNA, RNA
Biology
Computer Science
Bioinformatics
Chemistry
Statistics
Markof Model Neural Network
Protein structure Molecular bounds
66
Bioinformatics Subareas
  • The subareas within bioinformatics include
    Genomics and Proteomics.

Genome comparison evolutionary tree
Microarray Analysis Gene predication Gene
classification Gene regulation
Protein 3D predication Protein protein
interaction Protein alignment
67
  • Molecular Biology Major Events
  • DNA, RNA
  • Protein Synthesis(Transcription Translation)
  • Genome Anatomy
  • Bioinformatics
  • Genomics Signal Processing

68
What is GSP?
  • Analysis
  • Using Theory and Methods of Signal Processing
  • Processing

aatgcatgcggctatgctaatgcatgcggctatgctaagctgggatccg
atgacaatgcatgcggctatgctaatgcatgcggctatgcaagctgggat
ccgatgactatgctaagctgggatccgatgacaatgcatgcggctatgct
aatgaatggtcttgggatttaccttggaatgctaagctgggatccgatga
caatgcatgcggctatgctaatgaatggtcttgggatttaccttggaata
tgctaatgcatgcggctatgctaagctgggatccgatgacaatgcatgcg
gctatgctaatgcatgcggctatgcaagctgggatccgatgactatgcta
agctgcggctatgctaatgcatgcggctatgctaagctgggatccgatga
caatgcatgcggctatgctaatgcatgcggctatgcaagctgggatcctg
cggctatgctaatgaatggtcttgggatttaccttggaatgctaagctgg
gatccgatgacaatgcatgcggctatgctaatgaatggtcttgggattta
ccttggaatatgctaatgcatgcggctatgctaagctgggaatgcatgcg
gctatgctaagctgggatccgatgacaatgcatgcggctatgctaatgca
tgcggctatgcaagctgggatccgatgactatgctaagctgcggctatgc
taatgcatgcggctatgctaagctcatgcggctatgctaagctgg
  • To gain global understanding of Genome.

69
GSP Labs
  • The Genomic Signal
  • Processing Laboratory at Texas AM
    University.
  • The Computational Biology Division of the
    Translational Genomics
  • Research Institute in Phoenix, Arizona.

To model Genomic Regulatory Mechanisms for the
purposes of diagnosis and therapy.
Edward R. Dougherty
70
GSP Labs
  • Columbia's Genomic Information Systems Laboratory
  • at Columbia University

Dimitris Anastassiou
71
GSP Labs
  • DSP Group, Department of Electrical Engineering,
    California Institute of Technology

P. P. Vaidyanathan
72
Mapping Character String to Numerical Sequences
AAAATTTTCCCGGGTAGCTTTCCCGGGT
0001110101010101111111111000
73
Research Area of GSP
  • Gene Predication
  • Genes Predication
  • Hidden Markov Models (HMM)
  • Fourier Transform
  • Wavelet Transform
  • Resonant Recognition Model (RRM)
  • To identify the common hot spots of many protein
    molecules using Fourier transform methods.

74
References
  • http//biology.ucok.edu/bidlack/biology/notes.htm
  • http//www.ncbi.nlm.nih.gov/books/bv.fcgi?ridgeno
    mes
  • http//www.estrellamountain.edu/faculty/farabee/bi
    obk/biobooktoc.html
  • http//www.werathah.com/
  • http//lectures.molgen.mpg.de/online_lectures.html

75
References
  • http//www.biology.lsu.edu/webfac/jmoroney/BIOL309
    0/

76
THANKYOU FOR YOUR ATTENATION
Write a Comment
User Comments (0)
About PowerShow.com