Biological Sequence Analysis - PowerPoint PPT Presentation

1 / 29
About This Presentation
Title:

Biological Sequence Analysis

Description:

2003, Human Genome Project Completed! The Human Genome. DAD. MOM. YOU ... The Human Genome. DAD. MOM. YOU. 2 copies in every cell. One copy from each parent ... – PowerPoint PPT presentation

Number of Views:35
Avg rating:3.0/5.0
Slides: 30
Provided by: sch17
Category:

less

Transcript and Presenter's Notes

Title: Biological Sequence Analysis


1
Biological Sequence Analysis
  • 140.638.01

2
Course Objectives
  • See handout

3
(No Transcript)
4
Related Journals
  • Journal of Computational Biology
  • PLoS Computational Biology
  • Bioinformatics
  • Human Molecular Biology
  • Nucleic Acid Research
  • Nature

5
Who are you?
  • Field of research/What program
  • Stats background
  • Computing background
  • Register or audit
  • Why are you taking this course
  • Specific topics you are interested
  • Working on any sequence-related projects right
    now

6
Administrative Details
  • See handout
  • Alternative times for class
  • Mon 330-5
  • Friday 1030-1150

7
Historic Overview of Sequence Analysis
8
  • 1951, Sanger and Tuppy Protein Sequencing
  • 1960s, Margaret Dayhoff assembled protein
    sequence atlas. (NBRF)
  • Late 1970s, Walter Goad assembled DNA sequence
    database. (LANL)
  • Lipman and Benson ENTREZ, internet access to
    sequence database
  • Early 1980s, computers applied to sequence
    analysis

9
  • Sequence Alignment
  • 1970, Dot Matrix (Gibbs McIntyre)
  • 1970, Dynamic programming for global alignment
    (Needleman Wunsch)
  • 1981, local alignment (Smith Waterman)
  • 1982-1995, multiple sequence alignment
  • Database search for similar sequences
  • 1988, Pearson Lipman FASTA
  • 1990, Altschul et al. BLAST

10
  • RNA secondary structure 1971 (Tinoco)
  • Gene prediction 1982(Fickett)
  • Phylogenetic trees 1987 --
  • 2003, Human Genome Project Completed!

11
The Human Genome
Genomic Content 3.3billion bases 30K genes 23
chromosomes (22X/Y
DAD
MOM
2 copies in every cell (46 chr) One copy from
each parent Each parent passes on a mixed copy
YOU
12
(No Transcript)
13
Nucleotides are the chemical building block of
Nucleic Acids DNA and RNA
14
(No Transcript)
15
Nucleotide sequence determines the amino acid
sequence
16
(No Transcript)
17
Alternative Splicing
  • The human genome contains 30k genes
  • But we have gt300k proteins

18
The Human Genome
Genomic Content 3.3 billion bases 30K genes 23
chromosomes (22X/Y)
2 copies in every cell One copy from each
parent Each parent passes on a mixed copy
DAD
MOM
Deletions Insertions Mutations Evolutionary Scale
YOU
19
Review of basic probability
  • Random variables
  • Conditional probability
  • Expectation variance

20
In An Experiment
  • Sample Space all conceivable outcomes
  • Sample point one conceivable outcome
  • Random variable (informal def.) A quantity
    that takes a value from the sample space with
    some degree of randomness
  • Event a set of conceivable outcomes
  • Probability of an event A how likely will event
    A occur

21
Independence
  • If A and B are two events defined on the same
    sample space S and
  • P(A AND B) P(A) P(B),
  • A and B are called independent events.
  • Typically, two events are independent if the
    outcome of one does not affect the outcome of the
    other.

22
Independent Bernoulli trials
  • An event with 2 outcomes (coin-tossing)
  • Are the building blocks of
  • Binomial distribution
  • Geometric distribution

23
(No Transcript)
24
Expectation of a Random Variable
  • The average value of the random variable after a
    large number of random experiments
  • Expectation of a Bernoulli random variable, a
    binomial variable?

25
Variance of a Random Variable
  • The average squared deviation of the random
    variable from its mean
  • Variance of a Bernoulli random variable, a
    binomial variable?

26
(No Transcript)
27
(No Transcript)
28
(No Transcript)
29
The materials used in this class are made
possible by
  • Zhiping Weng, http//zlab.bu.edu
  • Carlo Colantuoni
  • Wenyi Wang
  • Zhijin Wu
  • Garland publishing, Albertss the Cell
  • And the wealth of internet resources
Write a Comment
User Comments (0)
About PowerShow.com