Title: Oleh
1Kesedaran dan Keusahawanan BIOTEKNOLOGI
Dewan Tun Dr Ismail, PWTC
Isu, Cabaran dan Peluang An Overview
Oleh Dr.Azman Firdaus Shafii dafs_at_aldrich.com.my O
PEN SOURCE SYSTEMS SDN BHD www.aldrich.com.my
30 Julai 2002
Page 1
Open Source Systems Sdn Bhd
2 Imagine two asteroids directly colliding in
space. There is a lot of energy release, a lot
of bright light, a lot of noise. Pieces of
rocks fly in all directions, in a plethora of
shapes, sizes and momenta. But can we predict
which pieces are going to be dominant, and which
are not?
July 2002
Page 2
Open Source Systems Sdn Bhd
3WHAT IS BIOINFORMATICS?
The study of information content and information
flow in biological systems and processes
Page 3
Open Source Systems Sdn Bhd
4EXPONENTIAL DATA GROWTH
- For the first time in the history of
science or ICT, - data growth exceeds computer performance growth.
- Biological/Life-Science data doubles every 10-12
months - Computer performance doubles every 18 months
(Moore's Law, co-founder of Intel)
Why?
Page 4
Open Source Systems Sdn Bhd
5KNOWN GENOME SIZES
Typical bp sequence ...CGATCCCAATT......
Page 5
Open Source Systems Sdn Bhd
6NUMBER OF GENES IN DNAs
Human Beings - 30,000 to 40,000 Fruit
Fly - 13,601 Worm - 19,098 The Human
Genome Project is targeting June 2003 for a
complete, correct, end-to-end copy with
an overall accuracy of 99.99
Page 6
Open Source Systems Sdn Bhd
7Now that the genie is out of the bottle, what
do we do?
The idea is to send a thousand
ships, not a catalogue of a thousand
genes Eric Lander, Director Whitehead Institute,
MIT, 2000
So, Enter the Post-Genomics Challenge!
Page 7
Open Source Systems Sdn Bhd
8BIOINFORMATICS
Think content! Think workflow!
LAB AUTOMATION
Samples
Analysis
Measurement
Results
BIOINFORMATICS
Reference
External data
A CONCEPTUAL ROLE OF BIOINFORMATICS
Page 8
Open Source Systems Sdn Bhd
9One Possible Supercomputing Roadmap
Outputs
Post-processing
Pre-processing
Analysis
- HR
- Computers
- ICT infra development tools
- software applications
Laboratory Information Management Systems (LIMS)
- Data management
- Knowledge management
- Workflow management
- DNA/Reference
- Proteins/Mass Spectroscopy
- Interactions
- Structures
- Screening e.g. HTS
DATABASE
Cluster Computing on Linux and Open Source
Technologies
Page 9
Open Source Systems Sdn Bhd
10- DATA
- Stored fact
- Inactive (they exist)
- Technology-based
- Gathered from various sources
- INFORMATION
- Presented fact
- Active (enabler)
- Business-based
- Transformed from data
Modelling
Data
Information
Knowledge
Analyses
Page 10
Open Source Systems Sdn Bhd
11Global Life Science Value Chain
Industry Sectors
Page 11
Open Source Systems Sdn Bhd
12Global Biosciences Market Forecast, 20012006
The overall market will grow at a 24 CAGR
through 2006 to reach nearly US38B
Page 12
Open Source Systems Sdn Bhd
13Asia/Pacific Biosciences Market Forecast,
20012006
The overall market will grow at a 56 CAGR
through 2006 to reach US3.6B
Page 13
Open Source Systems Sdn Bhd
14DEMAND SIDE
- Cheminformatics
- Characterisation of Combinatorial Libraries
- Optimisation of Combinatorial Libraries
- Gene-based Informatics
- Genome Data Analysis
- Genome Datamining
- SUPPLY SIDE
- Gene Sequencing Data Generation Software
- Object-oriented Framework for Gene Sequencing
Data Management - Gene Sequence Analysis Software
- Gene Sequence Data Dissemination Software
Page 14
Open Source Systems Sdn Bhd
15- BEWARE!
- There is a lot of excitement, market talk, hype
and froth, but there is also substance. - Some technology bets will fail.
- Sound creative business models will survive, even
prosper.
- Bioinformatics Health Status Today
- Heterogeneous data types
- Distributed systems
- Few standards
- Unified query lacking
- Volumes, and volumes of data and databases
Conclusion Need clearly architected systems,
yet maintaining flexibility as new data emerges
Page 15
Open Source Systems Sdn Bhd
16BIOINFORMATICS USE LOTS OF DATABASES
Users
Example of a Public Domain Databank available to
government, academia and industry
Page 16
Open Source Systems Sdn Bhd
17THE NEW BIOLOGY INFORMATION-DELIVERY MODEL
Multimedia H/P
PDA
PC
Global WAN Great Global Grid (G.G.G)
Presentation Layer
Visualization Hi-performance Workstation
Laptop
Computational Data Management Computing Layer
Data Biological Financial Chemical
Admin Proteomic Diagnostic Genetic
Treatment Clinical
Server B
Server A
Server C
Information Management Layer
Storage Management
Database Management
Page 17
Open Source Systems Sdn Bhd
18Top 10 Clusters
19th Edition of TOP500 List of Worlds Fastest
Supercomputers
Total Peak Performance (Gflops)
Source clusters_at_top500,2002
Page 18
Open Source Systems Sdn Bhd
19Workhorses behind the analysis
Page 19
Open Source Systems Sdn Bhd
20BioMed GRID Singapore Map
Page 20
Open Source Systems Sdn Bhd
21ACKNOWLEDGEMENTS 1. Dr. Hwa A Lim, Genetically
Yours, World Scientific (2002) 2. Dr.
Christopher Hogue, Co-Founder, MDS Proteomics,
Inc.(2002) 3. Bio-IT World Conference EXPO,
Papers Presentations, Suntec Singapore, 29-30
May 2002 Organised by IDG World Expo (Asia) Ltd.
Page 21
Open Source Systems Sdn Bhd
22THANK YOU
Page 22
Open Source Systems Sdn Bhd