Title: The UK e-Science Vision: Building a Sustainable e-Infrastructure
1 The UK e-Science VisionBuilding a Sustainable
e-Infrastructure
- Tony Hey
- Director of UK e-Science Core Programme
- Tony.Hey_at_epsrc.ac.uk
2The UK e-Science Paradigm
- The Integrative Biology Project involves seven UK
Universities lead by Oxford and the University
of Auckland in New Zealand - Models of electrical behaviour of heart cells
developed by Denis Nobles team in Oxford - Mechanical models of beating heart developed by
Peter Hunters group in Auckland - Researchers need robust middleware services to
routinely build secure Virtual Organisations to
support an international collaboratory - Goal is to enable faster, better or different
research
3RCUK e-Science Funding
- First Phase 2001 2004
- Application Projects
- 74M
- All areas of science and engineering
- Core Programme
- 15M Research infrastructure
- 20M Collaborative industrial projects
- Second Phase 2003 2006
- Application Projects
- 96M
- All areas of science and engineering
- Core Programme
- 16M Research Infrastructure
- 11M DTI Technology Fund
4Some Example e-Science Projects
- Particle Physics
- global sharing of data and computation
- Astronomy
- Virtual Observatory for multi-wavelength
astrophysics - Chemistry
- remote control of equipment and electronic
logbooks - Engineering
- industrial healthcare and virtual organisations
- Bioinformatics
- data integration, knowledge discovery and
workflow - Healthcare
- sharing normalized mammograms
- Environment
- Ocean, weather, climate modelling, sensor networks
5UK e-Science Grid
Edinburgh
Glasgow
Newcastle
DL
Belfast
Manchester
Cambridge
Oxford
Hinxton
RAL
Cardiff
London
Southampton
6Access Grid Group Conferencing
All UK e-Science Centres have AG rooms Widely
used for technical and management meetings
Multi-site group-to-group conferencing
system Continuous audio and video contact with
all participants Globally deployed
7A Status Report on UK e-Science
- An exciting portfolio of Research Council
e-Science projects (40 projects) - Beginning to see e-Science infrastructure deliver
some early wins in several areas - DiscoveryNet success at SC02
- TeraGyroid success at SC03 heroic achievement
- The UK is unique in having a strong collaborative
industrial component (50 projects) - Nearly 80 UK companies contributing over 30M
- Engineering, Pharmaceutical, Petrochemical, IT
companies, Commerce, Media,
8Breakdown of AHM attendees
9Identifiable UK e-Science Focus
- Data Access and Integration
- OGSA-DAI and DAIT project
- Grid Data Services
- Workflow, Provenance, Notification
- Distributed Query, Knowledge Management
- Data Curation and Data Handling
- Digital Curation Centre
- Security, AA and all that
- Digital Certificates and Single Sign-On
- Federated Shibboleth framework
10UK e-Science Grid Second Phase - Web Service
Grids
Edinburgh
Glasgow
Newcastle
DL
Belfast
Manchester
Cambridge
Oxford
RL
Hinxton
Cardiff
London
Soton
11The e-Science Core Programme Phase 2 - Building
a Sustainable National e-Infrastructure
- The UK Government Investment Framework for
Research and Innovation 2004 2014 report
emphasizes the need for - Creation of a multidisciplinary research
environment - National Information Infrastructure
- Access to experimental data sets and publications
- Collection and preservation of digital information
12Key Elements of a UK
e-Infrastructure
- Research Network
- National Grid and HEC Service
- Open Middleware Infrastructure Institute
- Digital Curation Centre
- National e-Science Institute
- Portals and Discovery Services
- Access to facilities, data services and
repositories - Tools and Services to support collaboration
- National data archive
- Support for International Standards
13SuperJANET4/5
14Local Research Equipment
UK Researchers
International Point-of-Access
Extended JANETDevelopment Network
Existing connections
Proposedconnections
CAnet
UKLightLondon
StarLightChicago
10Gb/s
10Gb/s
2.5Gb/s
Abilene
CERN
10Gb/s
10Gb/s
10Gb/s
NetherLightAmsterdam
CzechLight
JISC 6.5M for UKLight Lambda Network
GEANT
15The Future Hybrid Networks?
- Standard packet routed production network for
email, Web access, - User-controlled lambda connections for
e-Science applications requiring high performance
end-to-end Quality of Service
16 NGS Today
Interfaces
Projects e-Minerals e-Materials Orbital
Dynamics of Galaxies Bioinformatics (using BLAST)
GEODISE project UKQCD Singlet meson
project Census data analysis MIAKT
project e-HTPX project. RealityGrid
(chemistry) Users Leeds Oxford UCL Cardiff South
ampton Imperial Liverpool Sheffield Cambridge Edin
burgh QUB BBSRC CCLRC.
OGSILite
17NGS Tomorrow
Web Services based National Grid Infrastructure
18Lessons from the NGS?
- Do users want an NGS?
- What will they use it for?
- Will the data nodes be useful?
19Research Prototypes to Production
Quality Middleware?
- Research projects are not funded to do the
regression testing, configuration and QA required
to produce production quality middleware - Common rule of thumb is that it requires at least
10 times more effort to take proof of concept
research software to production quality - Key issue for UK e-Science projects is to ensure
that there is some documented, maintainable,
robust grid middleware by the end of the 5 year
250M initiative
20Open Middleware Infrastructure Institute (OMII)
- Vision
- To be the national provider of reliable,
interoperable, open source grid middleware - Provide one-stop portal and software repository
for grid middleware - Provide quality assured software engineering,
testing, packaging and maintenance for our
products - Located in Southampton with Edinburgh node
producing OGSA-DAI middleware
21OMII-1Release Oct 2004
- Collection of tested, documented and integrated
software components for Web Service Grids - A base built from off-the-shelf Web Services
technology - A package of extensions that can be enabled as
required - An initial set of Web Services for building
file-compute collaborative grids - Technical preview of Web Service version of
OGSA-DAI database middleware - Sample applications
- www.omii.ac.uk
22OMII future distributions
- Include the services in previous distributions
- OMII managed programme contributions
- Database service
- Workflow service
- Registry service
- Reliable messaging service
- Notification service
- Interoperability with other grid middleware
- GT4, Condor
- EGEE, TeraGrid
23Digital Curation?
- In next 5 years e-Science projects will produce
more scientific data than has been collected in
the whole of human history - In 20 years can guarantee that the operating and
spreadsheet program and the hardware used to
store data will not exist - Research curation technologies and best practice
- Need to liaise closely with individual research
communities, data archives and libraries - Edinburgh with Glasgow, CLRC and UKOLN selected
as site of DCC
24Digital Curation Centre
- Actions needed to maintain and utilise digital
data and research results over entire life-cycle - For current and future generations of users
- Digital Preservation
- Long-run technological/legal accessibility and
usability - Data curation in science
- Maintenance of body of trusted data to represent
current state of knowledge in area of research - Research in tools and technologies
- Integration, annotation, provenance, metadata,
security..
25Imperial
Imperial
GLOBUS
EBI
EBI
UCL
UCL
Local databases
User query
GRID sharing
Linux farms
26BioSimGrid Project
Application
Distributed Query
2nd Level Metadata Describing the Results of
Generic Analyses
Analyse Data
1st Level Metadata Describing the Simulation
Data
Simulation Data
Distributed Raw Data
BioSimGRID - A biosimulation GRID database
27e-HTPx Project
Phasing
Crystallisation
Data Collection
Protein Production
Protein Structure
START Target Selection
Deposition
Structure analysis
28NERC DataGrid Project Remote Access Grid
Challenge
British Atmospheric Data Centre
British Oceanographic Data Centre
http//ndg.nerc.ac.uk
29Partners for e-Infrastructure
- Sustainability requires long-term support
- e-Science is creating the e-Infrastructure for
research - The support of the e-Infrastructure will, over
time, become the role of JISC. - Research and Development of this e-Infrastructure
will be the responsibility of OST/RCUK.
30JISC Funded e-Research Projects
- Security
- Knowledge management
- Collaborative Environments
- Visualisation
- Data curation and handling
- Middleware architectures and development
- Education and Outreach
31UK e-Science Grid Vision
- TeraGrid and DEISA Vision of Supercomputer Centre
Grids - SETI_at_HOME and ClimatePrediction.Net Vision of
Public Idle-Cycle Grids - Particle Physicists Vision of truly global
Compute-File Grids - UK e-Science Plug and Play Grid Vision driven
by user needs - Need to provide robust, interoperable Middleware
Services so that different user communities can
connect the resources and research groups that
they need for their type of e-Science Grid