Title: UCSB Bio-imaging Infrastructure
1UCSB Bio-imaging Infrastructure
Center for Bioimaging Informatics
www.bioimage.ucsb.edu Supported by NSF ITR
0331697
2Projects
- Bisque/OME
- In use at UCSB
- Bisquik
- Next generation system (in construction)
3Motivation
- Analysis creates knowledge
- Image analysis
- Querying
- Mining
- Available Storage systems are growing
- Available Processing increasing
4Challenges
- Diverse users
- UCSB Neurosciences institute
- Retinal detachment Fischer et al
- Microtubule dynamics Feinstein and Wilson
- CMU Murphy Lab
- Flybase
- Computable Plant
- Dataset challenges
- Multiple types and collection techniques
- Complex images (5D) and metadata
- Acquisition and management of large sets
- Security of private data
- Sharing of experimental data
5Challenges
- Metadata
- Collection
- Organization
- Sharing/interpretation
- Dataset management
- Personal collections
- Multiple organizations
- Privacy and security
- Analysis design
- Analysis integration
6Block System diagram
Image Analysis
Browse Search Content metadata
Collection Analysis
Interactive Enhancement
Semantic Analysis
Knowledge Discovery
Image and metadata capture
DB Storage
Ground truth Collection
7BISQUEBio-Image Semantic Query User Environment
- Current dataset collection
- BISQUE functionality
- Data management
- Ground truth acquisition
- Analysis
- Architecture
8Current collections
- UCSB Retinal Objects (confocal, EM)
- UCSB Microtubule Objects(light, AFM)
Type Current Backlog Rate/y Expected 4Yrs 2D Images
Retinal EM 500 22000 500 23,000 23K
Retinal confocal P 3000 600 2400 10,000 10K
Retinal confocal Z 600 14000 12000 10,000 50K
Microtubule light 3000 2500 2500 13,000 400K
Microtubule AFM 0 500 1200 5000 30K
Flybase 0 125K 0 125K 125K
- Total estimated size in TBs and growing for 1 Lab
- Complexity and analysis are the main issues
9Data management capabilities
- Digital notebook
- Direct import process
- Reconfigurable (confocal, 4 x microtubule, etc)
- Web
- Web access and browsing
- Organize images and metadata
- Data sharing environment
- Search by metadata or content
- Integrated analysis
10Screenshots (Digital Notebook)
11Data management capabilities
- Digital notebook
- Capture experimental/image parameters
- Direct Import process
- Reconfigurable (confocal, 4 x microtubule, etc)
- Web
- Web access and browsing
- Organize images and metadata
- Data sharing environment
- Search by metadata or content
- Integrated analysis
12Screenshots (browsing)
Dataset Browsing
13Screenshots (search)
14Screenshots (search)
15Screenshots (search similar)
16Screenshots (search similar)
17Screenshots (personal collection)
Data sharing
18Screenshots (5D Viewer)
19Screenshot (5D with tracks)
20Image analysis
- Integrated image analysis
- Image enhancement
- Cell counter (ImageJ)
- Quantify microtubule dynamics
- Microtubule tracker (manual, automatic)
- Track identification
- Integration in progress
- Segmentation and classification
- Modeling microtubule dynamics
- Relevance feedback improving search
21Screenshots (Image Analysis)
Image J Cell counter
22System description overview
- Current dataset collection
- Current functionality
- Data management
- Ground truth acquisition
- Analysis
- Architecture
23Hardware/software infrastructure
- Hardware
- 16 node Cluster
- dual Intel Xeon 3GHZ
- Gigabit network switch
- 2 TB Storage
- Software
- Bisque
- OME (Apache, Postgresql)
- Linux
24Bisque ArchitectureBio-Image Semantic Query User
Environment
BISQUE
Research in biology
Distributed computing cluster
image
Digital Notebook
XML
metadata
features
Image/ metadata server
search
WEB
analysis
Cell counter lug-in for ImageJ
Cell counter lug-in for ImageJ
external
internal
Cell counter lug-in for ImageJ
XML
Cell counter plug-in for ImageJ
Research in image processing and indexing
25OME Base
- Open Microscopy Environment
- OME is an open source software project to
develop a database-driven system for the
quantitative analysis of biological images. OME
is a collaborative effort among academic labs and
a number of commercial entities. - Provides base for image/metadata storage and
analysis integration - Boston Sorger Lab
- Baltimore Goldberg Lab
- Dundee Swedlow Lab
- Madison LOCI
26Bisque extensions
- Ongoing extensions with OME
- Content based search
- Analysis integration with OME
- Segmentation
- Cell Counting
- MT identification and Analysis
- Etc.
- Front end dataset and analysis support
- Schema additions for image types and analysis
- (Uncertainty modeling and queries)
27Bisque
- Built 1st generation w/ gt5000 5-D images
- Integrated several useful analyses
- Being used internally
- External Interest
- Immediate future
- Continued development
- analysis integration
- Dataset integration
- Remote deployments
- Integration with other projects
- Flybase Integration of schema/datasets in
progress - Computableplant.org
28Projects
- Bisque/OME
- In use at UCSB
- http//hammer.ece.ucsb.edu/bisque
- guest/bioimage
- Bisquik
- Next generation system (in construction)
29Bisque/OME lessons learned
- Getting the correct metadata is hard
- Correct may need to change or be reinterpreted
- Sql schema difficult to change
- Getting the right name is difficult
- Different terms used in different labs make data
integration painful - Analysis integration needs to be easy
- Researcher balk at learning complex software
- Users expect a rich experience
- Google, flickr, etc have raised the bar
30New project Bisquik
- DoughDB Flexible metadata
- Ontology support
- Rich user experience
- Web 2.0 Ajax, SVG
- Web based tools DN, Graphical Annotator
- Programming toolkit
- Smaller is better
- Distributed data store
- Support for large scale computation
- Expected early 2007
31Block Components
Metadata Annotation
DoughDB
Metadata Renderer
Permission
Web UI
Ontology
Blob/Image Server
Remote Access
Analysis Engine
32Tagging Screen Shot
33Motivation
- Current metadata model is inflexible
- Adding new experimental images requires
- Changes to Digital notebook
- Changes to Bisque interface
- Changes to OME/postgres
- Shouldnt this be easier?
- Find images tagged with rod-opsin
- Create a region and specify the object
- Add experimental metadata to this dataset
34Bisquik requirements
- Add new tag/value pairs to any db object
- (Foo,2)
- (visible-cell, rod)
- Allow multiple tags with same value
- (visible-cell, rod)
- (visible-cell, muller)
- Support fine-grained tag permission/visibility
- Tags have creators and access control
- Support update semantics preserve history
- Timestamp tags
- No deletes (except under certain conditions)
35Bisquik DoughDB
OID4
OID1
image OID1
Feature f1
Name GH1020
Foo 2
pixels OID2 OID3
OID2
image OID1
data Server//
Pixel-type raw
36Bisquik queries
- Find objects (images) with tag foo
- Select a.foo
- Find images with name GV100
- Select a.name gv100
- Find images with cellcount lt 100
- Select a.cellcount lt 100
- Find images with a region similar to r1 based on
feature f1 - Select r.image i and l2(r.f1, r1 ) lt 10
37DoughDB key features
- No classes or types only tag/value pairs
- Open ended data model
- Pair values have owner and acl
- Preserves history of annotations
- SQL like query language
- Simple keyword queries
38Bisquik ontology support
- ontology is a data model that represents a
domain and is used to reason about the objects in
that domain and the relations between them - Unstructured tag/value
- Great for taggers
- Unhappy searchers
- Different labs use different terms for the same
object.
39Bisquik ontology support
- Dictionary of terms and relations
- Require (or strongly suggest) that tags and value
are defined before use - Drop into ontology editor when new values and
tags are encountered. - Integrated into search system
- Permit (or offer) alias part-of related-to
searches
40Analysis Engine
- Analysis Components
- Analysis Applications
- Glue language python
- Component connection library
- Memory for single node
- MPI for cluster based execution
- Execution engine
- Automatic component placement
- Resources and efficient communication
41Analysis Engine
- Interfaced with DoughDB
- All input/output are Pairs or objects
- Application executions recorded for data
provenance
42Bisquik interface
- http//oib.ece.ucsb.edu/bisquik
- Some bisque functionality
- Flickr-like interface for region tagging
- Pair tagging and keyword tagging
- Metadata renderers
- Textual (tag)
- Graphical (regions, geometry, graph data)
- Analysis oriented (histogram)
43Region Tagging screenshot
44Bisquik Metadata Annotation
- Unified offline (Digital Notebook) and online
manipulation. - Easy to build annotation forms
- Allow schema modification in field
- Permit annotation templates to be shared between
DN and Bisquik - Graphical geometry annotator
45Blob Image server
- Extensible server for write-once objects
- Pixels, Features
- Pluggable transforms/operations
- Thumbnails, slices
- pixel transforms (watermarks)
- Graphical metadata renderers
46Remote Access
- All basic services are web accessible
- Soap and WSDL
- DoughDB, Image server,
- DoughDB pairs have unique 64 bit IDs
- Split between machine ID/Pair ID
- All pairs are addressable
- Query engine processes foreign pairs
47Bisquik new hardware
- 10 TB disk mirrored array (20TB)
- Image Server
- Database (backup) server
- 8-16 Query/Analysis nodes
Image Server
DB/Backup Server
10TB
10TB
48Status
- Web UI
- Uploading, Tagging, simple searches
- DoughDB
- Single node storage and queries
- Analysis Engine
- In design
49Conclusion
- Bisque Team August, Jiejun, Melissa,Kris
- Bisquik Team
- Interface August Jiejun Jaechok
- Analysis Engine Kris, Mellisa, Dmitry
- Ontology David(summer), Verleen(summer), Kris
- DoughDB Kris,Vebjorn
50Biowall
51Biowall
- 5x4 1600x1200 displays
- (8000x4800 pixels)
- 38 MegaPixels
- 1 Head node 5 slaves
- Distributed multihead X (X proxy)
- New simplified viewer
- OpenEV image server
- Local client
- Web/Database client
- 3D visualizations
- Working with very large screen areas