Title: Optimization algorithms for proteinprotein interactions
1Optimization algorithms for protein-protein
interactions
- Rahul S. Sampath
- George Biros
- University of Pennsylvania
2Overview
- Electrostatic protein-protein interactions
- VCP SPICE
- Poisson-Boltzmann Equation (PBE)
- Sensitivity analysis
- Fast Algorithms
- Preliminary results
3Electrostatics protein-protein interactions
- Need to study protein-protein interactions
- Vital for many biological processes
- Immune system pathogen mechanisms
- Application in drug design
- Why is electrostatics important?
- Many bio-molecules are charged
- Electrostatic force is long-ranged
- Initiate binding
- Sometimes the only dominant force
4Apps for Protein-Protein Simulations
- Protein Assembly
- Will 2 proteins bind?
- Binding Sites?
- Shape of complexes?
- Describe thermodynamics of binding
- Protein Design
- Design mutants (slightly different sequence)
- Complementary proteins Facilitate binding
- Example VCP/SPICE
5VCP SPICE
- Variola smallpox causing virus
- Vaccinia used in smallpox vaccine
- VCP Vaccinia complement control protein
- SPICE Smallpox inhibitor of complement enzymes
(genetically engineered Variola protein) - Complement proteins Immune system
6VCP SPICE
- SPICE gt 90 similar to VCP
- SPICE gt 1000 times as powerful as VCP in
controlling complement proteins - Small proteins electrostatics likely dominant
7Electrostatic Potential around VCP,SPICE their
Mutants (Lambris et al. )
- Mutants selected randomly
- Potential distribution using Coulomb's law (Swiss
PDB Viewer) - Good correlation with experiment
8Shortcomings of the current approach
- All mutations done experimentally
- No deterministic method to select mutants
- Could we predict sites for mutations numerically?
- Modeling errors
- Atomistic vs continuous
- Coulomb's vs Poisson-Boltzmann
- Solvent Implicit vs explicit
9Atomistic Vs Continuum approx.
- Atomistic model Molecular Dynamics
- Expensive
- Small sized problems
- Continuum Model Poisson-Boltzmann Equation
10Treating Solvent as a Continuum
- Explicit Solvent simple
- Solvent treated separately
- Implicit Solvent more realistic, complex
11Implicit Solvent Model Solvent Accessibility
Free Region (?f) Set of centers of the solvent
probes which do not intersect any of the
atoms. Solvent Accessible Region (?s) Union of
volumes of the solvent probes with centers in
?f.
2-D view of Protein in solvent Courtesy Michael
Levitt Britt H Park (1993)
12Building the material properties from the geometry
- PDB file gives coordinates of centers of atoms
- PDB2PQR adds additional data charges radii
- Build a domain ¼ 3 times protein size. Generate
grids. - Protein is partitioned among different processors
- ?r , ? in each element are average of nodal
values. - Every node in the mesh is either in ?f or ?s or
?p.
VCP
13The Poisson-Boltzmann Equation
- Poisson-Boltzmann Equation (PBE)
14The Linearized Poisson-Boltzmann Equation
- Linearized Poisson-Boltzmann Equation (LPBE)
15Difficulties with PBE
- Nonlinearity
- Dirac-delta functions as sources
- Complicated geometry
- Material discontinuity
16Estimation of mutation sites
Introduce auxiliary objective function
17Adjoint-Sensitivity Operator for PBE
18More on the Adjoint Formulation
- Linear
- Independent of the perturbation term
- Solve the adjoint problem once Vs solving for
sensitivity many times - Essentially combinatorial
- q not arbitrary
- 20 amino-acids to choose from
19Solving the PBE
- Nonlinear solver
- Inexact Newton iterative linear solvers
- PBE is an elliptic equation
- Multigrid (MG) works well with elliptic equations
- A Multigrid Tutorial, Briggs et al.
- BlackBox Multigrid, Dendy et.al.
- Baker et.al., APBS. IBM J. R D (2001)
- Baker et.al., Electrostatics of nano-systems
Application to microtubules and the ribosome.
PNAS (2001)
20Full Multi-grid Algorithm
Fig. CourtesyPaul Heckbert
21What is new in our approach?
- Matrix-Free implementation
- Developed on top of PETSc
- Gives a lot of handles to change the smoothers
- Extend PETSc restriction/prolongation using
matrix-free operators to reduce memory usage - Application to sensitivity analysis
22Matrix Free Methods
- Problem of storage for large problems
- Creating and destroying matrices is expensive
- Iterative solvers only need the result of a
Matrix-Vector multiplication, not the matrix
itself
23Implementing a Matrix Free Multi-grid
- Matrix-free Restriction and Interpolation
- Coarse grid operator in Multi-grid
- A2h R Ah P (Galerkin Coarsening)
- No Explicit Galerkin Coarsening for Matrix-Free
24Components in the algorithm
- Use PDB2PQR to add missing data from PDB.
- Partition the protein.
- Assign material properties.
- Solve Main problem MG (LPB) or Newton-MG (NLPB)
- Solve Adjoint problem MG (LPBy) or Newton-MG
(NLPBy) - Generate VTK files display using Paraview.
- Solve optimization problem (To be implemented)
25Approximation of Jacobians in each Element
- J1 and J2 LPBE
- J3 NLPBE Adjoint equations.
- J3 expensive, poor MG performance. J4 ¼ J3
- The residual is still computed exactly.
26Performance of the approx Jacobian
- The exact jacobian does not converge
- Failure to satisfy the Galerkin condition
- Must use Matrix-based explicit Galerkin approach
- Approx jacobian converges
- Performace does not deteriorate with mesh size
- 643, 73 iterations, 13 min on 1 processor
- 1283, 70 iterations, 120 min on 1 processor
27Matrix based DD Vs Matrix free MG
- MFMG outperforms DD for large problems
- MFMG is asymptotically cheaper
- Performance of DD deteriorates, not for MG
-
28Scalability for more processors
29Potential Iso-contours
Generated Using Paraview LPB (Blue , Red
-) NLPB (Green , Yellow -)
Using Swiss-PDB viewer Coulomb's law
30Acknowledgements
- John D. Lambris, Department of Pathology
Laboratory Medicine, University of Pennsylvania. - PETSc Team
- The computations were performed in part on the
National Science Foundation HP GS1280 system at
the Pittsburgh Supercomputing Center. - The other members of our group Santi, Shravan,
Cosmina, Hari and Bharath.
31