Title: Ocean Circulation: Sink or Swim
1COMPUTATIONAL SCIENCES BROWN BAG SEMINAR
Ocean Circulation Sink or Swim
Danny Thorne
UK Department of Computer Science
Tuesday, April 17, 2001
Supported in part by Sandia National Laboratory,
Lawrence Livermore National Laboratory, and the
National Science Foundation. Nuke em if they
cant take a joke.
2Craig C. Douglas University of Kentucky and Yale
University
Gundolf Haase Johannes KeplerUniversity-Linz
- Mohamed Iskandarani
- University of Miami
Danny Thorne University of Kentucky and Sandia
National Laboratory
Michael Sheetz University of Kentucky
3Outline
This talk chronicles the
- Past
- Background
- Single Layer
- Multiple Layer
- Present
- Subgrid Structuring Techniques
- Simultaneous Iteration
- Lumped Schur Complement
- Future
- C
- 3D
- Non-Hydrostatic
of the Spectral Element Ocean Model (SEOM).
4(No Transcript)
5(No Transcript)
6Difficulties of Ocean Modeling
- Stratification
- Rotation
- Complicated basins
- Small aspect ratio (depth ltlt L)
- Turbulence (jets, eddies, fronts)
- Multiple time and length scales
- Poorly known I.C. and B.C.
7Advantages of Spectral Elements
- h-p finite element method
- Dual convergence paths
- Geometric Flexibility
- Unstructured grids
- Accuracy
- Low numerical dispersion and dissipation errors
- Scalability
- Dense Computational Kernels, O(KN3)
- Sparse neighbor-neighbor communications
8(No Transcript)
9Unstructured Grids
- Complicated ocean basins
- No masking
- Local (elemental) mapping only
- Multiscale Simulations
- Increase resolution where needed
- Regional-Global simulations possible in single
setting - Avoid open boundary conditions
10North East Pacific Grid
11Grid Spacing (km)
12Elements Collocation points
13Disadvantages(TANSTAAFL!)
- Stricter CFL stability than FD
- Gibbs oscillations near fronts
- Global conservation only
- More CPU intensive than FD/grid point
14SEOM Versions and Applications
- Single Layer
- No stratification effects
- Solution is the depth-mean flow
- 1.5 layer (wind circulation/abyssal flow)
- Global long period tides
- Estuarine modeling
- Multiple Layers
- Vertical coordinate is density (isopycnal)
- Vertical stack of single layers
- Wind driven circulation, 2-5 layers (soon up
to16)
15Motivation for Layered SEOM
- Mathematically simpler than SEOM-3D
- Computationally simpler and faster
- No cross isopycnal diffusion
- No pressure gradient errors
- Baroclinic processes possible with 2 layers
- Eddy resolving simulations can be produced
- relatively easily and cheaply
16Current Work
/ TODO Outline /
17Filtering
- Each layer has to solve
- , filtered vorticity
- , filtered divergence field
- The filtering is done by series expansion and the
Boyd-Vandeven filter in each spectral element. - Solve on each of the 5 layers
and
18So, What Is the Problem?
- Symmetric, positive definite matrix
- Not a M-matrix
- Really, really huge
Many parallel solvers available
but
Memory requirements vs. solution time
19Spectral Element
- Gauss-Lobatto discretization
- Element is the support of inner node f.e. basis
functions
- Inner nodes
- Boundary nodes
- consisting of Edge nodes
- Vertex nodes
20System of Equations
- Spectral element discretization
- Solve 10 times the system of equations
- Block structure
- where
- and the are
symmetric.
21Schur Complement-cg
- Solve Laplacian equations by Schur Complement
Conjugate Gradient (cg) - Preconditioner
- Take advantage of the structure of the spectral
elements
22Factor matrix
- Factorization of
- Schur complement
- Matrices are stored.
23Schur Complement and Basis Transformation
- Defining the exact harmonic basis (EHB)
transformation - the Schur complement can be reinterpreted as
- i.e., Galerkin approach.
24Schur Complement cg
25Schur Complement Preconditioner I
- Again, we can factor
such that - BUT (with j
counter of elements/edges/)
26Schur Complement Preconditioner II
- Replace with
- linear interpolation from vertices
onto an edge j
27Schur Complement Preconditioner III
- Calculate element-wise
- Approximate by
- is on edge j
Dryja - Derive directly
by symbolic methods - Bramble/Pasciak/Schatz
28Schur Complement-pcg
29Vertex Node System
- is equivalent to a
- (non-constant) 9-point
stencil - Solve directly (gather on one processor)
- Combine with parallel AMG (PEBBLES)
- Special cache optimized and parallel AMG/MG for
9-point stencil ()
30Memory Requirements
- Laplacian in 2D
- Small example 99 elements, 5146 nodes
- M O(nelem)
- M(Schur-cg) 2.35 MB
- M(Schur-cg,pc) 2.36 MB
31Memory requirements
99 elements, 5146 nodes
32Simultaneous Iterations
Filtering requires in each time step
5 layers
Solve 10 Laplacian equations at once! Less
memory accesses, better cache usage
33Improvements by Sim. Solving
2.05 faster on R10k 1.75 faster on R14k
Filtering only 15 WCPU
3470 of Computing Time
is spent in solving
Note
35System of Equations
Diagonal matrix
36Schur Complement System (Pressure)
Solve via precond. cg...
37Solving of Pressure System
Solve via pcg
- No matrices stored !
- gt Only matrix-vector
operation available.
38Preconditioning of Pressure System
- New preconditioner (Lumped Schur Complement)
39Further Improvement by Lumped Schur Complement
Speedup 1.28 on R10k 1.22 onR14k
40Future of SEOM
- SEOM
- SEOM is currently a big, bad Fortran 90 code.
- Fortran is a dying language.
- Fortran 90 should be avoided at all costs.
- BYON
- Distributed Preprocessing Stage
- Vanishing Layer Thickness
- Non-Hydrostatic Primitive Equations
- 3D Continuous Stratification
- Integration of PEBBLES, a parallel algebraic
- multigrid solver from Linz.
41SEOM Versions and Applications, Contd
- 3D Continuous Stratification
- density is a dependent variable
- low order SE discretization in vertical
- terrain following
- gravitational adjustment
- overflow
- flow rectification in a laboratory canyon
- test problems available on marine.rutgers.edu/po
42Thank You