Title: BNL Batch System
1 - BNL Batch System
- David Yu
- Brookhaven National Laboratory
2 - Sourcing Data from HPSS
- Files are written (sunk) onto tape cartridges sequentially.
- However, file retrieval requests access the tapes randomly, 24 x 7.
- Large numbers of retrieval requests and a large quantity of cartridges, but a limited number of tape drives.
- Extremely high tape mount/dismount rate.
- We need to guarantee that the necessary tape drives are available for migration. How can we guarantee that the drives are available for migration?
3 - Sourcing Data from HPSS
- Oak Ridge Batch is one of the HPSS applications that attempts to provide resource control for tape:
- It stages multiple files by sorting requests by tape.
- It can guarantee drive availability for migration.
- The Oak Ridge Batch System has been modified to fit BNL's requirements.
4 - Oak Ridge Batch
- Like HSI, Oak Ridge Batch aggregates files by tape ID and then stages the files in batch, which improves read performance by reducing tape mounts and dismounts (a minimal sketch of this aggregation follows this slide).
- Not available in HSI: resource control, i.e. logical resource management that throttles tape drive use for reading, thus guaranteeing that drives remain available for migration.
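A minimal Python sketch of the aggregation idea described above, assuming a simple (file_path, tape_id) request list and user-supplied mount/read/dismount callbacks; this is an illustration, not the Oak Ridge Batch code.

    # Group pending stage requests by tape ID so each tape is mounted once
    # and every requested file on it is read during that single mount.
    from collections import defaultdict

    def group_by_tape(requests):
        """requests: iterable of (file_path, tape_id) pairs from HPSS metadata."""
        by_tape = defaultdict(list)
        for path, tape_id in requests:
            by_tape[tape_id].append(path)
        return by_tape

    def stage_in_batches(requests, mount, read, dismount):
        """One mount/dismount per tape instead of one per file."""
        for tape_id, paths in group_by_tape(requests).items():
            mount(tape_id)
            for path in paths:
                read(path)        # files on the same tape are read back-to-back
            dismount(tape_id)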
5 - BNL's Environment
- 3 experiments: Atlas, Phenix, and Star
- Peak stage rates in the last 5 months:
- Atlas 12759 files, 370 tapes (July 14)
- Phenix 12991 files, 580 tapes (Aug 9)
- Star 20530 files, 302 tapes (June 6)
- Average stage 7 TB / day
6 - BNL's Environment
- Since file stage requests randomly access different tapes, real-time access such as pftp is not appropriate when there are so many requests but limited tape drives. HSI does not provide the capability to control the number of drives.
- Due to the nature of tape storage, we need to stage files in batch mode.
- We need resource control to guarantee drive availability for migration.
- The Oak Ridge Batch System was introduced to BNL for resource management a few years ago.
7 - BNL's Environment
- Data sourcing activities run 24x7, drawing randomly from 38 million files, with hundreds of requests per hour.
- BNL's HPSS system receives data from 5 different sources (7 projects).
- BNL has 7 instances of Batch.
- The 7 instances are distributed across 2 servers.
- For resource allocation purposes, some instances use multiple PVRs and some share PVRs.
8 - BNL's Environment
- Multiple instances and multiple PVRs
- [Diagram: Batch instances 1-3 mapped to PVR 1 (9940B), PVR 2 (LTO-3), and PVR 3 (LTO-4).]
9 - BNL's Requirements
- Stability
- Enhance error handling
- Enhance performance
- Support multiple PVRs
- Tape drive resource control, management, and statistics for performance tuning
- Enhance monitoring tools
- Need dynamic configuration without shutting down the process
10 - BNL Batch's New Features
- Supports multiple PVRs. Each PVR is handled independently by a dedicated thread, which throttles the number of drives and the number of requests for that PVR (a minimal sketch of this per-PVR throttling follows this slide).
- Added web-based monitoring tools.
- All instances are monitored and managed by a web-based GUI in real time.
- All transactions are stored in a database, for:
- Tracking purposes (file lost? why? how?).
- Performance analysis, to learn from historical data and fine-tune the configuration.
- Performance analysis, to compare against other Batch instances and see why another user can stage more files.
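A rough illustration (not the actual BNL Batch implementation) of the per-PVR threading model described above: one dedicated worker thread per PVR, each limiting how many tape drives its stage jobs may occupy at once. The class and callback names are invented for this sketch.

    import queue
    import threading

    class PVRWorker(threading.Thread):
        def __init__(self, pvr_name, max_drives, stage_tape):
            super().__init__(daemon=True)
            self.pvr_name = pvr_name
            self.requests = queue.Queue()                  # (tape_id, [files]) batches for this PVR
            self.drives = threading.Semaphore(max_drives)  # drives this PVR is allowed to use
            self.stage_tape = stage_tape                   # callback: mount the tape, read the files

        def _stage(self, tape_id, files):
            try:
                self.stage_tape(self.pvr_name, tape_id, files)
            finally:
                self.drives.release()                      # give the drive back

        def run(self):
            while True:
                tape_id, files = self.requests.get()       # next tape queued for this PVR
                self.drives.acquire()                      # wait for one of the allocated drives
                threading.Thread(target=self._stage,
                                 args=(tape_id, files),
                                 daemon=True).start()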
11 - BNL Batch's New Features
- Configuration can be modified dynamically: debug level, error auto-retry, priority, number of drives, and number of files to stage at the same time. PVRs can be locked/unlocked at the user level or the PVR level.
- Added web-based tools for general historical performance lookups.
- Stage priority is either FIFO (default) or high-demand (sorted by the number of requests on a tape); the sketch after this slide shows the two orderings.
- An optional timeout value can be specified in a request.
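A hedged sketch of the two stage-priority orderings named above, over an assumed request record with submit_time and tape_id fields (not the real BNL Batch schema).

    from collections import Counter

    def order_fifo(requests):
        # Default ordering: oldest request first.
        return sorted(requests, key=lambda r: r["submit_time"])

    def order_high_demand(requests):
        # High-demand ordering: tapes with the most pending requests are served
        # first, so a single mount satisfies as many requests as possible.
        per_tape = Counter(r["tape_id"] for r in requests)
        return sorted(requests, key=lambda r: per_tape[r["tape_id"]], reverse=True)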
12 - BNL Batch Monitoring Tools
13 - BNL Batch Queue View
- Due to the nature of its usage, instance A has requests from all around the world, so the chance of multiple users accessing the same tape in the same timeframe is very low.
- In this example, 157 files are located on 124 tapes, which means the stage will require 124 tape mounts and dismounts.
- Average: 14 mounts / hour / drive.
14 - BNL Batch Queue View
- In instance S's case, 1793 files are on 32 tapes, so it will take only 32 tape mounts/dismounts to complete the stage.
- Average: 4 mounts / hour / drive.
15 - BNL Batch Resource Management
- Tape drives are used for both read and write.
- A PVR may also be shared by multiple Batch instances.
- In the following example, the PVR "Star Raw LTO-3" has a total of 14 drives, but only 7 are allocated to the instance Starrdat.
- We can always adjust the drive allocation when necessary:
- To allow more drives for migration
- To allow other users to use more drives
16 - BNL Batch Resource Management
- BNL Batch Resource Management is not in full control of resource allocation; full control is achieved only when there is no direct user access to HPSS.
- Tape drive allocation can be adjusted manually from the Web GUI, or automatically by scripts.
- When a disk's free space is reaching the threshold, our monitoring scripts automatically reduce the drive allocation for the PVR using that disk. This slows down the growth of disk usage and frees up more drives for HPSS to process migration (a rough sketch of such a script follows this slide).
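A hypothetical sketch of the kind of monitoring script described above; the threshold, reduced allocation, and callback name are assumptions rather than BNL's actual values or interfaces.

    import shutil

    FREE_SPACE_THRESHOLD = 0.10   # assumed: act when less than 10% of the disk is free
    REDUCED_DRIVES = 2            # assumed fallback drive allocation

    def check_disk_and_throttle(disk_path, pvr_name, set_drive_allocation):
        """set_drive_allocation: callback that lowers the PVR's drive limit;
        in BNL Batch such a change takes effect dynamically, without a restart."""
        usage = shutil.disk_usage(disk_path)
        free_fraction = usage.free / usage.total
        if free_fraction < FREE_SPACE_THRESHOLD:
            set_drive_allocation(pvr_name, REDUCED_DRIVES)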
17 - BNL Batch Resource Management
- When we need to lock an entire PVR for maintenance, staging activities for the other PVRs should not be affected. BNL Batch allows you to suspend a PVR (while continuing to queue up requests) without affecting other PVRs' staging activities.
- We can also suspend all staging activities with a single button.
18 - BNL Batch Historical Data Search
- BNL Batch allows you to view/search the status of a request.
- This feature generates a report based on your interest.
- You may use a cron job to download the report for each instance and email it to the subscribed users (a hypothetical example follows this slide).
- For performance reasons, this feature accesses the historical database (aka the Secondary DB).
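A purely hypothetical example of a cron-driven report mailer as described above; the report URL, recipients, and SMTP host are placeholders, not BNL's actual setup.

    import smtplib
    import urllib.request
    from email.message import EmailMessage

    def mail_daily_report(report_url, sender, recipients, smtp_host="localhost"):
        # Fetch the instance's report (e.g. from the web GUI's report page).
        report = urllib.request.urlopen(report_url).read().decode()
        msg = EmailMessage()
        msg["Subject"] = "BNL Batch daily report"
        msg["From"] = sender
        msg["To"] = ", ".join(recipients)
        msg.set_content(report)
        with smtplib.SMTP(smtp_host) as smtp:   # invoked once per day from cron
            smtp.send_message(msg)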
19 - Architecture Overview
- Requirements, design, and implementation
- Use well-established 3rd-party tools to save development time.
- Use MySQL as the central repository.
- Each Batch instance is a MySQL client that updates its status in MySQL from time to time. It also pulls messages from MySQL periodically and processes them accordingly (a minimal polling sketch follows this slide).
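A minimal sketch of the status-update/message-polling loop described above, written against a generic DB-API (PEP 249) MySQL connection; the table and column names are assumptions, not the actual BNL Batch schema.

    import time

    def run_client_loop(conn, instance_id, collect_status, handle_message,
                        poll_seconds=60):
        cur = conn.cursor()
        while True:
            # Push this instance's current status (heartbeat) to the central DB.
            cur.execute(
                "UPDATE batch_status SET status = %s, updated = NOW() WHERE instance = %s",
                (collect_status(), instance_id))
            # Pull pending control messages (config change, suspend, shutdown, ...).
            cur.execute(
                "SELECT id, body FROM batch_messages WHERE instance = %s AND done = 0",
                (instance_id,))
            for msg_id, body in cur.fetchall():
                handle_message(body)
                cur.execute("UPDATE batch_messages SET done = 1 WHERE id = %s", (msg_id,))
            conn.commit()
            time.sleep(poll_seconds)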
20 - Architecture Overview
- [Diagram: a request for File1.txt arrives at Batch Instance 1, which issues an HPSS "get file attribute" call (via DB2) and learns that File1.txt resides on Tape 3348 in PVR B. HPSS manages multiple PVRs (A through G), each holding its own set of tapes.]
21 - Architecture Overview
- [Diagram: Batch Instance 1 stages from several PVRs concurrently, e.g. 2 tapes from PVR A, 5 tapes from PVR B, and 2 tapes from PVR D, with each PVR throttled independently.]
22 - Architecture Overview
- [Diagram: once File1.txt has been staged from tape to HPSS disk, Batch Instance 1 on the BNL Batch server runs a delivery script (PFTP) to copy the file to its destination and a notification script to report that File1.txt has been staged.]
23 - Architecture Overview
- [Diagram: Batch instances 1-7 exchange heartbeat, monitoring, configuration-update, and shutdown messages with the MySQL production database, which the web GUI browsers use for monitoring and for changing configuration. The production database is synchronized hourly to the MySQL historical database, which serves searches, hourly performance queries via MySQL scripts, and cron-driven daily reports.]
24 - BNL's Experience
- The stage activity statistics have been used heavily for fine-tuning performance.
- We proved to our users that small files and random file access are performance killers.
- When a disk is getting full, we no longer need to kill the Batch process.
- When we need to bring down a PVR for maintenance, we no longer have to kill all the Batch processes using that PVR.
25 - BNL's Experience
- When a file is lost, we always have the ability to track the complete history of that request.
- We also have scripts that continuously monitor errors in Batch and send alerts to the system admins when the situation is critical.
26 - BNL's Experience
- BNL has a high demand for fully utilizing the HPSS system, both sinking and staging files, 24 x 7.
- We use BNL Batch to throttle the tape drives by limiting the number of stages, thus guaranteeing that some unused drives remain available for HPSS migration.
- How do other sites satisfy this requirement? Any suggestions?
27 - Questions?
28 - Thank you!