MINERVA: an automated resource provisioning tool for large-scale storage systems


1
MINERVA: an automated resource provisioning tool for large-scale storage systems
  • G. Alvarez, E. Borowsky, S. Go, T. Romer, R.
    Becker-Szendy, R. Golding, A. Merchant, M.
    Spasojevic, A. Veitch, J. Wilkes

2
Large-Scale Storage Systems
  • Very difficult to design and configure
  • 10s-100s of host computers
  • 10s-100s of storage devices
  • 10s-1000s of disks/logical volumes
  • Terabytes of capacity
  • Must meet throughput demands
  • Must maximize capacity utilization
  • Automation would be nice

3
MINERVA
  • Subdivides the problem into three stages:
    • Choose the correct device set
    • Choose the correct configuration parameters
    • Map user data onto the devices
  • The overall problem is NP-hard
  • Architectural elements:
    • Declarative descriptions of storage workload requirements
    • Constraint-based problem representation
    • Optimization strategies and heuristics
    • Analytic performance models

4
MINERVA Inputs
  • Workload description
    • Data type descriptions and access patterns
    • Two object types:
      • Stores: logically contiguous data (e.g., a database table or a filesystem)
      • Streams: sequences of accesses on a store (access pattern and throughput)
  • Device descriptions
    • Disk information (number, size, and type)
    • Array information (number of LUNs)

5
MINERVA Objects
6
MINERVA Outputs
  • Assignment
    • Device set taken from the device descriptions
    • Mapping of stores to devices
    • 2^n · n^m possible configurations
    • O((2m)^m) complexity
  • Goal: the minimum-cost system that meets the performance requirements
  • Effector tool
    • Takes the assignment as input
    • Automatically configures the physical devices
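
To get a feel for those bounds (a hedged illustration; the figures below are ours, not the paper's): with n = 10 candidate LUNs and m = 30 stores, the store-to-LUN mappings alone number n^m = 10^30, so even evaluating 10^9 candidates per second would take roughly 10^21 seconds, vastly longer than the age of the universe. Heuristic search is therefore unavoidable.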

7
Storage System Lifecycle
8
Architecture
  • Array allocation
    • Tagger: assigns a preferred RAID level
    • Allocator: determines the number of arrays
  • Array configuration
    • Array Designer: actually configures the arrays
  • Store assignment
    • Solver: assigns stores to LUNs
    • Optimizer: prunes unused resources and balances load
    • Evaluator: verifies the design with analytic models

9
Architecture
10
MINERVA Process
11
Analytical Device Models
  • Used to determine the feasibility of a design
  • Predicted throughput error rate within 20%
  • Streams
    • Modeled as an ON-OFF Markov-modulated Poisson process
  • Arrays
    • Modeled as array controller, bus connection, and disks
  • Case study: HP SureStore Model 30/FC (FC-30) high-availability disk array
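
As a rough illustration of the stream model, here is a minimal Python sketch of an ON-OFF Markov-modulated Poisson process; the rates and sojourn times are invented for illustration and are not parameters from the paper.

    import random

    def simulate_on_off_mmpp(rate_on, mean_on, mean_off, duration):
        """Generate request arrival times from a two-state (ON-OFF)
        Markov-modulated Poisson process: Poisson arrivals at rate_on
        while ON, silence while OFF, exponential sojourn times."""
        t, on, arrivals = 0.0, True, []
        while t < duration:
            sojourn = random.expovariate(1.0 / (mean_on if on else mean_off))
            if on:
                nxt = t + random.expovariate(rate_on)
                while nxt < t + sojourn:          # arrivals within the ON burst
                    arrivals.append(nxt)
                    nxt += random.expovariate(rate_on)
            t += sojourn
            on = not on
        return arrivals

    # Example: a bursty stream, ~50 req/s during 1 s ON bursts
    print(len(simulate_on_off_mmpp(rate_on=50, mean_on=1.0, mean_off=3.0, duration=60)))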

12
Tagger
  • Chooses a storage class for each store based on its access pattern
    • RAID 1/0 or RAID 5
  • Rule based
  • Determines which stores are capacity bound
  • Estimates the average number of I/O operations per second (IOPS)

13
Capacity Rules
  • Rules are calculated per GB of storage
  • Capacity-bound stores are tagged RAID 5

14
IOPS Estimation
  • Choose the RAID level that requires the fewest per-disk IOPS
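
A minimal sketch of how slides 12-14 fit together, assuming the textbook back-end costs (a mirrored write costs 2 physical I/Os under RAID 1/0, a small RAID 5 write costs 4 for the read-modify-write); the threshold, disk count, and function names are illustrative, not MINERVA's actual rules.

    def per_disk_iops(read_rate, write_rate, n_disks, raid_level):
        """Estimate per-disk IOPS for a store under a given RAID level,
        using classic small-write penalties (assumption, see above)."""
        if raid_level == "RAID1/0":
            physical = read_rate + 2 * write_rate   # mirror: writes doubled
        elif raid_level == "RAID5":
            physical = read_rate + 4 * write_rate   # read-modify-write
        else:
            raise ValueError(raid_level)
        return physical / n_disks

    def tag_store(read_rate, write_rate, capacity_gb,
                  iops_per_gb_threshold=10.0, n_disks=6):
        """Tag capacity-bound stores RAID 5; otherwise pick the RAID
        level with the fewest per-disk IOPS (illustrative threshold)."""
        if (read_rate + write_rate) / capacity_gb < iops_per_gb_threshold:
            return "RAID5"                           # capacity bound
        return min(("RAID1/0", "RAID5"),
                   key=lambda r: per_disk_iops(read_rate, write_rate, n_disks, r))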

15
Allocator
  • Chooses a reasonable set of arrays
  • Three steps:
    • Consider the type and number of arrays
    • Consider array configurations
    • Consider LUN divisions and RAID configurations

16
Allocator models
  • Can only use the analytic device models
    • Ignores stream phasing
  • The Rillifier handles outsized resource demands
    • Distributes a workload among multiple LUNs
    • Stores with excessive capacity requirements become shards
    • Streams with excessive throughput requirements become rills

17
Allocator Search
  • Uses a branch-and-bound strategy
  • Determines the number of arrays of each type
    • Chooses the lowest-cost set that supports the workload
  • Searches array configurations
    • Starts with mixed arrays
    • Iteratively converts arrays to dedicated types
  • Branch and bound is biased toward dedicated types
    • Searches in reverse order, starting with dedicated types
  • Calls the Array Designer with each candidate configuration
    • If the Array Designer fails, the search continues
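
A minimal branch-and-bound sketch in the spirit of this search; the `fits` callback stands in for the Array Designer's feasibility check, and all prices and bounds are invented for illustration.

    def cheapest_allocation(array_prices, max_per_type, fits):
        """Branch on the number of arrays of each type; bound by cost.
        fits(counts) -> True if the designer can configure `counts`
        arrays per type to support the workload (assumed callback)."""
        best = {"cost": float("inf"), "counts": None}

        def search(i, counts, cost):
            if cost >= best["cost"]:       # bound: cannot beat current best
                return
            if i == len(array_prices):
                if fits(counts):           # leaf: ask the (stand-in) designer
                    best["cost"], best["counts"] = cost, list(counts)
                return
            for n in range(max_per_type + 1):
                counts.append(n)
                search(i + 1, counts, cost + n * array_prices[i])
                counts.pop()

        search(0, [], 0.0)
        return best["counts"], best["cost"]

    # Example: two array types; the lambda is a toy feasibility test
    print(cheapest_allocation([40.0, 15.0], 3, lambda c: 3 * c[0] + c[1] >= 5))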

18
Array Designer
  • Determines LUN sizes and array parameters
  • Starts with the simple case of equal-size LUNs
  • Also considers a greedy configuration
    • The workload description determines the LUN sizes
  • Relies on the Optimizer to take care of unused capacity
  • Disks are assigned to LUNs round-robin across buses
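
A sketch of the round-robin placement across buses; the data shapes are assumptions.

    from itertools import cycle

    def assign_disks_round_robin(disks, buses):
        """Spread disks over buses in round-robin order so no single
        bus becomes a bandwidth hot spot."""
        placement = {bus: [] for bus in buses}
        for disk, bus in zip(disks, cycle(buses)):
            placement[bus].append(disk)
        return placement

    # 10 disks over 4 buses -> groups of sizes 3, 3, 2, 2
    print(assign_disks_round_robin(list(range(10)), ["bus0", "bus1", "bus2", "bus3"]))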

19
Solver
  • Assigns stores to LUNs
  • Multidimensional constrained bin packing
  • Uses the analytic device models to evaluate the objective function
  • Constraints:
    • LUN capacity
    • LUN phased utilization
    • Array bus bandwidth
    • Array controller utilization
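
A minimal first-fit sketch of the constrained packing, keeping only two of the four constraint dimensions (LUN capacity and utilization) for brevity; the data shapes are assumptions.

    def first_fit(stores, luns):
        """Assign each store to the first LUN with enough remaining
        capacity and utilization headroom; None marks a failure.
        stores: (capacity_gb, utilization) demand pairs
        luns:   dicts with remaining 'cap' (GB) and 'util' (fraction)"""
        assignment = []
        for cap, util in stores:
            for i, lun in enumerate(luns):
                if lun["cap"] >= cap and lun["util"] >= util:
                    lun["cap"] -= cap
                    lun["util"] -= util
                    assignment.append(i)
                    break
            else:
                assignment.append(None)    # no LUN could absorb this store
        return assignment

    luns = [{"cap": 100.0, "util": 1.0}, {"cap": 100.0, "util": 1.0}]
    print(first_fit([(60, 0.5), (60, 0.3), (30, 0.4)], luns))  # [0, 1, 0]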

20
Solver Heuristics
  • Simple Random
    • 50 random cases using first fit
  • Toyoda
    • Best fit using a gradient function
    • Objective function combined with economic utilization:
      gradient = 1 / (penalty * lun_cost)
    • Favors LUNs that are already in use or low cost
    • LUNs are filled in order of increasing cost
    • Minimizes resource contention

21
Solver Heuristics 2
  • ToyodaWeighted
    • Maps gradients against the remaining available resources
    • Maps stores to LUNs such that utilization is balanced
    • objective_function = cos(a), where a is the angle between the store's demand vector and the LUN's remaining-resource vector
    • objective_function = max_lun_cost - lun_cost
    • Minimizes cost
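
A sketch of the cosine gradient; combining the fit and the cost objective by multiplication is our assumption, not the paper's exact formula.

    import math

    def cos_gradient(demand, remaining):
        """cos(a), a being the angle between the store's resource-demand
        vector and the LUN's remaining-resource vector; high values mean
        the store's shape matches the LUN's free resources."""
        dot = sum(d * r for d, r in zip(demand, remaining))
        norms = (math.sqrt(sum(d * d for d in demand)) *
                 math.sqrt(sum(r * r for r in remaining)))
        return dot / norms if norms else 0.0

    def pick_lun(demand, luns, max_lun_cost):
        """Pick the LUN maximizing cosine fit weighted by the cost
        objective (max_lun_cost - cost), favoring cheap LUNs."""
        return max(range(len(luns)),
                   key=lambda i: cos_gradient(demand, luns[i]["remaining"]) *
                                 (max_lun_cost - luns[i]["cost"]))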

22
Toyoda and ToyodaWeighted
23
Optimizer
  • Reruns the Solver against the existing configuration
    • Reduces the number of required arrays
  • Runs ToyodaWeighted with a new objective function:
    • objective_value = 1 - lun_utilization
    • Assigns stores to underutilized LUNs
  • Variations
    • Simple Random: randomized first fit; keeps the result with the lowest utilization variance
    • Simple Balanced: round-robin first fit, based on capacity and utilization constraints
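
A sketch of the Simple Random variant: randomized first fit repeated over several orderings, keeping the run with the lowest per-LUN utilization variance. The trial count and data shapes are illustrative.

    import random
    from statistics import pvariance

    def randomized_first_fit(stores, n_luns, util_limit=1.0, trials=50):
        """Run first fit over random store orderings; keep the most
        balanced feasible assignment (lowest utilization variance)."""
        best_var, best = float("inf"), None
        for _ in range(trials):
            utils = [0.0] * n_luns
            feasible = True
            for u in random.sample(stores, len(stores)):
                for j in range(n_luns):            # first LUN with headroom
                    if utils[j] + u <= util_limit:
                        utils[j] += u
                        break
                else:
                    feasible = False
                    break
            if feasible and pvariance(utils) < best_var:
                best_var, best = pvariance(utils), utils[:]
        return best, best_var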

24
Clusterer
  • Addresses performance-scaling issues
    • With many stores, runtime grew to days
  • Combines multiple stores into a cluster
    • The cluster is mapped instead of the individual stores
  • Cluster limits are based on observation:
    • 10 MB/s bandwidth
    • 2 GB size
  • Increases system cost by about 3%
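
A sketch of the clustering rule, greedily merging stores until either limit above would be exceeded; the greedy order and data shapes are assumptions.

    def cluster_stores(stores, max_bw_mbs=10.0, max_size_gb=2.0):
        """Greedily pack (bandwidth_mbs, size_gb) stores into clusters
        capped at 10 MB/s and 2 GB, so the Solver maps far fewer objects."""
        clusters, cur, bw, sz = [], [], 0.0, 0.0
        for b, s in stores:
            if cur and (bw + b > max_bw_mbs or sz + s > max_size_gb):
                clusters.append(cur)               # close the full cluster
                cur, bw, sz = [], 0.0, 0.0
            cur.append((b, s))
            bw, sz = bw + b, sz + s
        if cur:
            clusters.append(cur)
        return clusters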

25
Evaluation
  • Validate the analytic models' performance predictions
  • Evaluate sensitivity to workload changes
  • Evaluate the effect of design changes
  • Measure a live system

26
Model Validation
  • Based on a single FC-30 array
  • Ran performance tests on the physical system
  • Compared the results against the model predictions
  • Results showed a mean error rate of 5.4%
    • Range of -11% to +19%

27
Safety and Sensitivity
  • Examined scaling of workload parameters
  • Start with a baseline workload, then modify a single parameter
  • The baseline was designed to have three properties:
    • A mix of appropriate RAID levels
    • A non-trivial number of arrays (≥ 2)
    • Balanced store performance requirements

28
Scaling Store Size and Bandwidth
  • Store size scaling
    • The system becomes capacity bound
    • Creates RAID 5 LUNs
    • System size scales linearly with store size
  • Bandwidth scaling
    • The ratio of RAID 1/0 to RAID 5 increases linearly

30
Scaling Number of Stores
  • The number of arrays scales linearly with the number of stores

31
Running time
  • Runtime increases quadratically with the number of stores

32
Workload Variability
  • Workload attributes drawn randomly from a log-normal distribution
    • Baseline values serve as the distribution means
  • Capacity utilization drops with increased variability
    • The number of RAID 5 LUNs increases
    • Segmentation increases

33
Workload Variance
34
Whole System Validation
  • MINERVA vs. a human expert
  • Three aspects:
    • Comparison of resulting system cost
    • Comparison of application performance
    • Low runtime and minimal human interaction
  • Based on the TPC-D benchmark
    • A decision-support benchmark built on database queries
  • Human designers drawn from an HP system benchmarking team

35
Execution Times