File Classification in self-* storage systems

Learn more at: https://users.cs.northwestern.edu

1
File Classification in self-* storage systems

Michael Mesnier, Eno Thereska, Gregory R. Ganger,
Daniel Ellard, Margo Seltzer

2
Introduction

Self-* infrastructures need information about:
Users
Applications
Policies
This information is not readily provided, and the system cannot depend on
users to supply it
So it must be learned

3
Self-* storage systems

A sub-problem of the self-* infrastructure
The key is to get hints from what creators
associate with their files:
File size
File names
Lifetimes
Once intentions are determined, decisions can be made
Result: better file organization and performance

4
Classifying Files

Current policy selection relies on rules of thumb
Generic, not optimized
Better to distinguish classes of files
Allows finer-grained policies
Policies are ideally assigned at file creation
So classes must be determined at creation
A self-* system must learn this association from
1) traces 2) a running file system

5
So, how?

Create a model that classifies files based on (some
of) their attributes:
Name
Owner
Permissions
Irrelevant attributes must be filtered out
The classifier must learn rules to do so
Based on a training set
Then inference happens
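
One common way to decide which attributes are irrelevant is to rank them by information gain: how much knowing the attribute reduces uncertainty about the class. The sketch below is only an illustration of that idea, not necessarily the method the authors use; the sample files, the attribute names (`ext`, `owner`, `mode`), and the `size_class` label are hypothetical.

```python
import math
from collections import Counter, defaultdict

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def information_gain(samples, attribute, target):
    """Entropy reduction on `target` after splitting `samples` by `attribute`."""
    groups = defaultdict(list)
    for s in samples:
        groups[s[attribute]].append(s[target])
    remainder = sum(len(g) / len(samples) * entropy(g) for g in groups.values())
    return entropy([s[target] for s in samples]) - remainder

# Hypothetical training samples: create-time attributes plus an observed class.
samples = [
    {"ext": ".log",  "owner": "alice", "mode": "0644", "size_class": "large"},
    {"ext": ".log",  "owner": "bob",   "mode": "0644", "size_class": "large"},
    {"ext": ".lock", "owner": "alice", "mode": "0644", "size_class": "zero"},
    {"ext": ".lock", "owner": "bob",   "mode": "0644", "size_class": "zero"},
]

gains = {a: information_gain(samples, a, "size_class")
         for a in ("ext", "owner", "mode")}
# Attributes with (near-)zero gain carry no signal for this class.
ranked = sorted(gains, key=gains.get, reverse=True)
```

In this toy sample the file extension separates the classes completely, while owner and permissions add nothing and could be dropped.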

6
The right model

Model must be
Scalable
Dynamic
Cost-sensitive (mis-prediction cost)
Interpretable (human)
Model selected: decision trees

7
ABLE

Attribute-Based Learning Environment (ABLE)
1. Obtain traces
2. Build a decision tree
3. Make predictions
The tree is built top-down: the sample is split until the leaves
hold files with similar attributes or all attributes are used
After the tree is created, querying begins
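
The three steps above can be sketched with a small ID3-style tree builder. This is a minimal illustration under stated assumptions, not ABLE's actual implementation: the splitting criterion (lowest weighted entropy), the trace records, and the names `build_tree` and `predict` are all hypothetical.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def build_tree(samples, attributes, target):
    """Top-down induction: split until a leaf is pure or no attributes remain."""
    labels = [s[target] for s in samples]
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or not attributes:
        return majority  # leaf: the predicted class

    def remainder(attr):
        # Weighted entropy of the target after splitting on `attr`.
        groups = {}
        for s in samples:
            groups.setdefault(s[attr], []).append(s[target])
        return sum(len(g) / len(samples) * entropy(g) for g in groups.values())

    best = min(attributes, key=remainder)
    rest = [a for a in attributes if a != best]
    branches = {}
    for value in {s[best] for s in samples}:
        subset = [s for s in samples if s[best] == value]
        branches[value] = build_tree(subset, rest, target)
    return {"attr": best, "branches": branches, "default": majority}

def predict(tree, file_attrs):
    """Walk the tree using a new file's create-time attributes."""
    while isinstance(tree, dict):
        value = file_attrs.get(tree["attr"])
        tree = tree["branches"].get(value, tree["default"])
    return tree

# Step 1: a (hypothetical) trace of files with an observed lifetime class.
trace = [
    {"ext": ".tmp", "owner": "alice", "lifetime": "short"},
    {"ext": ".tmp", "owner": "bob",   "lifetime": "short"},
    {"ext": ".c",   "owner": "alice", "lifetime": "long"},
    {"ext": ".c",   "owner": "bob",   "lifetime": "long"},
]
tree = build_tree(trace, ["ext", "owner"], "lifetime")           # step 2
prediction = predict(tree, {"ext": ".tmp", "owner": "carol"})   # step 3
```

A nested dictionary keeps the tree easy to print and inspect, which fits the requirement that the model be interpretable by humans.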

8
Tests

Based on several systems to make sure the approach is
workload-independent:
DEAS03
EECS03
CAMPUS
LAB
The control, the MODE algorithm, places all files in
a single cluster
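
A control of this kind can be sketched as a predictor that ignores every attribute. The sketch assumes that MODE means "always predict the most common class seen in training", which is how the name reads; the slide itself only says that all files land in one cluster. The sample data and function names are hypothetical.

```python
from collections import Counter

def mode_predictor(training_labels):
    """Return a predictor that ignores attributes and always answers the mode."""
    mode = Counter(training_labels).most_common(1)[0][0]
    return lambda file_attrs: mode

def accuracy(predictor, samples, target):
    """Fraction of samples whose target class the predictor gets right."""
    hits = sum(predictor(s) == s[target] for s in samples)
    return hits / len(samples)

# Hypothetical samples: three short-lived files and one long-lived file.
samples = [
    {"ext": ".tmp", "lifetime": "short"},
    {"ext": ".tmp", "lifetime": "short"},
    {"ext": ".o",   "lifetime": "short"},
    {"ext": ".c",   "lifetime": "long"},
]
baseline = mode_predictor([s["lifetime"] for s in samples])
baseline_accuracy = accuracy(baseline, samples, "lifetime")
```

Any attribute-based model has to beat this number to show that the attributes carry useful information.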

9
Results

Prediction results are quite good
90-100% accuracy claimed
Clustering of files by attributes is clear
The authors predict that a model's ruleset will converge over
time

10
Benefits of incremental learning

Dynamically refines model as samples become
available
Generally better than one-shot learners
Sometimes one-shot performs poorly
Rulesets of incremental learners are smaller
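
The difference between the two kinds of learner can be illustrated with a deliberately simple stand-in model. The sketch below uses a per-extension frequency table rather than a decision tree, purely to keep it short; the class `CountingClassifier`, the stream of samples, and the workload shift are all hypothetical.

```python
from collections import Counter, defaultdict

class CountingClassifier:
    """Predicts the most frequent class seen so far for an attribute value."""

    def __init__(self):
        self.counts = defaultdict(Counter)  # per-attribute-value class counts
        self.overall = Counter()            # fallback for unseen values

    def update(self, key, label):
        self.counts[key][label] += 1
        self.overall[label] += 1

    def predict(self, key):
        seen = self.counts.get(key) or self.overall
        return seen.most_common(1)[0][0] if seen else None

# Hypothetical stream: ".dat" files start out short-lived, then the workload shifts.
stream = [(".dat", "short")] * 2 + [(".dat", "long")] * 6

# One-shot learner: trained once on the first two samples, never updated.
one_shot = CountingClassifier()
for ext, label in stream[:2]:
    one_shot.update(ext, label)

# Incremental learner: predicts, then refines itself with every sample.
incremental = CountingClassifier()
one_shot_hits = incremental_hits = 0
for i, (ext, label) in enumerate(stream):
    if i >= 2:  # score only samples the one-shot model never trained on
        one_shot_hits += one_shot.predict(ext) == label
        incremental_hits += incremental.predict(ext) == label
    incremental.update(ext, label)
```

After the shift the one-shot model keeps answering "short", while the incremental model catches up once enough new samples have arrived.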

11
On accuracy

More attributes: greater chance of over-fitting
More rules -> smaller ratios
Loses compression benefits
Predictive models can make false predictions
These can impact performance
Things that should be in RAM are placed on disk
instead, etc.
Solution: cost functions
Penalize errors
Create a biased tree
System goals will need to be translated into cost functions
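
One standard way to bias a tree with a cost function is to label each leaf with the class that minimizes total misprediction cost rather than the majority class. The slides do not say how the authors build their biased tree, so this is only a sketch of the general idea; the class names, the leaf counts, and the cost values are hypothetical.

```python
def min_cost_label(class_counts, cost):
    """Pick the label with the lowest total misprediction cost at a leaf.

    class_counts: {actual_class: number of training files in the leaf}
    cost: {(predicted, actual): penalty}; missing pairs cost 0
    """
    def total_cost(predicted):
        return sum(n * cost.get((predicted, actual), 0)
                   for actual, n in class_counts.items())
    return min(class_counts, key=total_cost)

# Hypothetical leaf: 8 cold files and 2 hot files ended up here.
leaf = {"cold": 8, "hot": 2}

# Hypothetical costs: sending a hot file to disk hurts far more than
# wasting RAM on a cold file.
cost = {("cold", "hot"): 10, ("hot", "cold"): 1}

majority_label = max(leaf, key=leaf.get)   # unbiased choice
biased_label = min_cost_label(leaf, cost)  # cost-sensitive choice
```

Here the majority label is "cold", but predicting "cold" costs 2 x 10 = 20 while predicting "hot" costs 8 x 1 = 8, so the biased leaf answers "hot". Translating a system goal into the model amounts to choosing the numbers in `cost`.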

12
Conclusion

These trees provide prediction accuracies in the
90% range
Adaptable via incremental learning
Continued work: integration into self-*
infrastructure

13
Questions?
