Bloom Filters - PowerPoint PPT Presentation

Slides: 24

Provided by: cise8

Learn more at: https://www.cise.ufl.edu

Transcript and Presenter's Notes

Title: Bloom Filters

1
Bloom Filters

Very fast set membership test.
Is x in S?
Answer is No or Maybe.
False positive.
Response is Maybe but should have been No.
Minimize false positive rate.

2
Differential Files

Simple large database.
Collection/file of records residing on disk.
Single key.
Index to records.
Operations.
Retrieve.
Update, which is one of:
Insert a new record.
Make changes to an existing record.
Delete a record.

3
Naïve Mode Of Operation

Problems.
Index and File change with time.
Sooner or later, system will crash.
Recovery =>
Copy Master File (MF) from backup.
Copy Master Index (MI) from backup.
Process all transactions since last backup.
Recovery time depends on MF & MI size and # of transactions since last backup.

4
Differential File

Make no changes to master file.
Alter index and write updated record to a new
file called differential file.

5
Differential File Operation

Advantage.
DF is smaller than File and so may be backed up
more frequently.
Index needs to be backed up whenever DF is. So,
index should be small as well.
Recovery time is reduced.

6
Differential File Operation

Disadvantage.
Eventually DF becomes large and can no longer be
backed up with desired frequency.
Must integrate File and DF now.
Following integration, DF is empty.

7
Differential File Operation

Large Index.
Index cannot be backed up as frequently as
desired.
Time to recover current state of index & DF is
excessive.
Use a differential index.
Make no changes to Index.
DI is an index to all deleted records and records
in DF.

8
Differential File & Index Operation

Performance hit.
Most queries search both DI and Index.
Increase in # of disk accesses/query.
Use a filter to decide whether or not DI should
be searched.

9
Ideal Filter

Y => this key is in the DI.
N => this key is not in the DI.
Functionality of ideal filter is same as that of
DI.
So, a filter that eliminates performance hit of
DI doesn't exist.

10
Bloom Filter (BF)

N => this key is not in the DI.
M (maybe) => this key may be in the DI.
Filter error.
BF says Maybe.
DI says No.

11
Bloom Filter (BF)

Filter error.
BF says Maybe.
DI says No.

BF resides in memory.
Performance hit paid only when there is a filter
error.
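
The query path described on slides 8 to 11 can be sketched as follows. This is a minimal illustration, not the presenter's implementation: the names `lookup`, `crude_filter`, `di`, `master_index` and `stats` are hypothetical, the indexes are plain in-memory dictionaries standing in for on-disk structures, and `crude_filter` is a deliberately simple stand-in for a real Bloom filter (it never says No for a key that is in the DI, but sometimes says Maybe for a key that is not).

```python
from typing import Callable, Dict, Optional


def lookup(
    key: int,
    maybe_in_di: Callable[[int], bool],
    di: Dict[int, Optional[str]],
    master_index: Dict[int, str],
    stats: Dict[str, int],
) -> Optional[str]:
    """Return the record location for key, searching DI only on a Maybe."""
    if maybe_in_di(key):
        stats["di_searches"] += 1          # the performance hit is paid here
        if key in di:
            return di[key]                 # None marks a deleted record
        stats["filter_errors"] += 1        # filter said Maybe, DI says No
    return master_index.get(key)


# Hypothetical data: records 2 and 3 were modified; record 3 was deleted.
master_index = {1: "MF:0", 2: "MF:1", 3: "MF:2", 6: "MF:3"}
di = {2: "DF:0", 3: None}


def crude_filter(key: int) -> bool:
    # Stand-in filter (assumption): no false negatives for keys 2 and 3,
    # but key 6 also passes, which produces a filter error.
    return key % 4 in (2, 3)


stats = {"di_searches": 0, "filter_errors": 0}
results = {k: lookup(k, crude_filter, di, master_index, stats) for k in (1, 2, 3, 6)}
print(results, stats)
```

Only the lookup for key 6 pays for a wasted DI search; the lookup for key 1 skips the DI entirely.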

12
Longest Matching Prefix

Suppose the router prefixes have W different
lengths.
Create W Bloom filters, one for each length.
ith Bloom filter is for prefixes of length i.
Keep W hash tables. ith hash table has length i
prefixes together with next hop information.
Query Bloom filters to get list of hash tables
that may have matching prefix.
Query hash tables in decreasing order of length
(or, in parallel) to find longest matching prefix.
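
The scheme above can be sketched in a few lines. This is an illustrative sketch under several assumptions that are not in the slides: prefixes and addresses are written as bit strings, the filter positions come from salted SHA-256 digests, and the class names `BloomFilter` and `PrefixTable`, along with the default sizes `m=1024` and `h=3`, are hypothetical.

```python
import hashlib


class BloomFilter:
    def __init__(self, m: int, h: int):
        self.m, self.h = m, h
        self.bits = bytearray(m)           # one byte per bit, for clarity

    def _positions(self, key: str):
        # h positions derived from salted SHA-256 digests (an assumption).
        for j in range(self.h):
            digest = hashlib.sha256(f"{j}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.m

    def add(self, key: str) -> None:
        for p in self._positions(key):
            self.bits[p] = 1

    def may_contain(self, key: str) -> bool:
        return all(self.bits[p] for p in self._positions(key))


class PrefixTable:
    """One Bloom filter and one hash table per distinct prefix length."""

    def __init__(self, m: int = 1024, h: int = 3):
        self.m, self.h = m, h
        self.filters = {}                  # length -> BloomFilter
        self.tables = {}                   # length -> {prefix: next hop}

    def insert(self, prefix: str, next_hop: str) -> None:
        length = len(prefix)
        if length not in self.filters:
            self.filters[length] = BloomFilter(self.m, self.h)
            self.tables[length] = {}
        self.filters[length].add(prefix)
        self.tables[length][prefix] = next_hop

    def longest_match(self, address: str):
        # Step 1: ask the filters which lengths may hold a matching prefix.
        candidates = [n for n, bf in self.filters.items()
                      if n <= len(address) and bf.may_contain(address[:n])]
        # Step 2: probe the hash tables, longest candidate first.
        for n in sorted(candidates, reverse=True):
            hop = self.tables[n].get(address[:n])
            if hop is not None:            # a miss here is a filter error
                return address[:n], hop
        return None


table = PrefixTable()
table.insert("10", "A")
table.insert("1011", "B")
table.insert("0", "C")
print(table.longest_match("10110000"), table.longest_match("10000000"))
```

Because every Maybe is confirmed against a hash table, a filter error costs an extra probe but never produces a wrong next hop.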

13
Longest Matching Prefix

14
Bloom Filter Design

Use m bits of memory for the BF.
Larger m => fewer filter errors.
Initially, all m bits = 0.
Use h > 0 hash functions f1(), f2(), ..., fh().
When key k inserted into DI, set bits f1(k),
f2(k), ..., and fh(k) to 1.
f1(k), f2(k), ..., fh(k) is the signature of key k.

15
Example

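m = 11 (normally, m would be much larger).
h = 2 (2 hash functions).
f1(k) = k mod m.
f2(k) = (2k) mod m.
k = 15: f1(15) = 4 and f2(15) = 8, so bits 4 and 8 are set to 1.

k = 17: f1(17) = 6 and f2(17) = 1, so bits 6 and 1 are set to 1.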

16
Example

DI has k = 15 and k = 17.
Search for k.
Bit f1(k) = 0 or bit f2(k) = 0 => k not in DI.
Bit f1(k) = 1 and bit f2(k) = 1 => k may be in DI.
k = 6 => filter error (bits f1(6) = 6 and f2(6) = 1 are both 1, but 6 is not in DI).
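
The worked example can be reproduced with a short sketch. The hash functions and m = 11 are taken from the slides; the helper names `insert` and `may_contain` and the list-of-integers bit array are assumptions made for illustration.

```python
M = 11                      # filter size in bits, from the example


def f1(k: int) -> int:
    return k % M


def f2(k: int) -> int:
    return (2 * k) % M


bits = [0] * M              # initially, all m bits are 0


def insert(k: int) -> None:
    # Set the bits named by the signature of k.
    bits[f1(k)] = 1
    bits[f2(k)] = 1


def may_contain(k: int) -> bool:
    # False means "No"; True means "Maybe".
    return bits[f1(k)] == 1 and bits[f2(k)] == 1


for key in (15, 17):
    insert(key)

set_bits = [i for i, b in enumerate(bits) if b]
print(set_bits)             # bits 1, 4, 6 and 8 are set
print(may_contain(6))       # True: a filter error, since 6 was never inserted
print(may_contain(5))       # False: bit f1(5) = 5 is still 0
```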

17
Bloom Filter Design

Choose m (filter size in bits).
Use as much memory as is available.
Pick h (number of hash functions).
h too small => probability of different keys
having same signature is high.
h too large => filter becomes filled with ones
too soon.
Select the h hash functions.
Hash functions should be relatively independent.

18
Optimal Choice Of h

Probability of a filter error depends on:
Filter size m.
# of hash functions h.
# of updates before filter is reset to 0, u.
Insert
Delete
Change
Assume that m and u are constant.
# of master file records = n >> u.

19
Probability Of Filter Error

p(u) = probability of a filter error after u
updates
= A * B
A = p(request for an unmodified record after u
updates)
B = p(filter bits are all 1 for this request for
an unmodified record)

20
A = p(request for unmodified record)

p(update j is for record i) = $1/n$.
p(record i not modified by update j) = $1 - 1/n$.
p(record i not modified by any of the u updates)
= $(1 - 1/n)^u$
= A.

21
B = p(filter bits are all 1 for this request)

Consider an update with key K.
p($f_j(K) \neq i$) = $1 - 1/m$.
p($f_j(K) \neq i$ for all j) = $(1 - 1/m)^h$.
p(bit i = 0 after one update) = $(1 - 1/m)^h$.
p(bit i = 0 after u updates) = $(1 - 1/m)^{uh}$.
p(bit i = 1 after u updates) = $1 - (1 - 1/m)^{uh}$.
p(signature of K is 1 after u updates)
= $[1 - (1 - 1/m)^{uh}]^h$
= B.

22
Probability Of Filter Error

p(u) = A * B
= $(1 - 1/n)^u \, [1 - (1 - 1/m)^{uh}]^h$
$(1 - 1/x)^q \approx e^{-q/x}$ when x is large.
$p(u) \approx e^{-u/n} (1 - e^{-uh/m})^h$
d p(u)/dh = 0 => $h = (\ln 2)\, m/u \approx 0.693\, m/u$.
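
The slide states the result of setting the derivative to zero without the intermediate steps. One way to fill them in is sketched below; the substitution $t = e^{-uh/m}$ is introduced here for convenience and is not in the slides. Because the logarithm is monotonic, it is enough to find the stationary point of $\ln p(u)$.

$$
\begin{aligned}
\ln p(u) &\approx -\frac{u}{n} + h \ln\left(1 - e^{-uh/m}\right) \\
\frac{d \ln p(u)}{dh} &= \ln(1 - t) + \frac{(uh/m)\, t}{1 - t}, \qquad t = e^{-uh/m} \\
&= \ln(1 - t) - \frac{t \ln t}{1 - t} \qquad \text{since } uh/m = -\ln t \\
\frac{d \ln p(u)}{dh} = 0 &\iff (1 - t)\ln(1 - t) = t \ln t
\end{aligned}
$$

The last condition is symmetric in $t$ and $1 - t$, so $t = 1/2$ satisfies it. Then $e^{-uh/m} = 1/2$, which gives $h = (\ln 2)\, m/u$: at this h, about half of the filter bits are 1.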

23
Optimal h

$h \approx 0.693\, m/u$.
m = $10^6$, u = $10^6/2$
$h \approx 1.386$
Use h = 1 or h = 2.

m = $2 \times 10^6$, u = $10^6/2$
$h \approx 2.772$
Use h = 2 or h = 3.
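
Since h must be an integer, the two neighbouring integers can be compared by plugging them into the approximate error formula from slide 22. The sketch below does that; the function names are hypothetical, and the value n = 10^8 is an assumption chosen only to satisfy n >> u (n scales both candidates by the same factor, so it does not affect the choice).

```python
import math


def optimal_h(m: int, u: int) -> float:
    """Real-valued optimum h = (ln 2) m / u."""
    return math.log(2) * m / u


def filter_error(m: int, u: int, h: int, n: int) -> float:
    """Approximate p(u) = e^(-u/n) * (1 - e^(-uh/m))^h."""
    return math.exp(-u / n) * (1 - math.exp(-u * h / m)) ** h


def best_integer_h(m: int, u: int, n: int) -> int:
    """Pick whichever of floor(h*) and ceil(h*) gives the smaller error."""
    h_star = optimal_h(m, u)
    candidates = {max(1, math.floor(h_star)), max(1, math.ceil(h_star))}
    return min(candidates, key=lambda h: filter_error(m, u, h, n))


N = 10**8                                   # assumed master file size, n >> u
U = 10**6 // 2

for m in (10**6, 2 * 10**6):
    h_star = optimal_h(m, U)
    h_best = best_integer_h(m, U, N)
    print(f"m={m}: h*={h_star:.3f}, best integer h={h_best}, "
          f"p(u)~{filter_error(m, U, h_best, N):.4f}")
```

Under this formula, h = 1 comes out slightly ahead of h = 2 in the first case, and h = 3 ahead of h = 2 in the second.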
