Title: Challenges in Exploiting Exponential Storage Gains
1Challenges in Exploiting Exponential Storage Gains
- Seagate Research Lab Grand Opening
- Pittsburgh, PA
- 21 August 2002
- Gordon Bell
- Microsoft Bay Area Research Center
- http//www.research.microsoft.com/gbell
2Bottom Lines aka Killer apps for storage
everywhere we look
- MyLifeBits recording almost everything
- The most cost-effective, highest volume stores
consumer home PCs for video. - Small form factor drives pocket form factor
cameras, phones, tablets, e-books - Largest stores include Operating System,
database, and interconnection via LANs/WANs and
in the cloud
3(No Transcript)
4MyLifeBits, The Challenge of a One1K Tbyte,
lifetime PCs Cyberizing everythingIve
written, said, presented (incl. video), photos
of physical objects a few things Ive read,
heard, seenand might want to see on TV
5"The PC is going to be the place where you store
the information really the center of control
Billg 1/7/2001
- MyLifeBits is an on-going project following
CyberAll to cyberize all of personal bits! - Memory recall of books, CDs, communication,
papers, photos, video - Photos of physical object collections
- Elimination of all physical stores objects
- Content source for home media ambiance,
entertainment, communication, interaction
Freestyle for CDs, photos, TV content, videos - Goal to understand the 1 TByte PC need,
utility, cost, feasibility, challenge tools.
6MyLifeBits charter MemexAs We May Think -
Vannevar Bush
- A memex is a device in which an individual
stores all his books, records, and
communications, and which is mechanized so that
it may be consulted with exceeding speed and
flexibility - Selection by association, rather than indexing,
may yet be mechanized
7Storing all weve read, heard, seen
Human data-types /hr /day
(/4yr) /lifetime read text, few pictures 200 K
2 -10 M/G 60-300 G speech text _at_120wpm 43 K
0.5 M/G 15 G speech _at_1KBps 3.6 M 40 M/G 1.2
T stills w/voice _at_100KB 200 K 2 M/G 60 G
video-like 50Kb/s POTS 22 M .25 G/T 25
T video 200Kb/s VHS-lite 90 M 1 G/T 100 T
video 4.3Mb/s HDTV/DVD 1.8 G 20 G/T 1 P
8Character of Cyber All Use
9CyberAll Nov.1, 2001
Music 6.9 GB 1.8K files 180 CDs
Working 2.3 GB 432 folders 2.9K files
Archive 5.1 GB 477 folders 18.7 K files
.gif
.ppt/ppt albums
.pdf
My Books 98 MB
.tif
Mail .7 GB43K msgs
Video 2.6 GB 10 hours Low res
.xls
.jpg
.doc/html
27.1K files 42K .msg 17.7 GB (by size)
Files (by number)
10MyLifeBits use scenarios
- Acquire from every potentially useful source
including the web, voice and instant messages - Personal use of MLB for work to recall everything
- Provide ambiance entertainment Personal/home
broadcast, CD, Internet radio, TV screen saving - Creation of photo and video albums
- Events, places, trips, people, time
intervals-------------- Database land
-------------------------------------- - Personal/web hosted collections catalogs
- A Person (auto- or -biography web hosted time
line - Historical events by type Personal time line
- Compile a lifes story about (event types, range,
etc.) - IndividualHow I spent my year. A personal diary.
- ISBQ Interactive Story By Query
11ISBQ Editor Interface
Query for media
Query results can be dragged and dropped into
timeline below
Video and images can be added to HTML page
Audio track for story
12Why annotatethe future?
- Future cameras have
- Creation time, content info e.g. people, scene
type - GPS place
- Voice annotation about the shot and scene
- Speech recognition of voice
- Is annotation meta-data about an object?
13Imagine the killer app for The One Tbyte,
Lifetime, PC
- MyLifeBits demonstrates need for lifetime memory!
- MODI (Microsoft Office Document Imaging)! The
most significant Office addition since HTML. - Technology to support the vision
- Guarantee that data will live forever!
- A single index that includes mail, conversations,
web accesses, and books! - E-booke-magazines reach critical mass!
- Telephony and audio capture are needed
- Photo video index serving
- More meta-information Office, photos
- Lots of GUIs to improve ease-of-use
14MyMainBrain storage
- Everything stored in a database to facilitate
searching, backup, complex attributes e.g. photo
characteristics - Audio, video, images(?) may also be stored in
file system (for access). - Ability to easily annotate and form
collections of all the globs
15The Home Digital Multimedia Network
- Vision All digital content. IP on everything.
- Content source for home media ambiance,
entertainment, communication, and interaction - Freestyle for CDs, photos, TV content, videos
- All listening/viewing stations will be digital.
- In the 10year, short-term, Digital Transformers
convert IP to legacy analog devices. - Today Digital Transformers computers!
16The Connected Home
Peripherals
Digital photos
TV
TV
Gaming
Screen devices
Stereo
17Home Networks PC-based service
DSL, etc. input
- Servers
- Hold deliver audio, photos, video
- Encode TV content
- Computers
- Control, get content from web, servers
- Monitors HDTV
- TV-sets receive encoded CATV content
- C computer. X digital transformer.
Home IP network
X
X
X
C.srv
Rec/AMP
Monitor
broadcast
TVset
HDTVTuner
CATV Dist
CATV Network
18Home media network with Digital Transformers
19A Digital Transformer for Audio Gateways
Connected Home Audio Player
20Existing Home Entertainment Centers
21The Black PC aka DHEC Digital Home
Entertainment Center
22ACTIVY Media CenterOne H/W for multiple
functions
Reduces the number of devices, remotes and wires
around the TV
23Pioneer Plasma Panel with 1280 x 768 pixelsTV
Computer Web Surfing at 12
24Art
25Caneel Bay Vacation Jan. 1998
26Disks are becoming computers
- Smart drives
- Camera with micro-drive
- Replay / Tivo / Ultimate TV
- Phone with micro-drive
- MP3 players
- Tablet
- Xbox
- Many more
ApplicationsWeb, DBMS, Files OS
Disk Ctlr 1Ghz cpu 1GB RAM
Comm Infiniband, Ethernet, radio
Courtesy of Jim Gray, Microsoft Bay Area Research
27Chameleon
28Chameleon an XP/CE/Cellphone(800x300 pixels, 5
GB 256 MB computer)
29Disk As Tape What format?
- Today we ship NTFS/SQL disks.
- But that is not a good format for Linux.
- Solution Ship NFS/CIFS/ODBC servers (not disks)
- Plug disk into LAN.
- DHCP then file or DB server via standard
interface. - Web Service in long term
Courtesy of Jim Gray
30Grays 2.4 K, 1 TByte Sneakernet aka Disk Brick
Cost to move a Terabyte Cost, time, and speed to
move a Terabyte Cost of a Sneaker-Net Terabyte
Courtesy of Jim Gray, Microsoft Bay Area Research
31Cost to move a Terabyte
32Cost, time of Sneaker-net vs Alts
Courtesy of Jim Gray, Microsoft Bay Area Research
33 Grays 2,400 1 TByte Sneaker-net
34Google1.5PB as of last spring
- 8,000 no-name PCs
- Each 1/3U, 2 x 80 GB disk, 2 cpu 256MB ram
- 1.4 PB online.
- 2 TB ram online
- 8 TeraOps
- Slice-price is 1K so 8M.
- 15 admins (!) ( 1/100TB).
35Bottom Line
- The focus of computation has shifted from
processing to storage. - Every app and price level is storage oriented
from in/on body, personal, home servers, to large
scale commercial and scientific apps - With databases, pre-computed indices beat
exhaustive searches every time.
36The End