Title: Digital%20Library
1Digital Library
- The networked collections of digital text,
documents, images, sounds, scientific data, and
software that are the core of todays Internet
and tomorrows universally accessible digital
repositories of all human knowledge
The Presidents Information Technology Advisory
Committee
2Information Collection
3Traditional Library
4(No Transcript)
5(No Transcript)
6- Digital Libraries have been positioned at the
intersection of - Library and Information Science
- Computer Science
- Networked System
7Digital Library
The networked collections of digital text,
documents, images, sounds, scientific data, and
software that are the core of todays Internet
and tomorrows universally accessible digital
repositories of all human knowledge
The Presidents Information Technology Advisory
Committee
8History of Digital Library
- Janus Digital Library,
- 1993, 105,000
- Digital Library Phase I, 1993-1998,
- 24 millions, 6 major projects
- Digital Library Phase II, 1998 now
- about 145 millions, about 30 projects each year
9Janus Digital Library
1993, 105000, electronic preservation
10Digital Library Phase I
- 1994-1998, 24 millions, 6 major projects
111, Integrated Speed, Image and Language
Understanding for Creating Digital Video Library
Carnegie Mellon University. This is the only one
focused on Video Medium.
122, Interoperation mechanisms among heterogeneous
services
Stanford University. This project is focused on
providing a uniform way to access a variety of
servers and information sources. --- InfoBus
Protocol.
133, a prototype of a scalable, intelligent,
distributed electronic library
University of California at Berkeley. A prototype
for environmental information.
144, Towards a Distributed Digital Library
University of California at Santa Barbara. This
project is about Digital Earth, a collection of
information about the world.
155, Digital Library infrastructure for a
University Engineering Community
University of Illinois at Urbana_Champaign. It
provides effective access to engineering and
physics journal articles.
166, Intelligent agents for information location
University of Michigan. Combines the traditional
library and internet technologies to provide the
best support for their users.
17Digital Library Phase II
Start from 1998, 145 million, about 30 projects
each year,
18(No Transcript)
19The Theory of Digital Library
- In the earlier years, the theory of digital
libraries was based on its structures and its
behaviors. - In 2001, Edward A. Fox from Virginia Polytechnic
Institute and State University, propose the
fundamental abstractions of Streams, Structures,
Spaces, Scenarios and Societies (5S), the 5S
theory.
20Structure and Behavior
- Hypertext
- Information Storage (Database System)
- Information Retrieval
- Multimedia Services
- Human Computer Interaction
- Program Language
- Interoperation
21Information Storage
- A digital library must be capable of storing
a large amount of data in a variety of formats
and be able to access this data as quickly as
possible. - lt1gt, Relational database,
- lt2gt, Active Database,
- lt3gt, Mobile Database,
- lt4gt, Multiple Database
- lt5gt, Object Oriented Database
22Relational Database
A relationship between the tables
23Active Database
An automatic reaction by event-condition-action
rules.
24Mobile Database
- Dynamic data and location, Currency Protocol
25Multiple Database
- A Multiple Database System consists of a
collection of autonomous and heterogeneous local
databases.
26Object-Oriented Database
- Object-oriented databases are designed to work
well with object-oriented programming languages
such Java, C, and C. This is because
object-oriented databases used the same exact
model as object-oriented programming languages.
27Information Retrieval
- Metadata searching
- Full-text searching
- Union search platform
28Metadata Searching
- Metadata data about data, structured data, data
about Who, What, Where, When
Metadata tags Title, Creator, Subject,
Description, Publisher, Contributor, Date, Type,
Format, Identifier, Source, Language, Relation,
Coverage, Right. Metadata Attributes Name,
Identifier, Version, Registration, Authority,
Language, Definition, Obligation, Data Type,
Maximum Occurrence, Comment.
29(No Transcript)
30Full-Text Searching
- This is a example of searching for the string,
Visual basic, Oracle. It will search throughout
a document to find a match.
31Union Search Platform
- Various providers produce the many types of
database retrieval systems that exists today. - End users want the ability to access different
types of data using a universal interface. - The solution to this problem is to create a new
application that integrates multiple search
requests into a union search platform.
32Problems
- 1, Why should each digital library start from
scratch? - 2, Interoperability across heterogeneous digital
library systems.
33A Fundamental Digital Library Theory 5S theory
34Streams
- Definition A stream is a sequence whose
- codomain is a nonempty set.
- A sequence of abstract items, used to describe
both static and dynamic content. - It can be text, video, audio, or a software
program.
35Structures
- Definition A structure is a tuple (G, L, F),
where G (V, E) is a directed graph with vertex
set V and edge set E, L is a set of label values,
and F is a labeling function F (V ? E) ? L.
- A labeled directed graphs which imposes
organization. - Collection, catalog, hypertext, document,
metadata, organizational tool. - How is the information organize?
36Spaces
- Definition A space is a measurable space,
measure space, probability space, vector space or
a topological space.
- Contains rules to operate on the abstract items.
- User interface, index, retrieval model.
- Different logic and presentational properties.
The operation of digital library components.
37Scenarios
Definition A scenario is a sequence of related
transition events (e1, e2, ,en) on state set S
such that ek (sk, sk1), for 1? k ? n.
- A sequences of events or actions in order to
accomplish a functional requirement. - Service, event, condition, action
- Communication between users and software
developers.
38Societies
- Definition A society is a tuple (CR), where
- C c1, c2, ,cn is a set of conceptual
communities. - R r1,r2, ,rn is a set of relationships.
- A set of entities and activates, and the
relationships between them. - Community, managers, actors, classes,
relationships, attributes, operations. - Actors and managers act together to carry out the
digital library behavior.
39Digital Library is a collection of digital object.
- Definition A digital library is a 4-tuple (R,
DM, Serv, Soc), where - R is a repository
- DM is a metadata catalog,
- Serv is a set of services containing at least
services for indexing, searching, and browsing - Soc is a society of users of the digital library.
405S Language is an XML realization of the 5S model.
41Study Case
42NDLTD
- In Virginia Tech.
- 177 universities and 27 institutions in
worldwide. - A student creates a ETD file (Electronic Theses
and Dissertation) from his or her theses and
dissertation. The ETD file is then checked for
formatting errors and quantity requirements. The
ETD file is then cataloged and placed on a
electronic bookshelf.
43lt1gt, Stream Model
44lt2gt, Structural Model
Electric Thesis and Dissertation Metadata
Structure
45This is a part of the code.
46lt3gt, Spatial Model
47lt4gt, Scenario Model
An example scenario of a searching service in an
NDLTD DL.
48 49(No Transcript)
50lt5gt, Societal Model
51(No Transcript)
52Digital Library Generation
Digital Library Generation Process with 5SL .
535SGraph, A Domain-Specific Visual Modeling Tool
54Digital Library In a Box
- Simplifies and enables the creation of a digital
library - Can be developed with little or no programming
- Built with an interoperable design
- Creates a minimal digital library in less than
an hour
55Open Digital Library
The goal is universal access to digital libraries
and information services.
56Report to the President Digital Libraries
Universal Access to Human knowledge
57Report to the President, Digital Libraries
Universal Access to Human knowledge
58Questions?