Title: R
1R D for Libraries
- Pete Boguszewski
- Stephen Meyer
- Library Technology Group
- UW-Madison Libraries
2What is this about?
- Who? libraries
- What? delivering first rate services
- When? the sooner the better
- Where? at home (I mean, work)
- Why? we can always do better
- How? w/ agility and creativity
3Who?
Yes, you. You control the Information
Age. Welcome to your world.
4Who?
Librarian
Yes, you. You control your librarys
data. Welcome to your world.
5What?
- Researching and developing better systems and
services in libraries.
6When?
7Where?
8Why?
Because no one quite has it figured out yet.
- (Google is just not afraid to admit it.)
9What?(contd)
- A paradigm change
-
- Embrace the beta!
- Main Entry 1beta
- ...
- 4 a nearly complete prototype of a product (as
software) ltreleased in betagt ltthe beta versiongt - (source http//www.m-w.com/dictionary/beta)
10Why release an unfinished product?
- We dont know yet what we dont know.
(But neither do our users.)
11How?
12A Web 2.0 darling Netflix
- Site update schedule 2 weeks
- They know the benefits of failing fast
13Failing Fast
- Ironically, teams that fail fast improve as fast,
if not faster, than those who try to get it right
the first time. The reason is simple Teams
trying to get it right the first time fail as
often as everyone else does.
source http//www.uie.com/articles/fast_iteration
s/
14Agility
15Warning
- RD can be a dangerous enterprisean organization
must have clear goals - venturing into development requires focus
- solve only known problems
- solve problems that are important
16- What is the problem I am trying to solve?
- Do the tools exist to take on this project?
- Do the staff exist to take on this project?
17What if I fail?
- What can I learn from the experience upon
failure? - What can I learn from the experience if the
product or service does not materialize?
18Library Tech Group
Overview Infrastructure
19The Library Tech Groups Infrastructure
Virtualization Security Ability to move fast
20Virtualization on Vmware
It is truly magic
21Server Setup is Time Consuming
Virtual servers can be cloned quickly We setup
servers with specific software sets, patch them,
test results for consistency
22Why do we care at all?
- It is really cool
- Allow us to be able to look at multiple products
and/or applications at once - We can easily create servers to host products
that have different needs simultaneously - Easily compare functionality, look and feel
23Cloning (of servers) is good
24Last bit on virtualization
Virtualized servers allow us to take a snapshot
of the environment before doing development Can
quickly revert to a moment in time if development
goes bad
25(No Transcript)
26Virtual Environment
Now we have our server environment
27Security
Integral part of development Never replaces
good programming practices or proper development
techniques
28Web Environment
29Security helps development
Blocks out malicious users This locked down
environment allows us to put applications up
quickly
30Library Tech Group Helpdesk
31Project Background
32Ticket System Research
Web Search Ask other institutions Read current
user opinions
33Ticket System Preparation
Commercial Products
Open Source
- Read documentation
- Research which platform is best suited for
application - Research back-end requirements
34Pick your favorite flavor
35Pick your database
36Ticket System Setup
Open Source
Commercial
- Install according to documentation
- Modify based on your specific environment
- Clearly document all changes, snags, surprises
- Install according to documentation
37Ticket System
Compare all products
Compare Products
38Commercial Products
Advantages
Disadvantages
- Easy to install
- Tech support
- Less flexible because we do not have the source
code - Cost
- Less flexible by design
39Open Source
Advantages
Disadvantages
- Constantly changing, fixing bugs
- Ability to modify the source code
- Community enhancements and plug-ins
- Simple, easily changeable interface
- Constantly changing, fixing bugs
- No direct customer support
- Development is not free
40What I learned about Open Source
- Free software is a matter of liberty, not
price. To understand the concept, you should
think of free as in free speech, not as in
free beer. - Richard Stallman
- It can be great in the right situations
41Open source is the big winner
42Ticket System
- Reinstall to get a clean, unmodified starting
point - Implemented but in perpetual beta
- Only used inside our office
43Open Source Benefits
- Constantly adding features
- Email generated tickets
- Web forms
- Inventory information
- - Home-grown scripts
44A learning experience
- Time is money - open source is not free but can
still be well worth the effort - Economies of scale
- Later projects on Linux benefit from this
experience - Now have expertise in-house
45Make the catalog data work harder
46Inspiration
- The OPAC Sucks
- Libraries don't just collect things, we build
collections - the value of a library lies in its
bibliographers, not just its bibliographic info - A faculty member claimed there is no stack browse
47Why does the OPAC suck?
48(No Transcript)
49(No Transcript)
50(No Transcript)
51(OPAC)
52- Who's ever written a great work about the immense
effort required in order not to create?
Dostoyevsky Wannabe from the movie Slacker
53(OPAC)
54Leveraging our greatest strengths
- Patrons come to the library because we have the
goods - Without an infinite budget, we collect smartly,
rather than indiscriminately - Bibliographers and collection managers build
collections
55- There is no online equivalent to browsing the
stacks.
source paraphrase of a faculty comment during
question and answer session of a library lecture
series
56 57(No Transcript)
58Gawd, like even my llama knows that.
59- Real point of need
- vs.
- Awkward access to our data
60SaneCat(a mini RD project)
61- Is it possible to build and OPAC-like toy that
addresses these issues over the winter
intersession?
62Primary challenge
- How do I realize the my goals within the
construct of a web database application?
63What are the problems I am trying to solve?
- To create an OPAC-like prototype that doesnt
suck - To showcase library collections not just provide
the call number for an individual title - To approximate the experience of browsing the
stacks in 2-D
64Focusing the task at hand
- not sucking vague, fuzzy, dangerous
65Focusing the task at hand(contd)
- showcasing library collections
how does one bibliographic record stand in
relation to others in the collection?
66Focusing the task at hand(contd)
- browsing the stacks online
When does a patron browse the stacks?
67Which problems are important?
- More importantly, which problems are not
important?
68How was it built?
- A random selection of 72,000 catalog records
- almost 1 of our catalog
- 59,686 after dups and errors were thrown out
- 87,761 unique subjects
- 213,719 subfields within subjects
69How was it built?(contd)
- With a whole lot of help and guidance.
70(No Transcript)
71(No Transcript)
72(No Transcript)
73(No Transcript)
74Geeky Details(prototyping tools)
- MySQL database
- marc4j libraries (for parsing raw MARC data)
- Java/Tomcat webapp
- Spring application framework
- Hibernate Object Relational Mapping
- jsp with jstl tag libraries
75techie design goals
- model relationships among bib records(bibliograp
hers build collections) - provide access to data at point of need(faculty
member did not find the access that exists when
he needed it)
76 77What did I learn?
- there are doors to be opened
- there are performance issues to be resolved
- there are data hooks that would need to be
addressed - there is no reason to write acq, cat, circ
modules - we need live circ data
78What if ... we never create a SaneCat?
79- we have a mockup that can stand as leverage with
vendors - we have proof that our data can do what we want
- we know that Amazon does not have a monopoly on
'more like this' - we can lend our tech to vendors so our systems
are better
80Where could we take this?
- work out the performance problems
- graph theory and a research map
- begin collecting intentional data
- develop the next gen MARC records an
object-oriented bibl. record
81Why should we do this?
82- This is a fantastic tool for simulating something
like browsing through the stacks. I have enjoyed
playing with it for a few minutes. - ...
- Again, this is a great tool. I look forward to
using it extensively in the future. Please let me
know if I can help in any other way.
source faculty member who would like to browse
stacks online