Title: Designing Information Architecture for Search
1Designing Information Architecturefor Search
Tutorial SIGIR 2001
- Marti Hearst
- University of California, Berkeley
- www.sims.berkeley.edu/hearst
- NSF CAREER Grant, NSF9984741
2Outline
- Motivation
- Search Interfaces
- Web search vs Site Search
- Search UIs What works what doesnt
- Methodology
- Information Architecture Defined
- Faceted Metadata
- Integrating Search into IA via Faceted Metadata
- Results of Usability Studies
- Tools
- Conclusions
3Contributors to the Research
- Dr. Rashmi Sinha
- Graduate Students
- Ame Elliott
- Jennifer English
- Kirsten Swearington
- Ping Yee
- Research funded by
- NSF CAREER Grant, NSF9984741
4Motivation and Background
5Claims
- Web Search is OK
- Gets people to the right starting points
- Web SITE search is NOT ok
- The best way to improve site search is
- NOT to make new fancy algorithms
- Instead
6The best way to improve search
Improve the User Interface
7Recent Study by Vividence Research
- Spring 2001, 69 web sites
- 70 eCommerce
- 31 Service
- 21 Content
- 2 Community
- The most common problems
- 53 had poorly organized search results
- 32 had poor information architecture
- 32 had slow performance
- 27 had cluttered home pages
- 25 had confusing labels
- 15 invasive registration
- 13 inconsistent navigation
8Vividence findings effects on users
- Poorly organized search results
- Frustration and wasted time
- Poor information architecture
- Confusion
- Dead ends
- "back and forthing"
- Forced to search
9Vividence findings effects on users
- Cluttered home pages
- Creates disinterest
- Wastes time
- No contrast everything has equal weight
- Dont know where to start
- Failure to engage
- No call to action
- Failure to establish navigation
- Layout reflects company organization chart
- Investor centeredness
10Vividence findings characteristics
- Inconsistent Navigation
- Primary navigation bar is, in fact, really
secondary - Un-scalable designs
- Poor transitions between company divisions
- "Junk Drawer" navigation bars
- Random links
- Shoe-horned functions
- Heavy need to hit the "back-button"
11Vividence Study
- Breakdown of most common search problems
- 41 - of searches encountered no problems
- 20 - had search problems not named below
- 14 - of searches were not advanced enough
- 12 - did not organize results well
- 10 - of searches yielded inaccurate/unrelated
results - 9 - were too slow
- 8 - of searches had insufficient instructions
- 7 - engine was too difficult to locate
- 7 - of searches produced too few results
- 7 - of searches were too limiting
- 3 - of searches produced an error message
- 3 - were too difficult to use
12Other Relevant Studies
- Commercial studies (are not usually scientific,
do not supply full details) - CreativeGood.com Holiday 2000 ecommerce report
- UIE, and Jared Spools talks http//world.std.com
/uieweb - Scientific studies (often less relevant to real
web situations) - Many papers from the CHI proceedings
http//www.acm.org/dl/ - Papers from Human Factors and the Web
http//www.optavia.com/hfweb/ - See the extensive bibliography from my textbook
chapter (in this package).
13The Philosophy
- Information architecture should be designed to
integrate search throughout - Search results should reflect the information
architecture. - This supports an interplay between navigation and
search - This supports the most common human search
strategies.
14The Approach
- Assign faceted metadata to content items
- Allow users to navigate through the faceted
metadata in a flexible manner - Organize search results according to the faceted
metadata so navigation looks similar throughout - Give previews of next choices
- Allow access to previous choices
15Advantages of the Approach
- Supports different task types
- Highly constrained known-item searches use one
interface - Open-ended, browsing tasks use another interface
- Both types of interface use the same underlying
structure - Can easily switch from one interface type to the
other midstream
16Advantages of the Approach
- Honors many of the most important usability
design goals - User control
- Provides context for results
- Reduces short term memory load
- Allows easy reversal of actions
- Provides consistent view
17Advantages of the Approach
- Allows different people to add content without
breaking things - Can make use of standard technology
18Web Search vs. Site Search
19Web Search is Working!
- Survey finds high user satisfaction
- Study by npd group
- http//www.searchenginewatch.com/reports/npd.html
20Why is Web Search Working?
- Web Search is Successful at Finding Good Starting
Points (home pages) - Evidence
- Search engines using
- Link analysis
- Page popularity
- Interwoven categories
- These all find dominant home pages
21(No Transcript)
22(No Transcript)
23Organizing Search ResultsWhat works, What
Doesnt
- There is a lot of prior work on this
- Cha-Cha (Chen et al. 1999)
- Scatter-Gather clustering (Cutting et al. 93,
Hearst et al. 1996) - Becoming more prevalent in web search too.
- Teoma
- Vivisimo
- Northern Light
24Putting Results into Clusters
25Drilldown what does it mean?
26Vivisimo same idea
27(No Transcript)
28Yahoo lists category matches
29Web Search Results Grouping
- Drill down one category
- Cannot mix and match categories
- Not clear if it is useful or not
- Can help differentiate different meanings of the
same word. - But what about site search?
30If Web search engines are providing source
selection what happens when the user gets
to the site?
Follow Links or Search
31Following Hyperlinks
- Works great when it is clear where to go next
- Frustrating when the desired directions are
undetectable or unavailable
Site Search
Is not getting good reviews
32An Analogy
hypertext
33Analogy
- Hypertext
- A fixed number of choices of where to go next
- A glance at the map tells you where you are
- But may not go where you want to go.
- To get from Topeka to Santa Fe, may have to go
through Frostbite Falls - Site Search
- Can go anywhere
- But may get stuck, disoriented, in a crevasse!
34Goal An All-Tertrain Vehicle
- The best of both techniques
- A vehicle that magically lays down track to
suggest choices of where you want to go next
based on what youve done so far and what you are
trying to do - The tracks follow the lay of the land and go
everywhere, but cross over the crevasses - The tracks allow you to back up easily
35Organizing Search ResultsWhat works what doesnt
36What works, what doesnt
- There is negative evidence for
- Clustering
- Fancy visualizations
- There is positive evidence for
- Grouping into meaningful, consistent categories
- Relevance feedback
- Depends how you do it
- Showing similar items
37Kohonen Feature Maps on Text(from Chen et al.,
JASIS 49(7))
38Study of Kohonen Feature Maps
- H. Chen, A. Houston, R. Sewell, and B. Schatz,
JASIS 49(7) - Comparison Kohonen Map and Yahoo
- Task
- Window shop for interesting home page
- Repeat with other interface
- Results
- Starting with map could repeat in Yahoo (8/11)
- Starting with Yahoo unable to repeat in map (2/14)
39Study (cont.)
- Participants liked
- Correspondence of region size to documents
- Overview (but also wanted zoom)
- Ease of jumping from one topic to another
- Multiple routes to topics
- Use of category and subcategory labels
40Study (cont.)
- Participants wanted
- hierarchical organization
- other ordering of concepts (alphabetical)
- integration of browsing and search
- corresponce of color to meaning
- more meaningful labels
- labels at same level of abstraction
- fit more labels in the given space
- combined keyword and category search
- multiple category assignment (sportsentertain)
41Visualization of Clusters
- Huge 2D maps may be inappropriate focus for
information retrieval - Cant see what documents are about
- Documents forced into one position in semantic
space - Space is difficult to use for IR purposes
- Hard to view titles
- Perhaps more suited for pattern discovery
- problem often only one view on the space
42Summary Clustering(Based on other studies as
well)
- Advantages
- Get an overview of main themes
- Domain independent
- Disadvantages
- Many of the ways documents could group together
are not shown - Not always easy to understand what they mean
- Different levels of granularity
- Probably best for scientists only
- Take heart there is good evidence for
organizing via categories!
43The DynaCat System
- Decide on important question types in an advance
- What are the adverse effects of drug D?
- What is the prognosis for treatment T?
- Make use of MeSH categories
- Retain only those types of categories known to be
useful for this type of query.
Pratt, W., Hearst, M, and Fagan, L. A
Knowledge-Based Approach to Organizing Retrieved
Documents. AAAI-99 Proceedings of the Sixteenth
National Conference on Artificial Intelligence,
Orlando, Florida, 1999.
44DynaCat
45DynaCat Study
- Design
- Three queries
- 24 cancer patients
- Compared three interfaces
- ranked list, clusters, categories
- Results
- Participants strongly preferred categories
- Participants found more answers using categories
- Participants took same amount of time with all
three interfaces
46Cha-Cha (intranet search)
Cha-Cha A System for Organizing Intranet Search
Results, by Chen, Hearst, Hong, and Lin,
Proceedings of 2nd USENIX Symposium on Internet
Systems, Boulder, CO, Oct 1999.
cha-cha.berkeley.edu
47Cha-Cha (intranet search)
48How People Search
49The Standard Model
- Assumptions
- Maximizing precision and recall simultaneously
- The information need remains static
- The value is in the resulting document set
50Berry-Picking as an Information Seeking
Strategy (Bates 90)
- Berry-picking model
- Interesting information is scattered like berries
among bushes - The user learns as they progress, thus
- The query is continually shifting
51A sketch of a searcher moving through many
actions towards a general goal of satisfactory
completion of research related to an information
need. (after Bates 89)
Q2
Q4
Q3
Q1
Q5
Q0
52Search Tactics and Strategies
- Marcia J. Bates, Information Search Tactics,
Journal of the American - Society for Information Science, 30, 4, 1979
- Marcia J. Bates, Where should the person stop and
the information - search interfaces start?, Information Processing
Management, 26, 5, - 1990
- Marcia J. Bates, The Berry-Picking Search User
Interface Design, User - Interface Design, Harold Thimbleby,
Addison-Wesley, 1990 - Marcia J. Bates, The design of browsing and
berrypicking techniques - for the on-line search interface, Online Review,
1989, 13, 5, - 407431.
- Vicki L. O'Day and Robin Jeffries, Orienteering
in an information - landscape how information seekers get from here
to there, Proceedings of ACM INTERCHI '93, April,
Amsterdam, 1993 - Gary Marchionini, Information Seeking in
Electronic Environments, Cambridge University
Press, 1995.
53Tactics vs. Strategies
- Tactic short term goals and maneuvers
- operators, actions
- Strategy overall planning
- link a sequence of operators together to achieve
some end
54An Important Strategy
- Do a simple, general search
- Gets results in the generally correct area
- Look around in the local space of those results
- If that space looks wrong, start over
- Akin to Shneidermans overview details
- Our approach supports this strategy
- Integrate navigation with search
55Term Tactics
- Move around a thesaurus
- Look at category labels
- Look at related terms
- Look at parent terms
- Look at child terms
- In older literature, refers to navigating the
thesaurus itself, as opposed to the items
themselves.
56Source-level Tactics
- Bibble
- look for a pre-defined result set
- e.g., a good link page on web
- Survey
- look ahead, review available options
- e.g., dont simply use the first term or first
source that comes to mind - Cut
- eliminate large proportion of search domain
- e.g., search on rarest term first
57Source-level Tactics (cont.)
- Stretch
- use source in unintended way
- e.g., use patents to find addresses
- Scaffold
- take an indirect route to goal
- e.g., when looking for references to obscure
poet, look up contemporaries
58Monitoring Strategies
- Check
- compare original goal with current state
- Weigh
- make a cost/benefit analysis of current or
anticipated actions - Pattern
- recognize common strategies
- Correct Errors
- Record
- keep track of (incomplete) paths
59Additional Considerations(Bates 79)
- Need a Sort tactic
- When to stop?
- How to judge when enough information has been
gathered? - How to decide when to give up an unsuccesful
search? - When to stop searching in one source and move to
another?
60Information Architecture
61A Taxonomy of WebSites
Catalog Sites Web-based Information Systems
Web-Presence Sites Service-Oriented Sites
high
Complexity of Data
low
low
high
Complexity of Applications
From The (Short) Araneus Guide to Website
development, by Mecca, et al, Proceedings of
WebDB99, http//www-rocq.inria.fr/cluet/WEBDB/pr
ocwebdb99.html
62 A View of Website Design
- Information design
- structure, categories of information
- Navigation design
- interaction with information structure
- Graphic design
- visual presentation of information and navigation
(color, typography, etc.)
Information Architecture
From Sitemaps, Storyboards, and Specifications A
Sketch of Web Site Design Practice as Manifested
Through Artifacts. M.W. Newman and J.A. Landay.
In proceedings of Designing Interactive Systems
DIS '00. August 17-19, 2000.
63A View of Information Architecture
- Content Items
- Information Structure
- Navigation Structure
- Layout
64Content Items
- The information items that the site is designed
to show the user. - Individual content items can be considered leaves
in a tree, or base-level items. - Aggregates of individual (base-level) items can
be considered to be content items. - This definition is especially relevant for
catalog-style sites, for example - Image collection
- Product selling
- Collection of articles on some topic (medical,
legal) - Collection of information about some entity (IRS,
Park Service)
65Information Structure
- Independent of the website.
- A set of descriptors which are used to
characterize the content of a website. - Consists primarly of a category structure and a
set of textual labels. - The categories can have flat, hierarchical,
faceted or network structure. - The textual labels include alternative ways of
expressing the same concepts (synonyms).
66Navigation Structure
- Defined in terms of the website.
- Site level
- The paths connecting content items throughout the
site. - Page level
- The link from one page to others.
67Example from Walmart.com
68Content
Navigation Structure
69Related Items
- Often are content items
- Related to the target by some shared information
structure - The particular related items that are shown are
revealed through the navigation structure
70The Information Structure
- Consists of a set of descriptors for the content
items - Cant really see it directly, since it is
independent of web site description - Can see parts of it in the navigation structure
71A View of Information Architecture
Content Items
Information Structure
Start with an information structure
(categories and labels) and a set of content
items.
72A View of Information Architecture
Content Items
Information Structure
Prod Camping Brand REI Material Nylon Size
4-person
Each content item is assigned some descriptors
from the information structure.
73Navigation structure links items or groups of
items.
74Navigation Structure Differs from Information
Structure
- Example
- Part of the info structure is the product
hierarchy. - Some products are assigned more than one spot in
the hierarchy (e.g., sports and games), thus
forming a tree structure - Navigation structure shows a progressive
disclosure of the hierarchical structure only.
75Navigation Structure Differs from Information
Structure
- Example
- Main navigation structure is the product
hierarchy. - However, lateral links are shown from product
leaf nodes to other nodes - (e.g., from a tent to a flashlight and a sleeping
bag)
76Navigation Structure Differs from Information
Structure
- The differences can be much more profound
- Examples
- Show only main product categories at top levels
- After a search, show links according to brands of
items, but only those brands that make sense for
the items retrieved by the search.
77Breadcrumbs
- A navigation technique for showing either history
or contextualizing hierarchy via hyperlinks. - Two main types
- Hierarchy without history
- Search results at walmart.com
- History across facets (without hierarchy)
- Epicurious path recording.
78An Important IA Trend
- Generating web pages from databases
- Implications
- Web sites can adapt to user actions
- Web sites can be instrumented
- An essential feature of a design environment is
to give authors the possibility of evaluating the
current network against the final adaptive
system. - Petrelli, Baggio, Pezzulo, Adaptive Hypertext
Design Environments Putting Principles into
Practice, AH 2000
79Faceted Metadata
80Metadata data about dataFacets orthogonal
categories
81Faceted Metadata Biomedical MeSH (Medical
Subject Headings)www.nlm.nih.org/mesh
82Mesh Facets (one level expanded)
83Using Mesh Facets
- Some stats
- gt18,000 labels
- avg depth 4.5, max depth 9
- 8 labels/article on average
- How to go from the information structure to the
navigation structure?
84Using faceted metadata incorrectly
- Yahoo uses faceted metadata poorly in both their
search results and in their top-level directory - They combine region other hierarchical facets
in awkward ways
85Yahoos use of facets
86Yahoos use of facets
87Yahoos use of facets
88Yahoos use of facets
- Where is Berkeley?
- College and University gt Colleges and
Universities gtUnited States gt U gt University of
California gt Campuses gt Berkeley - U.S. States gt California gt Cities gtBerkeley gt
Education gt College and University gt Public gt UC
Berkeley
89However, Yahoo does use some metadata well
- Yahoo restaurant guide combines
- Region
- Topic (restaurants)
- Related Information
- Other attributes (cuisines)
- Other topics related in place and time (movies)
90Yellow geographic region
Green restaurants attributes
Red related in place time
91Combining Information Types
- A E
- Film
- Theatre
- Music
- Restaurants
- California
- Eclectic
- Indian
- French
Assumed task looking for evening
entertainment
92Other Possible Combinations
- Region AE
- City Restaurant Movies
- City Weather
- City Education Schools
- Restaurants Schools
93Bookstore preview combinations
- topic related topics
- topic publications by same author
- topic books of same type but related topic
94Problems with Metadata Usage
- Standard approaches
- Paths are hand-edited, predefined
- Not well-integrated with search
- Not tailored to task as it develops
- Not personalized
- Not dynamic
95Questions we are trying to answer
- How many facets are allowable?
- Should facets be mixed and matched?
- How much is too much?
- Should hierarchies be progressively revealed,
tabbed, some combination? - How should free-text search be integrated?
96Recipe Collection Examples
97From soar.berkeley.edu (a poor example)
98(No Transcript)
99From www.epicurious.com (a good example)
100(No Transcript)
101(No Transcript)
102(No Transcript)
103Epicurious Metadata Usage
- Advantages
- Creates combinations of metadata on the fly
- Different metadata choices show the same
information in different ways - Previews show how many recipes will result
- Easy to back up
- Supports several task types
- Help me find a summer pasta,'' (ingredient type
with event type), - How can I use an avocado in a salad?''
(ingredient type with dish type), - How can I bake sea-bass'' (preparation type and
ingredient type)
104Metadata usage in Epicurious
105Metadata usage in Epicurious
106Metadata usage in Epicurious
I
107Metadata usage in Epicurious
gt
I
108Metadata usage in Epicurious
gt
I
Select
Prepare
Cuisine
I
109Recipe Information Architecture
- Information design
- Recipes have five types of metadata categories
- Cuisine, Preparation, Ingredients, Dish, Occasion
- Each category has one level of subcategories
110Recipe Information Architecture
- Navigation design
- Home page
- show top level of all categories
- Other pages
- A link on an attribute ANDS that attribute to the
current query results are shown according to a
category that is not yet part of the query - A change-view link does not change the query, but
does change which categorys metadata organizes
the results
111Metadata Usage in Epicurious
- Can choose category types in any order
- But categories never more than one level deep
- And can never use more than one instance of a
category - Even though items may be assigned more than one
of each category type - Items (recipes) are dead-ends
- Dont link to more like this
- Not fully integrated with search
112Epicurious Basic Search
- Lacks integration with metadata
113(No Transcript)
114Information previews
- Use the metadata to show where to go next
- More flexible than canned hyperlinks
- Less complex than full search
- Help users see and return to what happened
previously - Reduces mental work
- Recognition over recall
- Suggest alternatives
115The Importance of Information Previews
- Jared Spools studies (www.uie.com)
- More clicks are ok if
- The scent of the target does not weaken
- If users feel they are going towards, rather than
away, from their target.
116Problem with Metadata Previews as Currently Used
- Hand edited, predefined
- Not tailored to task as it develops
- Not personalized
- Often not systematically integrated with search,
or within the information architecture in general
117Putting it Together
118Desiderata for Objects inInformation-Seeking
Workspaces
- Structured
- Fractal
- Queriable
- Navigable
- Historical
- Similarity Engine Compatible
- Contextualized
- Other
From Furnas, G., and Rauch, S., Considerations
for information environments and the NaviQue
workspace. In Proceedings of DL 98.
Pittsburgh,PA, June, 1998.
119Search Usability Design Goals
- Strive for Consistency
- Provide Shortcuts
- Offer Informative Feedback
- Design for Closure
- Provide Simple Error Handling
- Permit Easy Reversal of Actions
- Support User Control
- Reduce Short-term Memory Load
From Shneiderman, Byrd, Croft, Clarifying
Search, DLIB Magazine, Jan 1997. www.dlib.org
120Analogy Chess
- Chess is characterized by a few simple rules that
disguise an infinitely complex game - Another intriguing characteristic the
three-part structure - Openings many strategies, new ones all the time,
many books on this - Endgame well-defined, well-understood
- Middlegame nebulous, hard to describe
- Our thought search is similar and the middlegame
is critically underserved.
121Chess-based view of Info Architecture
- The Opening
- Usually exposes top-level hierarchy or top-level
facets (or both) - Usually also has a search component
- This is also the place to expose the main tasks
that can be accomplished on the site
122The Opening
123The Opening
124The Opening
125The Opening
126The Opening
127Chess-based view of Info Architecture
- The Endgame
- Has become rather well-established in shopping
sites - Penultimate page shows a list of items
- Leaf node
- Shows one content item in detail
- Lateral links
- To similar items (same facet)
- To other items that go with it (other facets)
128The Endgame Penultimate Pages
129The Endgame Penultimate Pages
130The Endgame Leaf Nodes
131Chess-based view of Info Architecture
- The Middlegame
- Hardest to describe/understand
- The berry-picking part of supporting search
- Issues
- How to progressively expose hierarchies?
- How to show multiple facet choices?
- How to integrate with search results?
- How to show history / retain context?
132Sophisticated Middlegames
zdnet.com Continued next slide
133Sophisticated Middlegames
134Sophisticated Middlegames
Walmart.com Continued next slide
135Sophisticated Middlegames
136Sophisticated Middlegames
137Online Grocery Shopping Examples
- In each case, note
- Chess analogy
- What is the opening?
- What is the endgame?
- How is the middlegame handled?
- How are search results integrated?
- How is hierarchical drill-down revealed?
- Are multiple facets allowed?
-
138Grocery shopping example
Homerun.com
139Grocery shopping example
Homerun.com
140Grocery shopping example
Homerun.com
141Grocery shopping example
peapod.com
142Grocery shopping example
peapod.com
143Grocery shopping example
peapod.com
144Grocery shopping example
webvan.com
145Grocery shopping example
webvan.com
146Summary Grocery Shopping Examples
- A good opening seems to make a big difference
- Familiar metadata helps make the task easier
- Middlegame hierarchy exposure
- One uses cascading menus
- Two use webpage-based drilldown
- Two use metadata to organize search results
- But dont use metadata creatively
- Could organize by recipe, etc.
147Medical Text Example
- Allow user to select metadata in any order
- At each step, show different types of relevant
metadata, - based on prior steps and personal history,
- include of documents
- Previews restricted to only those metadata types
that might be helpful
148Ecommerce Examples
- E-commerce sites are farther ahead than
information collection sites - However, their problem is usually easier
- Single facet often works fine
- Categories are familiar to users
- Collections are often much smaller
- How to move this to large sites containing more
abstract information? - Image collections?
- Text collections
149Current Search Approach
Can a metadata preview approach do better?
150Asthma gt Steroids
- A steroid-induced acute psychosis in a child with
athsma. - Management of steroid-dependent asthma with
methotrexate.
151Asthma gt Steroids gt Admin Dosage
- Dosage levels for asthmatic steroids A survey.
152- Other paths back up and go forward
153Advantages of the Methodology
- Supports different types of information seeking
tasks - Uses interface idioms known to be usable for
general users - Flexible content entry and update
- Allows for non-experts to add new content
independently - Makes use of standard DBMS technology
154Advantages of the Methodology
- Systematically integrates search
- search results reflect the structure of the info
architecture - search results retain the context of previous
interactions - search results preview next choices
- Gives user control
- Over order of metadata use
- Over when to navigate vs. when to search
- Allows integration with advanced methods
- Collaborative filtering, predicting users
preferences
155Advantages
- Users have a feeling of control
- Users can predict what will happen
- Not true of statistical ranking or clustering
- Adding new items to the system changes the
behavior in understandable ways - Users have flexibility
- In ordering of operations
- In combining of operations
156Usability Study epicurious
157Epicurious Usability Study
- 9 participants so far
- Independent Variables
- 1) Epicurious Interface (Basic vs. Enhanced vs.
Browse) - 2) Task type (known-item search vs. browsing for
inspiration) - 3) Degree of constraint of query
- 4) Number of results required (1 vs. many)
- Dependent Variables
- 1) Time to find satisfactory recipe(s)
- 2) Navigation path (backtracking, starting over,
revising queries) - 3) Satisfaction with results of search
- 4) Satisfaction with individual system features
(e.g. breadcrumbs, query previews, refine by
hyperlinks) - 5) Likelihood of using each interface in the
future.
158Epicurious Usability Study
- Participants were asked to
- Do 3 pre-specified searches in advance
- In the lab
- Specify a cooking scenario of interest to them
- Search for 3 recipes for this recipe
- Search for each recipe using each of the
interfaces - Complete several structured tasks
- Along the way, answer questions about
- Getting closer or farther away from goal
- Satisfaction with search results
- Satisfaction with the interace
159Usability Study Preliminary Results, Preference
Data
160Usability Study Preliminary Results, Preference
Data
161Usability StudyPreliminary Results Feature
Preference
162Usability StudyPreliminary Results Quantitative
163Usability StudyPreliminary Results
Constraint-based Preferences
164Observed patterns of use of epicurious metadata
browse interface
- choosefacet
- refine
- refine
- back
- scan focus
- choosefacet
- refine
- back refine
- scan focus
- choosefacet
- refine
- refine
- scan focus
- choosefacet
- refine
- refine
- choosefacet
- refine
- refine
- searchword
- choosefacet
- searchword
- scan searchword
- back refine
- scan focus
- choosefacet
- refine
- back back refine
- refine
- refine
- choosefacet
- refine
- choosefacet
- refine
- refine
- back refine
- choosefacet
- scan focus
165Usability Study Results Summary
- People liked the browsing-style metadata-based
search and found it helpful - People sometimes preferred the metadata search
when the task was more constrained - But zero results are frustrating
- This can be alleviated with query previews
- People dis-prefer the standard simple search
- More study needed!
166Application to Image Search
167Image Search What is the task?
- Illustrate my slides?
- Find a crevasse
- Keyword match works pretty well
- Find inspiration for an architectural design?
- Needs richer search support
168Faceted Metadata for Image Collection
Planalto Palace Parti Communiste
Francais Pantheon Oscar Neimeyer
Oscar Neimeyer Jacques-Gabriel
Soufflot 20th Century 20th Century
17th 18th C. Brasilia Paris
Paris Stone Curvilinear Stone
Image Architect Period Location Concept
169Faceted Metadata for Image Collection
Planalto Palace Parti Communiste
Francais Pantheon Oscar Neimeyer
Oscar Neimeyer Jaques-Gabriel
Soufflot 20th Century 20th Century
17th 18th C. Brasilia Paris
Paris Stone Curvilinear Stone
Image Architect Period Location Concept
170SPIRO Query Form (Original)
171SPIRO query on Subject church
172Pilot Study
- Architecture task
- Emphasize images over text
- Use hypertext-style interface as a reasonable
baseline for comparison - Find out how much choice is too much
- Find out whether explicit metadata is better than
implicit more-like-this
173Evaluation Methodology
- Solicit feedback from architects to determine if
faceted metadata is helpful and how to present it - Informal evaluation of paper prototype
- Informal study of a crude live version
- 1 hour one-on-one with 9 architects /grad
students, 2 tasks (audio recorded) and a survey
174Results of a pilot study with Archictects
Metadata is Helpful
- Very positive feedback about the general approach
- All 9 participants named the metadata in the
search results area as their favorite aspect of
Flamenco - Metadata was successful at giving hints about
where to go next - Perceived as useful These are places I can go
from here.
175Results More Metadata Please
- Participants asked for more metadata
- Although there were complaints about the contents
of the metadata, users still wanted more - Longer lists of options (more hints)
- Users wanted more control to make very specific
searches - Half the participants requested the ability to
control order of results with metadata - Juxtapose visible images 2 different ways
- Overview (one image from each project) vs. like
together ( all images of a project next to each
other) - Different than ranking for text retrieval
(precision, recall), but ordering does matter
176Results Complaints
- The UI was not successful at clarifying searching
within results vs. starting a new search - Only 2 of the 9 participants understood the
distinction without discussion but they want to
do both - The 1/3 of the participants who couldnt find a
treasure hunt image felt that Flamenco was slow - Corroborates findings that perceived system speed
is about finding what you want (Spool 00)
177New Developments
- A new, sophisticated implementation
- Richer, hierarchical, cleaned up metadata
- Usability Study contrasting four versions
- Single search form
- Multiple facet search form
- Yahoo-style directory-based
- Faceted interface with query previews
- Results TBA
178(No Transcript)
179(No Transcript)
180(No Transcript)
181Tools
- Our system (all open-source)
- Mysql (has a text search component)
- Python 2.2
- Python-mysql
- Webware (python application server)
- Earlier attempt
- Cold fusion not flexible enough, not enough of
a programming language
182A new term Parametric Search
- From an XML glossary
- "A search request submitted to a search or
database engine delivered with consideration for
the metadata of the underlying dataset. - www.sla.org/chapter/ctor/courier/v37/v37n1.pdf
183Commercial Tools
- This list is NOT comprehensive
- These are NOT recommendations
- General Search
- Inktomi Search/site (formerly Infoseek ultra)
- Specializing in Online Catalogs
- Dieselpoint
- Requisite
- Saqqara
- Question-answering
- Askjeeves
- Primus (formerly Answerlogic)
184Parametric Search
- A survey of sites using parametric search
- http//www.amp.com/search/default.asp (see
product family search) - http//ebiz.zilog.com/
- http//www.sears.com (Dieselpoint)
- http//dieselpoint.com/flashlink.htm (for
Dieselpoint 2.0 demo) - http//www.findmro.com (Requisite's BugsEye)
- http//www.cypress.com (Saqqara's one step)
- http//infineon-tech.sacosnet.de/search/index.htm
- http//www.idt.com/tools/parametric.html
- http//www.ti.com/sc/docs/psheets/parms/uarts.htm
parms - http//www.gensemi.com/search/productsearch.htm
- http//www.usa.samsungsemi.com/search/
- http//www.gearfinder.com
- http//www.mysimon.com/category/index.jhtml?cbaby
diaperingbathing
Site list courtesy Mark Detweiler
185Parametric Search Usage
- Goal is to focus on product group for comparison
shopping. - Common Procedure
- Begin with a list of product "families" or
groups. - User selects a category, and is prompted to
- 1) select a sub-category from a list of
hyperlinks or - 2) select search parameters using a form
- If the number of results is too big, the system
may prompt the user to refine the search further.
- When an acceptable number of results is returned,
the user sees a list of products which can be - 1) sorted by various criteria
- 2) selected for display in a comparison table
- 3) viewed individually with more detail.
186Parametric Search as used on these Sites
- Observations
- Only one facet (appropriate for products?)
- No query previews
- Breadcrumbs rare
- Many allow sorting by attribute to facilitate
comparison - Others like this simply moves up the hierarchy
187Summary and Conclusions
188Summary
- Web site search needs improvement
- Users want more organized results
- Our approach integrate navigation with search
- Metadata is being mixed and matched in
interesting ways, but there are no guidelines on
what works - We are investigating how to design websites
containing large sets of items - Preliminary results indicate that metadata
organization is useful in some situations - Depends on the type of search need
189Advantages of the Methodology
- Supports different types of information seeking
tasks - Uses interface idioms known to be usable for
general users - Flexible content entry and update
- Systematically integrates navigation search
- Gives user control
- Allows integration with advanced methods
190Summary
- Our research goals
- Systematically determine what works, with the
following emphases - Task-centric
- Integrate metadata with search
- Dynamic previews
- Easily retrace steps
- Develop recommendations that reflect both the
task structure and the richness of the
information structure - In future integrate with more sophisticated
displays
191Some Unanswered Questions
- How best show combinations of facets that consist
of large hierarchies? - How to use faceted metadata to expand (as opposed
to refine)? - How to integrate with relevance feedback (more
like this)? - How to incorporate user preferences and past
behavior? - How to combine facets to reflect tasks?
192Thank you!
For more information
- bailando.sims.berkeley.edu/flamenco.html