Title: GIRWG GGF12


1
GIR-WG at GGF12
  • Grid Information Retrieval Working Group
  • September 21, 2004
  • VUB, Brussels

2
Session Particulars
  • GGF IP policies apply
  • Meet your chairs
    • Dr. Greg Newby, Arctic Region Supercomputing
      Center (presenting at GGF12)
    • Kevin Gamiel, NC RCI
    • Nassib Nassar, Etymon

3
What is GIR-WG?
  • GIR-WG was chartered by GGF to develop standards
    and reference implementations for information
    retrieval (IR) on computational grids.
  • GIR-WG has published a Requirements document
    under GGF
  • Today, we will discuss progress on the
    Architecture document and the reference
    implementations

4
What is Information Retrieval?
  • IR is the science and method of delivering
    documents that are relevant to human information
    needs.
  • Rather than delivering sets of matching documents
    (as DBMS do), IR systems rank matching documents.
  • IR systems usually focus on textual input data
    (aka, natural language) either unformatted or
    formatted (plain text, HTML, XML, etc.)
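
The contrast between set matching and ranking can be illustrated with a minimal sketch. It is not taken from the presentation: the function names, the toy documents and the raw term-count score are all assumptions made for illustration, and real IR systems use more refined weighting.

```python
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def boolean_match(query, docs):
    """DBMS-style: the unordered set of doc ids containing every query term."""
    terms = set(tokenize(query))
    return {doc_id for doc_id, text in docs.items()
            if terms <= set(tokenize(text))}


def ranked_match(query, docs):
    """IR-style: score each doc by query-term occurrences, best first."""
    terms = tokenize(query)
    scores = {}
    for doc_id, text in docs.items():
        counts = Counter(tokenize(text))
        score = sum(counts[t] for t in terms)
        if score > 0:
            scores[doc_id] = score
    # Ties are broken by doc id so the ordering is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical toy collection.
DOCS = {
    "d1": "grid computing for information retrieval",
    "d2": "retrieval retrieval on the grid",
    "d3": "database systems",
}
```

Both functions find the same two documents for the query "grid retrieval", but only `ranked_match` says which one to read first.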

5
Why is IR a good candidate for Grid computing?
  • Excellent for divide-and-conquer, coarse-grained
    parallelism
  • Input items are discrete
  • Coordination across subsets of a document
    collection can be minimal
  • Results from multiple sources can be coordinated
    and relevance ranked together
  • Queries may be handled independently
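
The points above can be sketched as a search over independent collection subsets whose ranked results are then merged. This is only an illustration under stated assumptions: the function names and the term-count score are hypothetical, a local thread pool stands in for Grid nodes, and scores are assumed to be directly comparable across subsets (which in practice may require shared collection statistics).

```python
import heapq
from concurrent.futures import ThreadPoolExecutor
from itertools import islice


def search_shard(shard, query):
    """Score one collection subset; return (score, doc_id) pairs, best first."""
    terms = query.lower().split()
    hits = []
    for doc_id, text in shard.items():
        words = text.lower().split()
        score = sum(words.count(t) for t in terms)
        if score > 0:
            hits.append((score, doc_id))
    return sorted(hits, key=lambda h: (-h[0], h[1]))


def federated_search(shards, query, k=10):
    """Search every subset independently, then merge into one ranked list."""
    # Each subset is searched without coordinating with the others.
    with ThreadPoolExecutor() as pool:
        per_shard = list(pool.map(lambda s: search_shard(s, query), shards))
    # Each per-subset list is already sorted, so a k-way merge is enough.
    merged = heapq.merge(*per_shard, key=lambda h: (-h[0], h[1]))
    return [doc_id for _, doc_id in islice(merged, k)]


# Hypothetical subsets of one document collection.
SHARDS = [
    {"a1": "grid grid retrieval", "a2": "weather"},
    {"b1": "grid"},
    {"b2": "retrieval grid retrieval grid"},
]
```

Because each subset returns an already ranked list, the coordinator only needs a merge step rather than a full re-sort.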

6
Significant Progress
  • Our chartered deliverables are Documents and
    Reference Implementations
  • Documents
    • GIR Requirements: published
    • GIR Architecture: in mid-draft
    • GIR Specifications: not yet underway
  • Reference implementations
    • MCNC released a technology preview
    • IRTools (gbn): in process
    • Amberfish (nn): in process

7
Documents Detail
  • Requirements document
    • Provides an overview of IR
    • Mentions desirability of Grid infrastructure for
      IR, notably enterprise IR
    • VO (for security, segmentation)
    • Separation of duties (for indexing, collection
      management, query processing)
    • Flexible but coarse-grained flow of control among
      elements
    • Persistence of queries, collections and indexes
    • Proposes three primary components
      • Collection manager: handles input gathering,
        transformation, transport, staging and delivery
      • Indexer: core information retrieval collection
        representation
      • Query processor: responds to user needs, including
        standing information needs (i.e., information
        filtering)
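
The division of labor among the three components can be sketched as follows. This is not the working group's interface: every class, method and parameter name here is hypothetical, the index is a plain in-memory inverted index, and standing queries are handled by re-running them against newly indexed documents.

```python
from collections import defaultdict


class CollectionManager:
    """Gathers and transforms input documents, then delivers them."""

    def __init__(self):
        self.documents = {}

    def add(self, doc_id, raw_text):
        # Transformation step: normalize whitespace and case.
        self.documents[doc_id] = " ".join(raw_text.split()).lower()

    def deliver(self):
        return dict(self.documents)


class Indexer:
    """Holds the collection representation: term -> {doc_id: count}."""

    def __init__(self):
        self.postings = defaultdict(dict)

    def index(self, documents):
        # Pass each document only once; re-indexing would double its counts.
        for doc_id, text in documents.items():
            for term in text.split():
                self.postings[term][doc_id] = self.postings[term].get(doc_id, 0) + 1


class QueryProcessor:
    """Answers ad hoc queries and keeps standing queries (filtering)."""

    def __init__(self, indexer):
        self.indexer = indexer
        self.standing_queries = {}

    def search(self, query):
        scores = defaultdict(int)
        for term in query.lower().split():
            for doc_id, count in self.indexer.postings.get(term, {}).items():
                scores[doc_id] += count
        return sorted(scores, key=lambda d: (-scores[d], d))

    def register_standing_query(self, name, query):
        self.standing_queries[name] = query

    def filter_new(self, new_doc_ids):
        """For each standing query, list the newly indexed docs that match."""
        new = set(new_doc_ids)
        return {name: [d for d in self.search(q) if d in new]
                for name, q in self.standing_queries.items()}
```

A typical flow would be: add documents to the collection manager, pass `deliver()` to the indexer, then query through the query processor; a standing query registered earlier is checked again with `filter_new` whenever new documents are indexed.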

8
Reference Implementation Detail
  • MCNC released a functional demonstration based on
    GT2.4. Available for free download, and
    extensively documented. Demonstrated at GGF8,
    GGF9 and GGF10
  • Newby's IRTools is an existing IR toolkit being
    fitted for GIR
  • Nassar's Amberfish is an existing IR toolkit
    being fitted for GIR
  • Newby and Nassar are TREC participants, giving the
    opportunity for evaluation of GIR systems
  • Additional partners, input data, etc. are being
    sought

9
Implementation Approaches
  • Do not rely on particular implementations or
    middleware (that is, Globus)
  • Pursue different types of Grid implementations
    • Minimalist, home-grown
    • Globus-based
    • Pure Web services

10
Architecture Document
  • Builds on Requirements
  • Builds towards Specifications
  • Organized around the three main GIR components,
    plus security
  • Main body: common architectural elements
  • Appendices: elements for specific middleware

11
Planned Appendices
  • GIRI: minimalist implementation using sockets.
    Grid elements (VO, persistence, security) are
    home-grown
  • Web services: implementation based on Tomcat.
    Additional Grid functionality developed as
    light-weight services
  • Globus: we are waiting for GT4

12
Discussion of GIR-WG
  • Your questions, thoughts and suggestions

13
Future Steps
  • Architecture draft: look for more progress by
    GGF13 in Seoul
  • Reference implementations: continued development

14
Get Involved!
  • Visit http://www.gir-wg.org
  • Subscribe to gir-wg@gridforum.org
  • Talk with chairs about data and reference
    implementations
