random quote Link: Publications Link: Forum Link: About DLF Link: News
Link: Digital Collections Link: Digital Production Link: Digital Preservation Link: Use, users, and user support Link: Build: Digital Library Architectures, Systems, and Tools
photo of books

DLF AQUIFER PARTICIPANTS

newlogo

Report on a meeting of the Aquifer initiative held at Stanford University on August 5, 2004

D Greenstein

The following is based on a meeting involving:
  • Stanford University (Michael Keller, Jerry Persons)
  • DLF (David Seaman)
  • CDL (Daniel Greenstein, Peter Brantley)
  • UIUC (Paula Kaufman, Beth Sandore)
  • Indiana (Suzanne Thorin)
  • Michigan (John Price Wilkin)
  • JHU (Winston Tabb)
  • Emory (Martin Halbert)
  • Minnesota (Eric Celeste)
  • Virginia (Karin Wittenborg)
  • Library of Congress (Caroline Arms)
  • NYU (Jerome McDonough)

A. What is Aquifer

  • Supporting research, teaching, and learning with high-quality online special collections and distinctive information services. Aquifer is an initiative of the Digital Library Federation (DLF). It leverages DLF members' extant digital collections, their institutional capacities, curatorial expertise, and historic service to scholarly communities.

    • Google commodifies mass information and global search; Aquifer commodifies information's curation and selection, and the distinctive services that it enables

  • A basket of collaborative initiatives in support of innovative scholarship, selected and prioritized by DLF directors, and emphasizing interoperability, resource sharing, and institutional commitment and adherence to consensually developed standards, information architectures, and development agenda

    • Membership in Aquifer requires more than annual dues; it requires alignment of and changes in organizational behaviors

B. Some Aquifer principles

  • Driven by an interest in scholarship
  • Focused practically on the development of discrete online collections and services that support definable research, teaching, and learning needs
  • Supported by collaboration and deep resource sharing
  • Enabled by alignment of organizational behaviors, commitments, and investments

C. Aquifer. A basket of initiatives

Aquifer is envisaged a basket of loosely related initiatives set out below

  1. Foundational service infrastructure - enabling Aquifer with essential organizational support

    1. Enabling Aquifer partners to share and comment upon planning, policy, and other key documents in a secure (password-protected?) web space

      • Action: DS to consult with EC and others in establishing a web-space where Aquifer partners can share documentation and comments as appropriate. Workspace plan to be circulated and reviewed by Aquifer members by September 7 and available for use by September 30, 2004

        • EC action Aug 23: done -- wiki set up
        • DS actionSep 1: done -- website set up at http://www.diglib.org/aquifer/
        • DS actionSep 2: draft workplan and wiki notes sent to workgroup by email

    2. Enabling leverage of partners' technology development effort by creating a registry of the tools they are developing and that can potentially support the initiative's aims. Registered tools will either be open source and platform independent or have the possibility of developing as such. Aquifer is particularly interested in tools that enable and/or support
      1. creation of Aquifer-standard-conformant metadata (enriching the metadata content that can be supplied by a data provider)
      2. enrichment of metadata or content (helping a service provider overcome the constraints inherent in the metadata and content they capture from third-party data providers - annotation tools fit neatly here)
      3. tools being developed around a cornerstone application such as DSpace
      4. tools for integrating digital libraries into local learning environments (e.g. tools that build bridges between commercially supplied platforms such as SFX and WebCT, or that support Blogging, etc.)

      • As part of their membership in Aquifer, partners will commit themselves to contributing to and maintaining the tools registry.

      • Aquifer will use the registry to help prioritize tools co-development effort

      • Actions:
        • DS to consult with EC about development of a Wiki to host the tools registry

        • DS actionAug 23: used EC's Aquifer wiki testbed and learned some wiki markup language. read up on wikis and am preparing a brief background factsheet on them for the group: http://www.diglib.org/pubs/execsumm/wikiexecsumm.htm

        • DS action Aug 25: spoke to EC to follow up on email exchange about the possibilities and limitations of the pmwiki software used at Minnesota

        • DS action Aug 26: spoke to Martin Halbert about the Ockham group's use of wikis as a collaborative editing tool. Will make recommendations to Aquifer group about how best for us to use wiki next week.

        • DS to consult with Aquifer partners to
          • develop detailed documentation about the registry's scope and intention,
          • create a template description for tools (PB to supply DS with template description developed by UC libraries' SOPAG group)

        • DS action Aug 25: spoke to Martha Brogan about working with us on this task, and she is interested (we have previously discussed a “production tools survey” that should perhaps be rolled into this one); am looking for other tools registries as examples, e.g. http://www.oaforum.org/oaf_db/list_db/list_software.php or http://sourceforge.net/softwaremap/trove_list.php

      • Registry specification to be available by DS for review on the Aquifer workspace (see 1.1. above) by October 11, 2004

    3. Enabling learning from partners' institution-based assessment and evaluation activities by developing a repository where members deposit results of the work they undertake to understand users' information and service needs, and to assess the use, usability, and perceived value of various digital library products (collections, services, interfaces, etc.).

      • As part of their membership in Aquifer, partners will commit themselves to contributing to the repository.

      • Actions: DS to consult with staff at partner institutions to:
        • develop detailed documentation about the repository's scope and intent
        • prepare a template for describing reports that are deposited in the repository
        • recommend an infrastructure (service environment) capable of sustainably hosting and managing the repository

      • Repository specification as described above to be available for review on the Aquifer workspace (see 1.1.) by October 11, 2004

  2. Foundational knowledge - informing Aquifer with environmental scans conducted in key areas The environmental scans and the organization or individuals responsible for developing a detailed specification for them are supplied below. Unless otherwise indicated, specifications will be made available on the Aquifer workspace for review and comment by October 11, 2004. Specifications will be detailed enough to use as the basis for an RFP process as may be required.

    1. What lessons can Aquifer learn from IMS's work in the area of content/information interchange? What tools emerging in this space from within the IMS arena and in which Aquifer might take an interest?

      • Action: Stanford to create specification for this environmental scan making it available for review on the Aquifer workspace on October 11, 2004

    2. How are undergraduates, graduates, teachers, and researchers using digital information? What information can be gleaned from extant research (e.g. as sponsored by Mellon, Hewlett, NSF, and others) and initiatives (e.g. ACLS's cyberinfrastructure committee)? What gaps exist in our understanding and how (through what kind of investigative effort) could they be filled?

      • Actions:

        • DG to speak with Saul Fisher (Mellon) to get advice about how/where Aquifer might make headway with these questions, reporting back to partners by October 11, 2004

        • PK to speak with John Unsworth (ACLS/Cyberinfrastructure) to get advice about how/where Aquifer might make headway with these questions, reporting back to partners by October 11, 2004

        • MK/DG/KW to think about and scope a program through which Aquifer might leverage initiatives like that hosted by UVa and with Mellon support as a means of defining (getting some traction under) scholarly information needs in a manner that can help define and focus Aquifer development efforts. The three to consult with Don Waters (Mellon) and report back to Aquifer meeting at DLF Forum, October 2004

  3. Pooling - tools and approaches to aggregating digital information content

    1. What standards, best practices, and approaches can Aquifer encourage and/or require of data providers and service providers in order to improve the adequacy and usefulness of OAI harvesting as a means of harvesting extant online digital library information resources.

    • Where Aquifer partners have the rights necessary to do so, they are committed to making online digital information available for harvesting by other Aquifer partners and doing so in a manner that conforms to the standards and best practices that the group agrees through the above initiative

      • Action: Sarah Shreeves (UIUC) to define and scope a program of activities in this area leveraging the DLF initiative in this area that got underway in July 2004. Program statement to be available on the Aquifer workspace for review by October 11, 2004.

  4. Puddling - tools and approaches to selective assembly of digital information content in support of curated online collections and/or specialized information services

    1. How can Aquifer leverage existing taxonomies to improve search capability and/or to assist in the automated selection (curation) of digital collections that are based on large-scale aggregations of materials assembled via OAI harvesting, web crawling, or other means? What taxonomies exist that could be useful? What are the technical obstacles that need to be overcome so they may be utilized effectively, for example, as web services?

      • Action: Stanford (MK/JP) to develop a detailed specification or problem statement for work in this area along with recommendations for funding required effort, supplying Aquifer partners with both the specification and recommendations by October 11, 2004

    2. What data mining, metadata enrichment, and other tools currently exist or are in development that may assist “curators” to develop special or focused collections based on digital content that is aggregated by OAI harvesting, web crawling and other means

      • Action: CDL to develop a scoping statement and timelines for the survey by October 11, 2004, and to conduct the survey thereafter. Scoping statement and timeline to be made available by October 11, 2004 for review by Aquifer partners via the workspace

    3. Aquifer partners have between them explored a variety of approaches to curating digital library collections. Approaches range from selecting a canonical reference work and digitizing all items mentioned therein (Indiana's work with American Literature), to aggregation of distributed content that meets specific format or protocol requirements (OAIster), to working with scholars to define needs appropriate to a particular course or teaching need (CDL's Cal Cultures). What are the costs and benefits involved in these and the other approaches that have become familiar to us? What lessons can we learn from those with practical experience of these areas that can be brought to bear on our future work building high-quality digital collections that are based at least in part on extant digital content?

      • Action: Pending. What is required is a statement framing work that might be conducted in this area.

    4. What special collections can Aquifer build based at least initially on the digital information content that partners can supply and using the tools and knowledge gained through other Aquifer initiatives? How and by whom will collections be developed?

      • Round-table review of available collections indicated potential pools in the following areas:
        • American history, society, and culture (all and sundry)
        • American literature (UVa/CDL/Emory/NYU/Michigan/Indiana/Stanford/to be informed by Mellon-supported survey being conducted by Martha Brogan)
        • Slavic periodical index and etc (Stanford/Indiana/UIUC)
        • Performing Arts inclusively (Indiana/JHU/NYU/UVa/CDL for UCLA etc)
        • Art and architecture (UVa/CDL/JHU/LoC/Indiana/Michigan)
        • Data (UVa/CDL/JHU/Michigan to speak with ICPSR)
        • Medieval manuscripts (JHU/Stanford/UIUC/)

      • Action: DG to develop a draft collection planning framework detailing the set of activities likely to be required to plan and build any specific collection. Activities might include such things as as user needs assessment, review of existing collection strengths and weaknesses, metadata and collection or metadata profiling requirements, special service requirements (e.g. distributed text analysis as appropriate for encoded literary texts, slide-table presentation as appropriate for image collections etc.). Based on this framework (to be available for review by October 11, 2004), Aquifer to define candidate collections and identify curators (individuals or groups) responsible for planning and building them

  5. Piping - moving digital library content whether pooled or puddled, into particular applications in support of specific research, teaching, and learning needs

    • Action: Pending. Work in this area should support exploitation of the curated collections that Aquifer builds and should also, preferably, take advantage of some of the tools being developed at partner sites. Accordingly, detailed development path awaits progress in collection development (4.4.) and initial population of the tools registry (1.2. above)

  6. Building core infrastructure

    1. Persistent digital asset management is likely to be key to Aquifer's success. How can Aquifer partners leverage their extensive respective experiences with digital object repositories in order to put into place a network of robust and appropriately replicated digital filestores?

      • Action: JPW to convene a group including PB to scope frame an initiative reporting back with a scoping statement and recommendations by October 11, 2004. After discussion, JPW and colleagues asked to consider at least the following directions:
        • review costs and benefits of repository options that are currently available (DSpace, Fedora, CDL's Ark-based METS repository, etc)
        • explore repository replication costs, benefits, and development options
        • explore repository co-development options

    2. Irrespective of the variety of business models that may be used within Aquifer to govern access to collections and services, authentication/authorization is likely to be essential. How can Aquifer leverage existing effort with Shibboleth to develop appropriate access control and rights management approaches?

      • Action: JMd/PB/JP to scope a shibboleth/Authentication initiative and implementation path, reporting out by October 11, 2004 for review by partnership

  7. Changing the economics of scholarly publishing

    1. The current model of scholarly publishing is incontrovertibly unsustainable. How can Aquifer help to finding alternatives that are as crucial to the academy as it is to the research libraries that play such an essential supporting role within it. How can Aquifer members leverage their commitment to resource sharing and information interoperability to find and support the development of such alternatives and in so doing help support sustainable, widespread and low-cost access to scholarly research

      • Action: DG to frame a possible initiative that leverages the work federating institutional repositories that has already been begun by CDL, MIT, Harvard, and Cal Tech, including in this initiative UIUC, Indiana, Michigan, and UVa with Minnesota, Stanford, JHU as observers. Framework to be available by October 11, 2004 for review by partners.

return to top >>


Copyright © 2005 by the Digital Library Federation