FINAL PROGRAM DIGITAL LIBRARIES '96: 1st ACM INTERNATIONAL CONFERENCE ON DIGITAL LIBRARIES March 20-23, 1996 Hyatt Regency Bethesda, Maryland USA Digital Libraries '96 is sponsored by ACM through SIGIR and SIGLINK. Other ACM SIGs have joined in cooperation, including SIGAda, SIGART, SIGBIO, SIGCAPH, SIGCOMM, SIGCUE, SIGDA, SIGMIS (formerly SIGBIT), and SIGOIS. In-cooperation sponsors include: ASIS (American Society for Information Science) CNI (Coalition for Networked Information) D-Lib (Digital Library Forum) IEEE CS (IEEE Computer Society) KSI (Knowledge Systems Incorporated) LITA (Library and Information Technology Association) LoC (Library of Congress) NAL (National Agricultural Library) NLM (National Library of Medicine) SLA (Special Libraries Association) WELCOME FROM THE CONFERENCE CHAIR Welcome to DL '96! Digital libraries meld the storage and retrieval power of computing, the communication capabilities of electronic networking, and the structures and practices of physical libraries and archives. Much of the excitement related to digital libraries comes as a result of the interactions among disparate communities of scholars coming together to address common problems of information organization, access, and use. This meeting builds on two conferences held in Texas in 1994 and 1995 and initiates a series of ACM-sponsored research conferences devoted to digital library research and development. The program committee has gathered a rich set of research papers that represent the best thinking of scholars in information retrieval, library and information science, human-computer interaction, social impact, and information policy. A series of interactive D-Lib Working Sessions will involve all of us in discussing critical topics and issues--please see the conference Web site for pointers to the papers and reports that will prepare you for active participation. Our two keynote speakers bring years of technical and practical experience in shaping digital libraries. Tutorials, posters, and workshops round out what will be a stimulating conference agenda. Gary Marchionini WELCOME FROM THE PROGRAM CHAIR On March 20 newcomers to the world of digital libraries can learn useful background through a tutorial on hypertext and information retrieval (especially good for librarians) and one on information organization (for those without much background in library and information science). More advanced tutorials cover the Z39.50 standard for client/server library access and its adaptation to digital libraries, as well as documents. The opening session brings us an exciting view of future national and international digital libraries, made possible through interoperability. Barry Leiner, a prominent proponent of digital libraries at ARPA, brings his considerable expertise in networking and distributed applications, helping us understand the implications of new and emerging technologies and high performance computing. The paper sessions begin with a summary of experiences from building digital libraries (e.g., Perseus) and the special challenges of handling multimedia. The perspective shifts to that of library and information science, and then to human-computer interaction, covering browsing, visualization and then image and spatial aspects --- new interface approaches adding great value! To conclude the first day, Ann Okerson, well-known for award-winning efforts at the Association of Research Libraries, shares her experiences with intellectual property rights, electronic journals, and practical concerns of moving from traditional to digital libraries. On March 22 the paper sessions begin with an exploration of documents, the content of digital libraries. Information retrieval contributions follow, leading to the final paper session on entering, analyzing, indexing, and clustering documents. Thus we come full circle from experience with digital libraries to technologies for their construction! The program draws speakers from as far as New Zealand, and from almost every one of the large digital library projects in the USA. The opening plenary and the final panel both show us how existing national libraries will join together in the eyes of users; users and evaluation is the theme of one of the Saturday workshops, running in parallel with another about the internationally developed TEI guidelines for text encoding. Ed Fox FEATURED SPEAKER Opening Plenary Session Thursday, March 21, 8:00-9:30am DR. BARRY M. LEINER Assistant Director, Information Technology Office Advanced Research Projects Agency "Interoperability Issues in Digital Libraries" Dr. Leiner is a Senior Scientist with the Universities Space Research Association. He is currently on loan to the Advanced Research Projects Agency, where he is Assistant Director of the Information Technology Office, responsible for the area of Networked Systems. This area is developing the information technologies required to support widely distributed operation. Particular areas of focus are mobile information systems, high performance networking, and the technologies required to facilitate distributed applications that exploit the emerging ubiquitous network environment. Prior positions include Director of Research with Advanced Decision Systems (1990-1992), Assistant Director of the Research Institute for Advanced Computer Science (1985-1990), and Assistant Director, Information Processing Techniques Office, ARPA (1980-1985). Dr. Leiner has spent the last 15 years working in the area of packet switched networking technology and its applications. Dr. Leiner is a graduate of Rensselaer Polytechnic Institute (B.S.E.E. 1967) and Stanford University (M.S.E.E. 1969, Ph.D. 1973). He is a member of Eta Kappa Nu, Tau Beta Pi, and the Internet Society and a senior member of the IEEE. FEATURED SPEAKER Banquet Speaker Thursday, March 21, 7:00-9:30 pm ANN S. OKERSON Associate University Librarian Yale University "How Will We Know When It Is a Library?" After 15 years of academic library and library management experience, particularly in serials and collections development, several years in the commercial sector, and service as a senior program officer for the Association of Research Libraries as its Director, Office of Scientific & Academic Publishing, Ms. Okerson became Associate University Librarian at Yale in September 1995, with responsibilities including making digital collections available. Ms. Okerson's activities include numerous projects, publications, advisory boards, and speaking engagements, e.g.: the "Synopsis" chapter for the Andrew W. Mellon study "University Libraries and Scholarly Communication" (11/92); 5 editions of the standard "Directory of Electronic Journals, Newsletters and Academic Discussion Lists" (1991-1995), 4 electronic networked publishing symposia organized on behalf of the ARL and the Association of American University Presses (and ed. of 3 volumes of proceedings). With James O'Donnell she edited "Scholarly Journals at the Crossroads, a Subversive Proposal for Electronic Journal Publishing" (ARL, June 1995), representing an extensive multi-national Internet discussion across many e-lists about the future of scholarly journals. Her articles on serials pricing (1987) and on copyright (1992) have won American Library Association awards for Best Article in the area of serials, acquisitions, and/or collections in both 1988 and 1993. The ALA named her Serials Librarian of the year in 1993. Since January 1993, Ms. Okerson has been co-owner and co-moderator of NewJour, the Internet list that announces new e-journal, magazine, and newsletter startups. NewJour has 2400 subscribers on all seven continents. The WWW site garners hundreds to thousands of searches daily. TUTORIALS Wednesday, March 20, 1996 Pre-registration is required for all tutorials. Room assignments will be announced. 12:30 to 3:00 pm 1A. Information Retrieval and Hypertext Edward A. Fox, Virginia Tech Robert Akscyn, Knowledge Systems Incorporated Content: An introductory to intermediate tutorial intended for those interested in information retrieval (IR) and hypertext (HT), which provides a background for: digital libraries as well as work in: content-based retrieval, hypermedia, networked information, educational courseware, and related technologies. Topics will include: 1. Hypertext Issues Data Models; User Interfaces Script Languages; Applications 2. Information Retrieval Issues Representations, Automatic Indexing Models: Boolean, String, Vector Space, Probabilistic 3. Systems as Illustrations Presentations with: KMS, Hyper-G, WWW, Envision Illustrations of: MARIAN, TOPIC, and those in DL projects Presenters: Edward A. Fox (fox@vt.edu) has a B.S. from MIT and M.S. & Ph.D. from Cornell. Since 1983 he has been at Virginia Tech (VPI&SU), where he serves as Associate Director for Research at the Computing Center and Professor of Computer Science. He has been active in ACM, serving as editor-in-chief of electronic publishing, member of the Publications Board, chair of SIGIR, and chair of the ACM Multimedia Conferences Steering Committee 1992-94. Fox directs a large NSF project on the educational application of digital libraries in CS, and led preparation of the April 1995 special issue of CACM on digital libraries (with Akscyn and others). He has published widely and given over 20 tutorials at conferences world-wide. Robert Akscyn (rma@ks.com) is President of Knowledge Systems, Inc., distributor of the KMS hypertext/CSCW system. Since the middle 1970s he has researched, developed, and commercialized advanced hypertext systems, data models, protocols, scripting languages, and application support agents, initially at CMU. He served as Founder and Chair of ACM SIGLINK (through 1993), and was editor of the Hypertext Compendium published through ACM Press. He has written and spoken widely on hypertext, hypermedia, collaborative work, electronic publishing, and digital libraries, also giving numerous tutorials on these topics. 12:30 to 3:00 pm 1B. Foundations of the Organization of Information Elaine Svenonius, University of California, Los Angeles Content: A brief introduction to the foundations of Information Organization, which will cover: a) Principles which have been used to guide the design of systems for organizing information b) Examples of methods that have been used in the design of such systems, with a particular emphasis on automated methods c) current issues and problems in designing systems for organizing information in very large online databases. Presenter: Elaine Svenonius is Professor Emeritus of the Graduate School of Education and Information Science at UCLA. She is currently teaching at the College of Information and Library Services, University of Maryland. She has written and taught in the areas of indexing, classification and cataloging. Her research focus has been on design problems in information organizing systems in the online environment. 3:30 - 6:00 pm 2A. Z39.50 Tutorial Ray Denenberg, Library of Congress Clifford Lynch, University of California Content: Z39.50 is an international standard client/server protocol for information retrieval. Z39.50-1992 is widely implemented, primarily for access to bibliographic databases. Z39.50-1995 is significantly richer and addresses search and retrieval of digital objects. Work is in progress to define a "Z39.50 Profile for the Access to Digital Collections". The Z39.50 tutorial will describe the background and history of Z39.50, its model, functionality, and technical aspects, and illustrate how Z39.50 addresses problems of access to digital collections. Presenters: Ray Denenberg (ray@rden.loc.gov) has been with the Library of Congress since 1983, and is the principal author of the Z39.50 standard. He has had overall responsibility for its development and maintenance since its inception, in 1984. He currently is responsible for the Z39.50 Maintenance Agency (see http://lcweb.loc.gov/z3950/agency). Clifford Lynch (clifford.lynch@ucop.edu) is Director of Library Automation at the University of California Office of the President. He has been involved with Z39.50 since the mid 1980s, as an implementor, as a participant in the Z39.50 implementor's group, and as the leader of the Z39.50 Interoperabilty Testbed. 3:30 - 6:00 pm 2B. Documents and Digital Libraries David M. Levy, Xerox Palo Alto Research Center Content: This tutorial will offer a conceptual framework that highlights the inseparable relationship between document technologies and social purpose (or use). Topics will include: (1) digital documents: how they are similar to and different from paper documents; (2) document genre as the nexus of form, content, and use; (3) the fixity of documents: how documents fix communication in a medium; (4) documents and change (fluidity); (5) how fixity and fluidity are orchestrated by genre; (6) the medium (in)dependence of documents. The framework will be applied to analyze one or more current issues, such as the future of cataloging. Presenter: David Levy (dlevy@parc.xerox.com) received a Ph.D. in computer science from Stanford University in 1979 and a Diploma in calligraphy and bookbinding from the Roehampton Institute (London) in 1982. He has been a researcher at the Xerox Palo Alto Research Center (PARC) for more than a decade, focussing on documents and document technologies, and has been involved in the digital library community since its inception. He has written and spoken extensively on topics including: digital libraries, the future of cataloging, document interchange standards, computers and calligraphy, and the philosophy of documents. TECHNICAL PROGRAM Wednesday, March 20, 1996 7:00-9:00pm Reception (included with registration) Waterford-Lalique Suite Thursday, March 21, 1996 7:00-8:00am Breakfast (included with registration) Crystal Ballroom Foyer 8:00-9:30 Opening Plenary Session Keynote Speaker: DR. BARRY M. LEINER Assistant Director, Information Technology Office Advanced Research Projects Agency "Interoperability Issues in Digital Libraries" Waterford-Lalique Suite 9:30-10:30 Paper Session 1: Multimedia Digital Libraries Chair: Cliff McKnight, Loughborough University, UK Waterford-Lalique Suite "Building a Digital Library: The Perseus Project as a Case Study in the Humanities" Gregory Crane, Tufts University "Towards the Digital Music Library: Tune Retrieval from Acoustic Input" Rodger J. McNab, Lloyd A. Smith, Ian H. Witten, Clare L. Henderson, and Sally Jo Cunningham University of Waikato, New Zealand "VISION: A Digital Video Library" Wei Li, Susan Gauch, John Gauch, and Kok Meng Pua University of Kansas 10:30-11:00 Break Crystal Ballroom Foyer Sponsored by Ameritech Library Services 11:00-12:00 D-Lib Working Session 1A: Metadata to Describe Information in Digital Libraries Terence R. Smith, University of California, Santa Barbara D-Lib Working Session 1B: User Needs Assessment and Evaluation Nancy A. Van House, University of California, Berkeley Working Sessions 1A and 1B will be held in the Waterford- Lalique and Cabinet-Judiciary Suites. Specific suite assignments will be announced. 12:00-1:30pm Lunch On Your Own 1:30-2:15 Paper Session 2: Library and Information Science Perspectives Chair: David Levy, Xerox PARC Waterford-Lalique Suite "The Role of Intermediary Services in Emerging Digital Libraries" Allen Brewer, Wei Ding, Karla Hahn, and Anita Komlodi University of Maryland at College Park "Toward the Bibliographic Control of Works: Derivative Bibliographic Relationships in an Online Union Catalog" Gregory H. Leazer, University of California, Los Angeles Richard P. Smiraglia, Long Island University 2:30-3:30 D-Lib Working Session 2A: Social Aspects of Digital Libraries Christine L. Borgman, University of California, Los Angeles D-Lib Working Session 2B: Repository Interactions William L. Scherlis, Carnegie Mellon University Working Sessions 2A and 2B will be held in the Waterford- Lalique and Cabinet-Judiciary Suites. Specific suite assignments will be announced. 3:30-4:00 Break Crystal Ballroom Foyer 4:00-5:00 Paper Session 3: Human-Computer Interaction: Browsing and Visualization Chair: Ann Bishop, University of Illinois, Urbana-Champaign Waterford-Lalique Suite "Graphical Table of Contents" Xia Lin University of Kentucky, Lexington "Visual Relevance Analysis" Nikos Pediotakis and Mountaz Zizi Laboratoire de Recherche en Informatique, CNRS-URA410, Univ. Paris-sud, France "A Browsing Tool of Multi-lingual Documents for Users without Multi-lingual Fonts" Tetsuo Sakaguchi, Akira Maeda, Takehisa Fujita, Shigeo Sugimoto, and Koichi Tabata University of Library and Information Science, Ibaraki, Japan 5:00-7:00 Poster Sessions Chair: Beth Davis-Brown, Library of Congress Cabinet-Judiciary Suite "SEPTEMBER - Secure Electronic Publishing Trial" Jack Brassil, AT&T Bell Laboratories MITRE Information Discovery System Raymond J. D'Amore, Daniel J. Helm, Puck-Fai Yan, and Stephen A. Glanowski, The MITRE Corporation Creating a Networked Computer Science Technical Report Library James R. Davis, Cornell University The Common Ground Surrounding Access: Theoretical and Practical Perspectives Geri Gay and June P. Mead, Cornell University Digital Libraries and Impacts on Scientific Careers Richard Giordano, University of Manchester, United Kingdom An Object-Oriented Hypermedia System for Structured Documents Hyunki Kim, Hakgene Shin, and Jaewoo Chang Chonbuk National University, Korea The Cultural Heritage Information Online Project: Demonstrating Access to Distributed Cultural Heritage Museum Information William E. Moen and John Perkins University of North Texas Inverse Mapping in the Handle Management System Varna Puvvada and Roy H. Campbell University of Illinois, Urbana-Champaign GC-ASK: A Prototype Information Discovery Project for the Global Change Data and Information System Roberta Y. Rand, National Agricultural Library Betty Coyle-Friedman, Excalibur Technologies The Electronic Reserve System at Penn State University Joan A. Reyes, Penn State University Providing Multiple Levels of Difficulty in EarthLab's Digital Library Ruth A. Ross, Ross Computational Resources Lois F. Kelso, Belmont High School Gary R. Broughton, Ross Computational Resources Edward J. Hopkins, University of Wisconsin, Madison A Framework for Pricing Services in Digital Libraries J. Sairamesh and Y. Yemini, Columbia University D. F. Ferguson, IBM Watson Research Center C. Nikolaou, ICS-Forth, Greece Establishing Computer-Based Information Services in the School Library Werner Schoggl, Vienna, Austria MESL Project Description Howard Besser, University of Michigan 7:00-9:30 Banquet Banquet Speaker: ANN S. OKERSON Associate University Librarian, Yale University "How Will We Know When It Is a Library?" Crystal Ballroom Friday, March 22 7:00-8:00am Breakfast (included with registration) Crystal Ballroom Foyer 8:00-8:45 Paper Session 4: Human-Computer Interaction: Images and Spatial Organization Chair: Su-Shing Chen, University of North Carolina, Charlotte Waterford-Lalique Suite "User Controlled Overviews of an Image Library: A Case Study of the Visible Human" Chris North, Ben Shneiderman, and Catherine Plaisant University of Maryland at College Park "A Spatial Approach to Organizing and Locating Digital Libraries and Their Content" Jason Orendorf and Charles Kacmar Florida State University 9:00-10:00 D-Lib Working Session 3A: Digitization and Conversion M. Stuart Lynn, University of California, Office of the President D-Lib Working Session 3B: Naming Objects in the Digital Library William Y. Arms, Corporation for National Research Initiatives Working Sessions 3A and 3B will be held in the Waterford- Lalique and Congressional-Georgetown Suites. Specific suite assignments will be announced. 10:00-10:30 Break Crystal Ballroom Foyer Sponsored by The UnCover Company 10:30-11:30 Paper Session 5: Documents Chair: Henry Gladney, IBM Almaden Research Center Waterford-Lalique Suite "Index Structures for Structured Documents" Yong Kyu Lee, Seong-Joon Yoo, Kyoungro Yoon, and P. Bruce Berra Syracuse University "Toward Active, Extensible, Networked Documents: Multivalent Architecture and Applications" Thomas A. Phelps and Robert Wilensky University of California, Berkeley "Physical Objects in the Digital Library" Richard Furuta, Catherine C. Marshall, Frank M. Shipman III, and John J. Leggett Texas A&M University 11:30-1:00 Lunch On Your Own 1:00-2:00 Paper Session 6: Information Retrieval Chair: Bruce Schatz, University of Illinois, Urbana-Champaign Waterford-Lalique Suite "Natural Language Information Retrieval In Digital Libraries" Tomek Strzalkowski, GE Corporate Research & Development Jose Perez-Carballo, Rutgers University Mihnea Marinescu, New York University "Interactive Term Suggestion for Users of Digital Libraries: Using Subject Thesauri and Co-occurrence Lists for Information Retrieval" Bruce R. Schatz, Eric H. Johnson, and Pauline A. Cochrane, University of Illinois, Urbana-Champaign Hsinchun Chen, University of Arizona "Information Product Evaluation as Asynchronous Communication in Context: A Model for Organizational Research" Lisa D. Murphy, Indiana University 2:00-3:00 Paper Session 7: Document Indexing and Analysis Chair: Nancy Ide, Vassar College Waterford-Lalique Suite "Text to Hypertext: Can Clustering Solve the Problem in Digital Libraries?" Robert B. Kellogg, PRC., Inc., Reston, VA Madhan Subhas, Virginia Tech "Indexing Handwriting Using Word Matching" R. Manmatha, Chengfeng Han, E. M. Riseman, and W. B. Croft University of Massachusetts, Amherst "Building a Scalable and Accurate Copy Detection Mechanism" Narayanan Shivakumar and Hector Garcia-Molina, Stanford University 3:00-3:30 Break Crystal Ballroom Foyer 3:30-4:30 Panel: Digital Library Research at the U.S. National Libraries Chair: Charles Kacmar, Florida State University Waterford-Lalique "The Role of the Library of Congress in the National Digital Library" Herb Becker, Library of Congress "Agricultural Network Information Center (AgNIC): A Model for Access to Distributed Resources" Richard E. Thompson, National Agricultural Library "Knowledge-Based Biomedical Information Retrieval" Alexa T. McCray, National Library of Medicine D-Lib Working Sessions are based on continuing activities by the working groups. Before coming to the sessions, conference participants need to read the working group materials. They are available at: http://www.dlib.org WORKSHOPS Saturday, March 23, 1996 Workshops are scheduled from 9:30 am through 3:30 pm and include a box lunch. Each workshop is intended for researchers or practitioners with an active interest in the subject matter. Pre-registration is required. Workshops will be held on the meeting room level. Specific room assignments will be announced. Workshop A: Text Encoding Initiative Nancy Ide, Vassar College Judith Klavans, Columbia University Jean Veronis, Universite de Provence, France Workshop B: User Needs Assessment and Evaluation Ann Bishop, University of Illinois Nancy Van House, University of California, Berkeley David Levy, Xerox PARC On behalf of the participants of Digital Libraries '96, the conference organizers would like to acknowledge the generous support of our corporate sponsors: AMERITECH LIBRARY SERVICES Provo, Utah EXCALIBUR TECHNOLOGIES CORPORATION McLean, Virginia UNIVERSITY OF MARYLAND AT COLLEGE PARK COLLEGE OF LIBRARY AND INFORMATION SERVICES College Park, Maryland UNIVERSITY OF MARYLAND AT COLLEGE PARK COMPUTER SCIENCE CENTER PRESENTATION GRAPHICS LAB College Park, Maryland OPEN TEXT CORPORATION Waterloo, Ontario, Canada THE UnCOVER COMPANY Denver, Colorado We appreciate their support and wish to thank them very much. CONFERENCE ORGANIZATION Conference Chair: Gary Marchionini Univ. of Maryland at College Park Program Chair: Ed Fox Virginia Tech Treasurer: Larry Fitzpatrick Open Text Corporation Registration: Linda Hill Univ. of Maryland at College Park & CESDIS Publicity: Nancy Van House University of California, Berkeley Local Arrangements: Lida Larsen Univ. of Maryland at College Park Tutorials: Edie Rasmussen University of Pittsburgh Workshops: Maria Zemankova National Science Foundation Panels: Charles Kacmar Florida State University Posters: Beth Davis-Brown Library of Congress Conference Evaluation: Phil Doty University of Texas Industry Liaison: Roberta Rand National Agricultural Library ORGANIZING COMMITTEE William Arms, Corporation for National Research Initiatives Ed Fox, Virginia Tech Richard Furuta, Texas A&M University David Levy, Xerox PARC Gary Marchionini, University of Maryland at College Park PROGRAM COMMITTEE Robert Akscyn, Knowledge Systems Inc. Robert Allen, Bellcore William Arms, Corporation for National Research Initiatives Daniel Atkins, University of Michigan Ann Bishop, University of Illinois at Urbana-Champaign Christine Borgman, University of California, Los Angeles W. Bruce Croft, University of Massachusetts, Amherst Su-Shing Chen, University of North Carolina, Charlotte Steven DeRose, Electronic Book Technologies, Inc. Timothy Finin, University of Maryland Baltimore County Mark Frisse, Washington University School of Medicine James French, University of Virginia Richard Furuta, Texas A&M University Robert Futrelle, Northeastern University Hector Garcia-Molina, Stanford University Henry Gladney, IBM Almaden Research Center Ephraim Glinert, Rensselaer Polytechnic Institute John Guidi, University of Maryland at College Park Thomas Hickey, OCLC Nancy Ide, Vassar College Rob Kling, University of California, Irvine Ronald Larsen, University of Maryland at College Park John Leggett, Texas A&M University Enrica Lemut, Istituto Matematica Applicata CNR Michael Lesk, Bellcore David Levy, Xerox PARC Clifford Lynch, Univ. of California Office of the President Catherine Marshall, Texas A&M University Cliff McKnight, Loughborough University Francis Miksa, University of Texas at Austin Eugene Miya, NASA Ames Research Center Sung Myaeng, Chungnam National University A. Desai Narasimhalu, Institute of Systems Science Gultekin Ozsoyoglu, Case Western Reserve University Roy Rada, Washington State University P. Venkat Rangan, University of California at San Diego Pamela Samuelson, Cornell Law School Bruce Schatz, University of Illinois at Urbana-Champaign John Schnase, Missouri Botanical Garden Terence Smith, University of California at Santa Barbara Scott Stevens, Carnegie-Mellon University SEI Chris Welty, Vassar College Terry Winograd, Stanford University DIGITAL LIBRARIES '96 Registration Desk: meeting room level Wed: 11am-7pm Thu: 6:30am-6:30pm Fri: 6:30am-5pm Sat: 8am-noon Email Room: Cartier-Tiffany Salon Conference Office: Executive Boardroom (map of hotel not included here)