EMANI Meeting July 25-26, 2002 Cornell University Library Ithaca, New York EMANI – Electronic Mathematical Archiving Network Initiative: An international collaboration to support and coordinate the long-term preservation and open accessibility of mathematical publications in digital form. Project Coordinator: Bernd Wegner 1. Project Tasks and Priorities Immediate Send proposed advisory board names to Wegner Distribute PDFs of EMANI logo to team Press release Survey of the EMANI partners on metadata requirements Collect and distribute “problematic examples” to test metadata September Send work package reminders 5 weeks prior to November meeting DOI report October Definition of work packages due – send to Becker November November meeting Website operational Minimum list of metadata requirements available by November meeting Functional requirements for users – proposal available by November meeting (prototypes by March 2003) Report on the NSDL No date / ongoing Prepare budget for EMANI Prepare 2-year plans for digitization (toward a registry). Submit descriptions to Becker Assemble a list of organizations for outreach Conference papers and publications (partner should review before submission) Funding efforts Interoperability of distributed system All Springer Wegner and Hasan Göttingen All Wegner De Kemp All WPs Göttingen Springer and Neuroth Metadata WP Access, Nav, Design WP Becker Springer All partner organizations All All All EMANI Meeting July 25-26, 2002 2. Presentations Hans Becker (SUB Göttingen), “Retrodigitization” Pierre Bérard (Grenoble), “NUMDAM” Arnoud de Kemp and Syed Hasan (Springer), “Formalizing Project EMANI” Gertraud Griepke (Springer), “Contents and Formats: Existing Digital Sources” Jiang Airong, “The Development of the Chinese Math Digital Library at Tsinghua University” Bernd Wegner (TU Berlin), “Information dissemination” Bernd Wegner, “Ongoing Projects” Project Euclid economic model 3. Meeting Minutes Below are the minutes from the July 25-26 meeting of the EMANI project, held in Olin Library at Cornell University, Ithaca, New York. The notes are my paraphrase of the transactions and should not be taken as a direct transcript of the discussions. Links to participants’ PowerPoint presentations are provided above, as well as in the text of the minutes. Brief notes on the presentations in the minutes are meant to supplement the slides. – Kizer Walker N.b. Tasks requiring specific follow-up from participants are marked with “” and bold face. Thursday, July 25 Sarah Thomas (CUL) opened the meeting by welcoming the EMANI partners to Cornell University Library and wishing the group success in its collaboration. Bernd Wegner (TU Berlin/Zentralblatt) provided an overview of projects relevant to digital archiving in general and the archiving of math materials in particular. [View presentation]. He noted that several participants in the EMANI meeting would go on to attend the July 29-30 planning meeting of the Digital Mathematics Library project at the US National Science Foundation in Washington, DC. Among other developments, Wegner pointed to a backfiles system being developed by Elsevier for 39 of its journals as a possible competitor for EMANI. He emphasized the importance of establishing visibility for EMANI, noting that content results should be made available as soon as possible – even before the infrastructure is perfected, since the infrastructure is not visible. Discussion: Gertraud Griepke (Springer Heidelberg) stressed that the priority assigned to archiving digital materials must influence current production. Today’s digital production must facilitate a future transition to the archives. What is new and significant about EMANI is this way in which the project unites future and past. Bernd Wegner agreed, adding that this influence on the production of digital materials must be a measure of EMANI’s success, not only what the project is able to make available on the web. 2 EMANI Meeting July 25-26, 2002 Hans Becker (SUB Göttingen) cited the development (e.g. in Germany and the UK) of preservation and archiving projects that are regionally based, rather than subject-based. SUB Göttingen is attempting to play the role of coordinating these two approaches. Tom Hickerson (CUL) explained that in the US context, subject-based, national (LoC), and institutional archives (i.e., archives that collect scholarship in the institution where it is produced) are typically understood as competing models. He insisted that these models must interoperate. Today (especially in the US context), there can be no single solution, and at the moment, no single model is dominant. EMANI can serve as an exemplar of the subject-based approach and at the same time advocate for interoperability among the different models. Several of the Work Package subjects assigned at the February Heidelberg meeting were discussed in the context of a grant proposal that EMANI partners from Cornell and Göttingen are preparing for submission to NSF/DFG. The proposal – “Access to Mathematics Over Time: Cooperative Management of Distributed Digital Archives” -- responds to a call for proposals from DFG and NSF’s joint funding program for International Digital Libraries Research (see <http://www.dfg.de/foerder/biblio/neues/dfg_nsf.pdf>). The NSF/DFG proposal took priority at the EMANI metadata meeting in Göttingen in May. Some of the participants at the Ithaca meeting also met separately to finish the proposal, which had an August 1 submission deadline. David Ruddy (CUL) described the proposed collaboration between SUB Göttingen and CUL as a project that could build, with external funding, core working components of EMANI. If funded, the project will focus on the archiving of electronic journal materials in a distributed system that is open and non-proprietary, taking into account the heterogeneous systems at the different institutions. The partners would develop metadata to support long-term archiving in the framework of the Open Archives Initiative. Hickerson stressed that the grant activity should not be seen as an attempt to break away from the larger EMANI project. The proposal responds to a specific funding opportunity that has arisen. It is a grant proposal to the two funding agencies, but should be understood at the same time as a proposal to the EMANI group, a suggestion of the direction the larger project can take. Becker urged that the grant be pursued as a means of fulfilling the tasks laid out in the EMANI Work Packages, but that the Work Package structure be retained, along with the assignments made to the particular organizations in EMANI. Hickerson emphasized that while the NSF/DFG grant is an International Digital Libraries Research grant specifically requiring US-German collaboration, in the EMANI context, the grant activity would be the business of all the national partners. Here discussion diverged from the NSF/DFG proposal. Wegner asked whether the style files promised by Springer-Verlag in at the Heidelberg meeting in February were forthcoming. The Springer partners assured the group that the publisher is at work on style files for worldwide production. Wegner suggested the group resume the discussion begun at the Göttingen meeting (May ’02): What does EMANI want to preserve? What are the minimum requirements? 3 EMANI Meeting July 25-26, 2002 Thomas posed the question of the extent to which “context” should be preserved along with “content.” For instance, is it desirable to archive information about a journal’s editorial board? This is not only a metadata question, but also a policy question. Not only a question for mathematicians, but also a question for historians and other user groups. Arnoud de Kemp (Springer Heidelberg) characterized the “context” question as a structural question: how do the originators of information structure that information? Pierre Bérard (Grenoble) responded that this is a technical point of view, and not an end-user perspective. On the particular question of editorial boards, Ruddy noted that Project Euclid had made a decision to capture them in the metadata. But it would difficult to include them retroactively. Wegner proposed that the group develop a set of potential preservation problems and a set of minimum requirements, with examples. Hickerson agreed that these kinds of questions need to be addressed immediately as a starting point for future developments. Hickerson suggested a new Work Package might focus on the question of minimum requirements for content/context archiving. Jean Poland (CUL) suggested that math librarians consult historians of science on the question of “context” preservation needs. Griepke: EMANI needs to arrive at a clear definition of what a “journal” is (including, e.g., table of contents, index). De Kemp: This is a question of structure – the structure of the journal. Becker noted that SUB Göttingen has already developed a document to address this. Heike Neuroth (SUB Göttingen) is working on a survey of the EMANI partners on metadata requirements. Becker reminded the group that we must consider the economics of preservation in formulating metadata requirements. Hickerson cautioned against expending too much of the group’s efforts in the search for a perfect solution – we must make our best guess at what must be preserved and move on. Must be flexible and pragmatic and not get bogged down by the immenseness of the question of what to keep. Hickerson proposed the November EMANI meeting as a deadline for developing a minimal list of metadata requirements. Marcy Rosenkrantz (CUL) asked whether the group is prepared to preserve TeX on the assumption that TeX will continue to be usable in the future. Ruddy: If we want to preserve presentation formats, preserving TeX will not suffice. 4 EMANI Meeting July 25-26, 2002 Steve Rockey (CUL) added that publishers must decide whether print or electronic will be the copy of record. In summary, Thomas noted that two pieces are needed to move forward with the formulation of metadata requirements: the results of the Göttingen metadata survey and examples of problematic materials. Neuroth will distribute survey and results. Wegner agreed to collect and distribute examples of problematic materials. Griepke agreed to distribute examples from Springer. The problematic examples will be tested against the proposed metadata – can the metadata describe the examples? The five participating groups will make recommendations based on the tests. Access, Navigation, Design Hickerson: the next step after coming to some conclusions about metadata requirements will be to create models/prototypes of a user system. When should such models be ready for review by the group? In one year? Neuroth: Six months would be better. If prototypes are available sooner, this experience can still influence the development of the metadata. Ruddy: The access, navigation, design group (headed by CUL) will propose functional requirements for users in time for the November meeting. Ruddy asked for clarification of whether the group understands the EMANI project to be creating a “dark” or a “light” archive. Are access controls, etc., a requirement? He noted that he has been working under the assumption that the project will produce a light archive. Hickerson recalled that Rüdiger Gebauer had spoken at the February meeting in Heidelberg of opening the archive after a “short period.” He asked for the Springer-Verlag partners to make clear if this was not correctly understood. De Kemp responded that it will not be difficult to discuss opening access to the archive with Springer. He pointed to the open discussion between Springer and Göttingen on Göttingen’s digitization of Springer math books. Springer is willing to discuss an open archive – but probably with restrictions. This might mean a model with 2 or 3 time layers. In any case, Gebauer must be involved in this discussion. There is currently no written policy on this question at Springer. Hickerson answered that time constraints on access, a rolling model, do not pose a problem for CUL. What would be problematic is a subscriber-specific system to support subscription processes – this would be too expensive for the library. De Kemp noted that part of the question of access has to do with the definition of “archive”: is it old materials? Current library collections? 5 EMANI Meeting July 25-26, 2002 Access, Navigation, Design tasks: basic functional specifications: by November meeting prototypes for presentation to the group: March 1 (to share with the group for discussion in April) Participants from the CUL group noted that the prototypes should not represent a universal model, but rather prototypes to be adapted to needs of different user communities. Architecture Frank Klaproth (SUB Göttingen) opened the discussion on EMANI architecture. He emphasized that the system should be OAI compliant. It will be a distributed system. Rights management must be addressed, as well as the question of which institution holds the digital materials. Wegner outlined a model of a distributed system in which copies of content would be stored at multiple mirror sites, but individual institutions would retain proprietary rights over specific content. This is the model employed by Zentralblatt MATH and EMIS. De Kemp said he expected such a model would be of interest to CUL and Göttingen, but wondered whether France or Tsinghua would also be interested. Bérard responded that the French partners would be interested in principle, but that it is too early for a decision. Thomas referred to the ongoing discussion in the archives community of the status of mirror sites. Most hold that while mirror sites represent an important safety net for access, they do not constitute an archive. Nancy McGovern (CUL) drew a distinction between redundancy and replication in the context of digital archiving. Because a mirror site holds copies of documents within an environment that replicates the main site, it is vulnerable to the same problems as the main site. Redundancy in the sense advocated by the LOCKSS project (http://lockss.stanford.edu/) provides more security by storing copies in a different environment. Wegner added that the EMANI architecture should accommodate updates. De Kemp envisioned a docking station for the delivery of information directly (i.e., SpringerVerlag to EMANI) in the desired format. Springer wants to see a platform for the delivery of digital information that offers an alternative to Elsevier. This would not be a proprietary Springer system, but an open system with participation from multiple publishers. Thomas summarized the criteria for evaluating the EMANI architecture: capacity for interoperability with other systems, accommodation of updates. Retrodigitization Becker presented on SUB Göttingen’s Retrodigitization plan for the next two years. [View presentation]. Göttingen will continue work on Springer journals. They wish to create a definitive list of Springer math journals. Teubner journals are also a priority, but like Birkhäuser, Teubner is 6 EMANI Meeting July 25-26, 2002 a sister company of Springer and issues of control must still be worked out. The plan also includes digitization of all the back sets of all the EMIS journals. See presentation for other priorities for journals, monographs. Wegner and Griepke pointed out Springer has already prepared a list of Springer’s math journals. De Kemp: Springer is interested in testing monograph digitization. Up to now, this has not been a success. Serviceable on-demand printing is still ten years away – print and binding quality is not up to expectations. Registry discussion: Need a registry of math materials that will be digitized? Keith Dennis (Cornell) asked for clarification on the purpose of such a registry. Is its purpose to indicate to libraries what is to be digitized? Or is the point to show users what is available in digital form? If the registry is for math users, Zentralblatt and Math Reviews already function as a repository. Participants from CUL all understood the registry as a prospective list with the purpose of managing digitization. Thomas: a registry of intent to digitize. Bérard noted that France is making a list to guide future digitization. A list of priorities. De Kemp maintained that DOI amounts to an agreed-upon repository. Springer is making DOIs for all digital documents and could register DOIs for EMANI. Whoever has the rights to distribute digital material can register a DOI for that material. De Kemp will attend an upcoming DOI meeting in Boston and will report back to the EMANI group. It was agreed that each EMANI partner would prepare a description of what they propose to digitize in the next two years. These will be submitted to Hans Becker who will consolidate and organize the list. This list is meant to evolve into a mature registry at a later date. For more on the Digital Library Federation’s registry model, see: “The Case for a Digital Registry” <http://www.diglib.org/collections/reg/regcase.htm>. Report on Digitization Projects Jiang Airong (Tsinghua UL) presented on The Development of the Chinese Math Digital Library at Tsinghua University. [View presentation]. The pre-1911 Chinese mathematics literature amounts to approximately 160,000 pages (Jiang did not have a figure for materials since 1911, but estimated more than 1 million pages). Current Chinese digitization efforts intend to cover 60 math journals with over 70,000 articles. Bérard reported on the NUMDAM project. Topics included technical specifications, metadata structures, quality control procedures, etc., as well as the relationship between the NUMDAM and EMANI and NUMDAM and the Digital Math Library. [View presentation]. 7 EMANI Meeting July 25-26, 2002 Note: On “programme overview” slide (#12), pink represents outsourced processes. NUMDAM recycles the metadata from CUL and Göttingen for its monograph database. Bérard noted that NUMDAM’s bibliographies are keyboarded. The next step is to add Zbl/MR numbers to the bibliographies. The beta version of NUMDAM’s citation tool is expected in October. Bérard discussed the issue of author identification and permissions. In French law, permission must be obtained from the journal and the author. NUMDAM has been advised by lawyer to show a good-faith effort to get permission from the author. When the project is unable to reach an author, the material is posted, but removed if the author requests. NUMDAM is publicly funded. Sustainability Thomas recalled reporting at the February 2002 EMANI meeting on Project Harvest – an exploratory project in subject-based digital archiving in agriculture that was funded by the Mellon Foundation. It had been CUL’s hope that Mellon might fund a mathematics project along these lines, but the foundation has since shifted its priorities. CUL will need to look for other options to support EMANI participation. McGovern described the competing publisher- and subject-based models of digital archiving that have been under discussion at CUL and elsewhere. The subject-based digital archive (SBDA) model was a key lesson from Project Harvest, but discussion has now moved to a notion of an integrated matrix of repositories, some publisher-based, some subject-based. This integrated model belongs to the broader project at CUL of building a Common Depository of digital materials. This is conceived as an interoperable, distributed system. This reflects a shift in orientation at CUL from individual digital projects to a digital program for the institution. CUL is working with the Open Archival Information System (OAIS) model. While CUL does not have a fully-implemented OAIS archive, it does have pieces. Griepke raised the question of how to “brand” digital content. With multiple projects making math materials available, how does the user choose, especially when the “real” content – the OCR layer – is invisible? We need a quality stamp. Wegner: Quality includes comprehensiveness. How do we find a balance between quantity and quality? JSTOR provides an excellent core math collection, but it is a small sample. De Kemp: One aspect of EMANI is quality control. Thomas concluded the sustainability discussion with the question of how to move forward, especially considering the loss of Mellon as a potential funding framework. CUL continues to do preservation research. We may be able to look to the National Science Foundation for support in the context of the Digital Math Library. 8 EMANI Meeting July 25-26, 2002 EMANI Prototypes Becker: An important component of EMANI will be to make retrodigitized and born-digital materials available through the same interface. Becker discussed the prospect of gathering and merging data from Zentralblatt and the Jahrbuch. He pointed to the EULER project as an example of integrated access to dispersed math materials through a common user interface (http://www.emis.de/projects/EULER/). Cornell Math Projects Poland reported on various CUL digital projects related to mathematics. See the minutes to the May 2002 CUL meeting on digital math projects: <http://www.math.cornell.edu/~library/digitalmath-projects.html> Poland highlighted CUL’s Technical Reports Library (http://techreports.library.cornell.edu), a collection of computer science research reports and papers that descends from the NCSTRL (Networked Computer Science Technical Reference Library) project. CUL views the Tech Reports Library as a prototype collection for an institutional repository model of digital archiving and access provision. CUL hopes to add capacity to handle the ingest of more complex documents than are stored in the Tech Reports Library. Poland and Hickerson described the economic model developed for Project Euclid. [Euclid economic model]. De Kemp suggested that if CUL finds it cannot sustain financial support for Project Euclid, Springer might be interested. Hickerson suggested CUL might be interested in discussing a marketing agreement for Euclid with Springer. De Kemp asked whether CUL supports the Open Archive Initiative (OAI). Hickerson responded that Cornell has been involved in its development. But he noted his impression that OAI is more popular in Europe than in the US. In the US, interest is primarily in its archiving uses; in Europe there seems to be more interest in OAI’s implications for selfpublishing. It would be interesting to fold the OAI discussion into the EMANI project. Relationship of EMANI to DML Several participants in the EMANI meeting went on to attend the first planning meeting of the Digital Mathematics Library project, July 29-30 at the NSF in Washington, DC. Dennis: EMANI is based on the premise that Springer-Verlag will provide access to archived materials, but will not be charged with paying for the work of archiving. The DML represents a search for funding for third parties to do this work. 9 EMANI Meeting July 25-26, 2002 De Kemp wished to express Springer’s interest in EMANI in a different way: Springer-Verlag is interested in long-term preservation and access to mathematics. Thomas: Phillipe Tondeur sees the DML as his legacy at NSF. And NSF seems to have money to fund the project. We hope this wave of interest in math digitization will benefit the EMANI project. Wegner: A high-profile project, such as the DML promises to be, will have a positive effect on local efforts. De Kemp asked about the extent to which EMANI needed to be formalized in advance of the DML meeting. Rosenkrantz: Formalizing EMANI agreements will help to “sell” the DML to funding agencies. Bérard noted that in the French case, in order for public funds (CNRS) to be put into the project, a clear statement from Springer is needed that the worldwide mathematics community will benefit from the work. De Kemp asked whether a paper was needed that would define EMANI for the DML project. The group agreed that this was not needed at this stage, and might even be seen as offering competition to the DML. Friday, July 26 Griepke presented on Springer’s electronic publishing program, outlining what she called a “pure Springer” approach (rather than wider Bertelsmann policies and processes). [View presentation]. She discussed enhancements to content, functionality, and access for readers and advantages of branding and access for authors. She emphasized Springer’s attention to streamlining the digital production process and workflows, and the impact of Springer’s commitment to archiving on production processes. Griepke presented a list of 22 journal titles in mathematics – Springer titles excluding Birkhäuser, identified as “mathematics” from the point of view of SUB Göttingen (not identical with Springer’s definition, which would include statistics, etc.). The task is to digitize the full back set of these 22 titles. Griepke estimates that this comes to approximately 925,000 pages (the estimate comes from a page count of each of these journals from the past two years, extrapolated back to issue one, and subtracting the pages that are already in digital form). Around 60% of the pages to be digitized are in four journals. Griepke and De Kemp cautioned that EMANI must avoid the tendency to produce an overly complex workflow, especially if a distributed system is adopted. More discussion of the evolution of digital workflow within Springer in the past five years. Participants acknowledged the need to provide controls so that access is not interrupted if a journal changes publishers. 10 EMANI Meeting July 25-26, 2002 The question of preserving “context” was taken up again. Is a link needed between each article and an editorial board for the issue? Griepke: Preserving advertisements is probably not important. Thomas: In many fields, researchers want to see advertisements. The group discussed linking to and from errata, which Springer currently provides. De Kemp: Previously, Springer had not dealt with metadata for letters to the editor, conference announcements, and similar journal contents. ISI demanded that Springer go back and include these. The Springer team asked how libraries want to handle the metadata for book reviews and similar. Is there a need to index these? Springer needs to know now, to determine how to ship, etc. Wegner: How is the reader guided to book reviews and similar contents in the electronic environment? Normally with digital material, the reader goes straight to the article. De Kemp: A well-structured search engine will identify a book review. Wegner: But in a print environment, the reader typically does not search for this kind of item. Thomas: The library’s interest in preserving access to supplementary material such as book reviews has to do with the contextual relationship of this material to the main journal content. If in the electronic environment reviews, etc., are moved to a separate database, then there is not so much of a concern. But as long as this material is part of the context of the content, as it has been in the print environment, it needs to be accessible. Wegner: We need a solution that does not artificially integrate these types of content in imitation of print. Griepke: A journal is not only its content, but also a community organizer. Are we archiving only journal content, or also this community organizing function? The group discussed the implications of the ability, in the digital environment, to change an already published work though the addition of links, errata, etc. Springer’s firm rule in this regard is that no article is changed after publication. (In this connection, Wegner referred to the IMU’s Committee on Electronic Information Communication “Best Current Practices” document: <http://www.ceic.math.ca/ceic_docs/best_practices/Best-Practices.pdf>.) Springer provides linkenabled references that point to a separate page of links that can be updated. An article’s original reference page remains unchanged in the PDF while the separate HTML page of references evolves. Wegner pointed to the potential problem of changing the reception of an article through additional references and other materials; the Springer team responded that the segregation of the additional material from the article addresses this. Dennis proposed this model: The archive contains article content only with a link to additional services. The archived “static content” is free, but the publisher can charge for the added services. This is what the publisher gets in return for giving away the content. De Kemp: Well defined. 11 EMANI Meeting July 25-26, 2002 De Kemp posed a question to libraries and mathematicians: Is there a need to identify illustrations, formulae, etc., separate from the text of an article (searchable mathematics)? Wegner: Project MOWGLI is working on searchable formulae. Dennis: This is one purpose behind MathML – not all mathematicians are interested in this issue, but some are. De Kemp: If this is of interest, it is something that Springer wants to explore. Thomas and Ruddy noted that Project Euclid focus groups did not show much interest in these kinds of features. Perhaps it is too expensive. Rosenkrantz: Structure searching in chemistry is very important – the need for such features is community defined. Poland: We do not know whether the math community is interested, since we have not offered these features. Wegner: When these features become possible, Springer will no doubt offer them. Then they will need to be archived. Ruddy asked if mathematicians formulate equations in a uniform way. The clear answer from the group was: no. Ruddy: Perhaps MathML will introduce standards. Springer is involved in the European Commission’s OpenMath project (http://www.nag.co.uk/projects/OpenMath.html) that is working on searchable equations. De Kemp summed up the discussion, noting that features such as searchable formulae are of interest to the group as a possible future direction, but not a realistic prospect at present. On monographs, Griepke noted that Springer’s information model for books is similar to that for journals with a uniform workflow. She noted that digitized monographs require guides for readers to render context and promote browsability, since full-text search engines throw readers into the middle of a complex environment. Formalizing EMANI Syed Hasan (Springer NY) and the Springer team led a discussion of various aspects of EMANI’s formalization: mission, organization, branding, outreach, etc. Hasan’s PowerPoint on “Formalizing Project EMANI” was updated during the course of the discussion to reflect decisions reached on these points. The group agreed upon the following statement of EMANI’s mission: “An international collaboration to support and coordinate the long-term preservation and open accessibility of mathematical publications in digital form.” The term “publications” was chosen after some discussion over alternatives such as mathematical “literature,” “resources,” “material,” etc. The entire group firmly endorsed the goal of open accessibility. 12 EMANI Meeting July 25-26, 2002 The name of the project was fixed as Electronic Mathematical Archiving Network Initiative. The use of the definite article with the acronym (i.e., “The EMANI”) was rejected. It was noted that “selecting relevant content,” which appears under the heading “Aims and Scope” in the presentation, does not mean “quality control” (but rather determining the scope of the collection. With regard to group structure (again, see presentation), the Core Team consists of the representatives of the participating institutions (Cornell University Library, SUB Göttingen, Tsinghua University Library, Orsay Mathematical Library/CMD, Springer-Verlag and its associated publishing houses [Birkhaeuser, Vieweg, Teubner], ELibM in EMIS, and the project Coordinator. Project Coordinator is Bernd Wegner. The precise role of the Advisory Board is still being defined, but it was agreed that it is not “over” the Core Team. It is similar to an editorial board, a scientific board. It is not a management team. The group was shown several proposals for an EMANI logo that had been prepared by the Springer team. Meeting participants favored C, below, but there was discussion about whether the yellow background might too closely associate the project with Springer. The Springer group will bring that concern back to the graphic artists. Springer will have PDFs of the logo made up for use by the EMANI team. A. B. C. The group wanted to pair the logo with a slogan and tentatively settled on “Serving the mathematics community.” MathPortal Rüdiger Gebauer (Springer NY) joined the meeting by speakerphone. Springer considered a mathematics portal a few years ago. Various ideas were generated at that time. Now, reconsidering portal idea in connection with EMANI. The portal would aim to be the leading community organizer for mathematicians. Would include an events calendar, conference information, pre-print links, links to various projects, etc. Thomas said that CUL had also long harbored the goal of creating a math portal – this was under consideration when Project Euclid was launched. At that time, it was decided that the environment was too competitive – the AMS and others already covered many of the functions envisioned for the portal. But perhaps the environment / alliances have shifted. 13 EMANI Meeting July 25-26, 2002 The group thought that there had not been a significant shift. Wegner: First, consider how the existing pieces could be assembled, then consider whether more is needed. Consider forging coalitions. Starting from scratch would be too expensive. As an alternative to the portal model put forward by Springer, Dennis proposed that what mathematicians want is a Google-like search engine that is math specific. Ruddy asked whether AMS would allow MathSciNet records to be harvested for this purpose. Dennis: No. What would they have to sell if they gave the records away? Ruddy: This would mean giving only the metadata away, not the reviews. Dennis: But many people go to MR/Zbl just for citations. Ruddy: We need to encourage the parties to make their records available via OAI. Griepke discussed the World Health Organization’s Health InterNetwork Access to Research Initiative (HINARI), which provides developing countries access to leading medical journals through a medical and life sciences portal. HINARI offers a useful model for our math portal discussion because its scope is well defined and it serves a limited community. Wegner urged the group to consider carefully whether there is a real need for the information services (such as conference information) that a portal would provide. Or are mathematicians satisfied with existing sources? Thomas likewise cautioned against creating a redundant service – we need to channel our efforts and not duplicate existing services. Jerry Curtis (Springer NY) insisted that we would need base our discussion of a portal on a survey of the math community. Impossible to reach a conclusion in this meeting. “One-stop shopping” is talked about a lot, but up to now has not appeared. Thomas raised the question of a business model for a math portal. De Kemp responded that content issues would need to be discussed first. Gebauer noted that the costs involved in building and maintaining a math portal would not be nominal. It would need significant editorial work – one head editor, plus regional contacts. Hickerson agreed: leadership, capital, constant management. What warrants the investment? De Kemp: Would CUL subscribe? Rockey: Only if the faculty demanded it. Only if the portal offered things that could not be found elsewhere. Thomas: Only if the portal were a real productivity tool for mathematicians, significantly better than free sources. Poland: The library perception is that a sharp librarian can find the information without a portal. Hickerson: We need to know what we can do before making any announcements. We need a stronger commitment of resources from the meeting participants before pursuing the portal idea. Hasan: If the portal were a failure, this could damage EMANI. 14 EMANI Meeting July 25-26, 2002 De Kemp: This preliminary discussion has planted a seed. Gebauer: Perhaps a small working group should develop the idea for the next EMANI meeting in the fall. Thomas: The National Science Digital Library (NSDL) (http://comm.nsdlib.org/) is an emerging prototype. Instead of a working group, we could look at carefully at the NSDL. Hans Becker will report at the November meeting on the feasibility of using NSDL harvesting techniques. Curtis: The group should channel its energies to EMANI, not risk distraction. After EMANI is moving, we can ask the math community what might be added. Gebauer: Yes. Adding the portal idea to the agenda was not meant to distract from the core EMANI project. EMANI structure Advisory board Wegner proposed that there be two boards: An “editorial committee” (or similar) made up of respected mathematicians would represent EMANI to the math community. It would function like an editorial board. A “scientific and technical advisory board” with IT expertise and strong ties to the library community. This would not be a steering committee; rather, would give advise on technical matters Poland: asked for clarification on the difference between the two boards. Gebauer: One represents end users, the other represents librarians. Hickerson noted the boards’ outreach function. Gebauer: The boards should be small and extremely professional. Members would not be figureheads. Hickerson: We need to have the advisory groups in place early on, but we also need to have support available for them. We must be able to tell them where the money is coming from to support meetings, etc. Rosenkrantz: And we need to determine their terms of appointment, the time commitment involved, etc. De Kemp suggested one meeting of the boards per year. Wegner: Need to determine procedures for adding and replacing members. Gebauer: The boards will not be supervisory committees – they will be like a Beirat, an advisory council. 15 EMANI Meeting July 25-26, 2002 De Kemp: EMANI is an initiative, a collaboration, not a society or a company. We can invite board members on that basis. The term may not be important. Thomas: The term is important, because there must be a mechanism for cycling people out. Joachim Heinze (Springer Heidelberg) suggested a 3-year renewable term. 2 years deemed too short for a real commitment to the project. De Kemp: Board members would receive no remuneration, but would be compensated for costs. De Kemp: We need to formulate a list of names in the coming weeks and finalize the board membership in November. The group will send suggestions for board members to Bernd Wegner. The listserv should not be used for this purpose. Springer Update Gebauer reported on the current status of Springer-Verlag and assured the group that even with a potential new majority owner, Springer is prepared to stand 100% behind EMANI. Expanding to Other Disciplines Gebauer discussed the prospects of expanding the EMANI model to other fields. He suggested that a successful project cannot be limited to pure mathematics. The applied areas of math are much larger and are thriving. He proposed that EMANI first be formalized, and that the group then begin talking to content providers in other areas – physics and engineering initially. Dennis asked what this plan implies with regard to the involvement of other publishers. Gebauer answered that while EMANI is initiated by Springer, it is not a Springer project. Once the project is formalized and stable, it will be open to other publishers. Gebauer proposed that EMANI branch out sometime in 2003 – he suggested AMS and Cambridge UP as possible future partners. Gebauer envisions EMANI in its final stage as a platform where all publishers in mathematics and related fields find their material archived. Information Dissemination Wegner outlined EMANI’s specific needs with regard to publicity and gave an overview of his own efforts (articles, conference papers, etc.) to publicize the project. [View presentation]. Bernd Wegner and Syed Hasan will draft new EMANI press release. Griepke: Link has a list of around 600 addresses of people who have requested information about EMANI. Springer will set up an EMANI website will be set up on the Springer server in Germany. Heike Neuroth will maintain the website. Site will contain (among other things…): 16 EMANI Meeting July 25-26, 2002 Links to partner institutions Restricted partners’ page (containing work package material and timeline) Each partner is to name a contact person for website. first draft of website to be presented at November meeting. Outreach Thomas noted that the first EMANI outreach following the Ithaca meeting would take place in Washington at the DML planning meeting. Asked for other suggestions. Griepke suggested arranging for a student to work on EMANI as a thesis project. Bérard noted that NUMDAM had library students studying that program. Thomas: Library schools should be alerted about this emerging project. Also: Coalition for Networked Information (CNI) Research libraries ALA SPARC SLA Etc. Need to assemble a list of organizations for outreach. Give categories and partners fill in appropriate national body. Thomas questioned whether dissemination and outreach are really separate tasks and suggested collapsing them into one work package. General agreement. Wegner and Thomas will co-chair combined Dissemination and Outreach work package. For all work packages: Leaders must formulate definition of work package with responsibilities and timetable. Descriptions are due by October 1 – send to Hans Becker who will organize them. Descriptions will appear on restricted part of website. Will need periodic updates. Wegner proposed that annual reports be produced. Also, any member of the team planning to present on EMANI at a conference, etc., should ask the list for suggestions. Hickerson asked whether “access” (which includes large-scale search strategies) should be separated from the “design and navigation” (which have to do with environment) in the work packages. Group decided to leave the single work package for now, but maybe split later when more work has been finished. Neuroth and Becker raised the issue of dependencies among the work packages – the timeline shows linear relationships (and hence “time dependencies”), but what about other logical and thematic dependencies? Hickerson responded that when the work packages are explicated, some will come apart. How do the work packages map to the OAIS model? 17 EMANI Meeting July 25-26, 2002 Arnoud de Kemp and Elmar Mittler added to Work Package 5 (Copyright). Progress report: NSF/DFG proposal Ruddy summarized: The proposal addresses archiving of journal material from Springer, Euclid, and EMIS in a jointly managed, distributed, interoperable digital archive. Includes metadata and protocol development. Funding could be available as soon as November 2002. This would be a 3year grant for approximately $1 million. The participants see the grant project as providing tools and a model for EMANI. The proposal refers to EMANI, but this is not an EMANI project. Whether the project is funded or not, the approach worked out for the proposal can provide an underpinning for EMANI. The participants hope this provides a way to build key pieces of EMANI with public support. The project would also build a working OAIS model. Summary / Next Steps The meeting closed with a summary of project tasks (given at top of this document) and optimistic anticipation of further collaboration among the partners. 18