Digital Library of India A Study of Independent Access Dr. Swapan Deoghuria Scientist-II Indian Association for the Cultivation of Science Kolkata – 700032, INDIA email: ccsd@iacs.res.in ABSTRACT Digital Library of India (DLI) portal [5] was studied in this paper for its effectiveness and how far it is user friendly for independent access [3]. Keywords Digital library, Independent access, User interface, Web design. 1. INTRODUCTION The main objective of a library is to preserve information to facilitate future access and dissemination of knowledge. Digital Libraries (DLs) are not exception. The extra advantage of DLs are that users can access and find information independently from any corner of the world irrespective of the caste, creed, color and economic status of the user. But DLs are of little value unless they are easy to use effectively [2]. Digital library users often have little formal training and they normally use independently. Of late the Digital Libraries initiatives in India have got momentum. DLI is the first major government funded Digital Library project in India in collaboration with other countries. The goal of this study is to determine methods to improve the interface design of DLI to enhance user’s ability to independently access the digital resources remotely. 2. BACKGROUND DLI Portal is a project piloted by Ministry of Communications and Information Technology, Government of India. The Portal was launched in September 2003 and being maintained by ERNET, India. This project is a part of the Universal Digital Library Project [15] of Carnegie Mellon University (CMU), Pittsburgh, which seeks to provide open access to one million books [4, 10, 12, 13] by 2005. At present more than 50,000 books are freely available online on this portal mainly with tiff image format. The primary long-term objective of DLI is to capture all books in digital 1 format for better preservation and to make freely available to everybody through Internet for education, study and appreciation. This portal may provide a gateway to Indian Digital Libraries in science, arts, culture, music, movies, traditional medicine, palm leaves and many more. DLI project may also help other researchers who are working on improved scanning techniques, improved optical character recognition, and improved indexing. Books published before 1920 are non-copyrighted materials and being scanned for DLI project. DLI is also seeking permission from different publishers for books to be scanned which are considered best for academic purposes. There are 23 Indian and 8 Chinese organizations and CMU, USA are involved in this project. The Office of the Principal Scientific Advisor to the Government of India is funding the project in the Indian Institute of Science. The Ministry of Communication and Information Technology (MCIT) is funding the project at various partner centres of the Digital Library of India. Various centres have also pledged their local resources to make the DLI a reality. National Science Foundation (NSF), USA is providing funding for Scanners and Software research and development. Indian Institute of Science (IISc), Bangalore is coordinating the DLI project and also playing as research centre in India. Apart from IISc, other 22 partner centres were selected amongst academic institutions, religious and cultural institutions and government and research agencies. 3. METHODOLOGY The author tested the portal for its usability and user interface. Intel Pentium PCs with MS Windows (95, 98, 2000, XP) and 14”, 17” and 19” colour monitors and SGI Octane with 21” colour monitor were used. Netscape (4.7, 7.1) and IE (5.0, 6.0) browsers are used with 512 Kbps LAN connectivity. 4. USAGE AND USABILITY Digital resources are being mounted online at an enormous rate [9]. But effective use of these resources is being hampered, as there is no uniform and consistent way to find the useful materials existing over the Net [6]. Study of usage and usability of DLs are gaining importance day by day [1-3]. It is found [14] that both faculty and students use and like electronic resources and most easily adopt them if the sources are perceived as convenient, relevant, and time saving to their natural workflow. It is 2 also important for capacity planning, systems design, user support and strategic planning [7]. Digital information is useless unless the formats, protocols, and metadata are recognized and processed by the users. 4.1 How to Find Library people as well as researchers mainly rely on commercial search engines to find digital resources instead of their severe limitations [6]. “Google” and “Yahoo” search for the phrase “digital library of India” fetched 6,28,000 and 7,57,000 listings respectively. The exact phrase “digital library of India” fetched 234 and 293 listings through “Google” and “Yahoo” respectively by their advanced search. The first item was “Digital Library of India” with a link to the URL http://www.dli.ernet.in/vision.html for search through “Yahoo”. “Google” search gave two URLs namely www.dli.ernet.in/ and www.dli.gov.in/. The second URL does not show anything. Earlier the author found content in www.dli.gov.in/ site of DLI but nothing is available now. 4.2 Consistency Consistency in appearance, in control and in function is important for users. Users try to understand what the computer system is doing when they see certain results on the screen. They should be able to interrupt or reverse actions. Moreover error handling should be simple and easy to comprehend. DLI collections are not accessible in lower version of browsers (Netscape 4.7 and IE 5.0). There is no error message and no information about the software and hardware requirements to access the books. There is no link to contact the Webmaster for any queries or any suggestion. At the bottom of the left side of the page there is the counter that is visible only in 19” or larger monitor. Search facility is available to find digital contents through some key words. But if search result exceeds 100 listings, only the first 100 are visible. The “Next” link to show the next search results is not working. There is one “Advanced Search” facility that is not working with any browser. “Advanced Search” and “Fifty Thousand Books so Far” are blinking in Netscape Browser but static in IE. There are fourteen linked items like “Vision”, “Mission” etc. below the search window that provide useful information about the portal. Some of the items at the extreme right are not visible in smaller monitors. The item at the extreme right that is “Members Area” 3 opens a Login screen for entering user name and password. Some basic information should be provided for general users. Interestingly, these items are not appearing even in 21” monitor for lower version of browsers. Below these items there are two frames. Under the heading “Collections” there is a link “Related Projects” that is not consistent with other linked items and not related with any type of collections. The link “Related Projects” show two linked items namely “Example Based Machine Translation” and “Indian Language Transliteration Tool”. The first one opens a page with the heading “Machine Translation (Experimental Setup)” and a login prompt appears that asks for username. The author failed to login as guest. The system does not show any error message or any clue. There should be some basic information about this page regarding its purpose. The second link opens a new page about “Indian Language Transliteration” developed by IISc and CMU with different links for downloading the software and other utilities. 4.3 User Interface A digital library is only as good as its interface. Good support for users through elegant design, appropriate functionality and responsive systems make a difference in its effectiveness of digital libraries. When a system is hard to use, the users may not be attracted towards its contents how valuable they might be. Any digital library must assume that users will have a variety of computers, with a variety of OS and environments. In India majority of users still use 14” monitors. But the information of DLI on 14” monitor is not easy to read. The items on the right side and at the bottom of the page are neither appearing nor even scrollable. Even users have to scroll to read the information appearing in the main frame. All information of DLI is clearly visible in 19” or larger monitor. Frames have been used for the home page of DLI. The top frame contains logo in the left, title in the middle and scanned image of books in the right that is not visible in 14” and 17” monitors. Under the title there are two lines of text regarding host of this portal. This lengthy information can be avoided in the home page and should be linked to other page of the portal without the abbreviations (PSA and MCIT). Fortunately these abbreviations are not used in the linked page of “New User Interface” available at the bottom of the home page under the heading of “New Links”. Below this top frame lie the most important items of the portal that is the tools of finding the digital contents of the DLI. One can find the content either from 4 alphabetically sorted list of Authors or Titles of the contents. But the alphabetically sorted list of Authors are not according to the last name of the authors as claimed by the portal because “Rabindranath Tagore” is available under the alphabet “R” and not under the alphabet “T”. Also ‘Sir Rabindranath Tagore” is found under the alphabet “S”. The author also found “Ravindranath Tagore”, “Raveendranadha Tagur” in the list. There are five linked items namely Books, Journals, Palm Leaves, Books From CMU, Related Projects under the heading Collections at the top of the left frame. The link “Books” opens a separate page with some linked items about different category of books like Engineering, General etc. This list is also available in the right frame. These books are not available through key word search, browse by author or browse by title available in the home page. Also the window opened by this collection of books is not same as the window opened by browse by author or title or through keyword search. There is no zoom facility with this page and also format and other tools are kept at the bottom of the page that are not easily visible with small monitors. There are two categories under this list namely “Indian Institute of Science Collections” and “Books from other centres” make no sense to general users as other categories are mainly related with subjects. “Indian Institute of Science Collections” is organisied into some categories. But the categories are not systematic. There is a list of books under the category “Other Languages” without mentioning the specific languages of those books. Unfortunately words of the titles are connected with underscore character and very difficult to read. The collection of eight scanning centres is available under the link “Books from other centres”. Journals available are mainly published by Indian Academy of Science, Bangalore and also available in the right frame. The last item under “Journals” that is “Scientific Books” opens a page with the following text in the title area <font face=“Arial” size=”2”> along with the heading “Universal Digital Library, Hosted by Indian Institute of Science” and a list of books published by Indian Academy of Science. Scanned images of palm leaves under the link “Palm Leaves” are available from Anna University. Proper summary information about these palm leaves along with the images would be more useful to researchers. “Books From CMU” are alphabetically sorted with a scope to go to the top after each alphabet. There are six time periods (1850 – 1950) under the heading “Rare Collections” with hyperlinks. 5 The collections are organised according to the scanning centres. The heading “Scanning Centres” will pop-up 16 participating centres from where books are being scanned and the collections of a particular centre are supposed to appear once selected. Some of the centres like IIIT-Allahabad, Kanchi University, Maharashtra Industrial Development Corporation (MIDC), Punjab Technical University, Rashtrapathi Bhavan, Sringeri Mutt have scanned books but those are not available on the portal till now. IIIT-Hyderabad Centre has scanned 12409 numbers of books so far and those data are supposed to be made available on the server by June 2004 as shown in the page. Till November 11, 2004 those are not available and DLI did not update the information. IISc collections are same as available under “Books” under the heading “Collections”. Arulmigu Kalasligam College of Engineering (AKCE) collections are alphabetically sorted but not organised according to subject or language. Collection of Anna University is Palm Leaves same as available under “Collection”. City Central Library, Hyderabad collections are organised according to different languages namely Telugu, English and Urdu. Unfortunately the words of the titles are clubbed together and very difficult to understand. For example, the first entry under English language is “Amanualoftheprinciplesofequity”. It should be “A manual of the principles of equity”. Similarly, Tirumala Tirupati Devasthanams, Tirupati collections are ordered according to language namely English, Kannada, Sanskrit, Tamil and Telugu. Alphabetically sorted list of books appear after clicking on a language link. Words of the titles are clubbed together here also. For example, the first entry under English is “Adialogueonprinciplesandpracticeofhinduism”, that should be “A dialogue on principles and practice of hinduism”. This is a serious problem. Indian Institute of Astrophysics, Bangalore collections are alphabetically sorted but the last two entries are not ordered. Books are not categorised according to subject or language. University of Pune, Maharashtra collections are organised according to different subjects and under each subject the list is alphabetically sorted. Shanmugha Art, Science, Technology & Research Academy (SASTRA), Tanjavur, Tamil Nadu collections are divided into books from SASTRA and books from CMU by SASTRA. Again each link is alphabetically arranged. State Central Library, Hyderabad does not show their collections but a different page with heading “Books From Other Centres” 6 with a list of Scanning Centres including State Central Library (SCL), Hyderabad. This link is also available under the link “Books” under the heading ‘Collections” discussed earlier. The collections from this centre are not arranged in any way and there is a number before each title of the book that makes more difficult to understand. For example, the first two entries are “0163s Plays Pleasant And Un Pleasant” and “0212s On Compromise”. SV Digital Library is the only centre that is not available under the heading scanning centres. Under the heading “Languages” there are six languages namely English, Kannada, Sanskrit, Tamil, Telugu and Urdu. Except Urdu all other languages show the collections of Tirumala Tirupati Devasthanams, Tirupati. The Urdu collections are from City Central Library, Hyderabad. Unfortunately there is no space between the words of the titles of the books and difficult to read. Under the heading “New Links” there are three items namely “UDL-Talks”, “New User Interface” and “DLI-Status-Report”. The presentations on May 2004 meeting are listed in “UDL-Talks” link. “DLI-StatusReport” gives a summary of information on status of all participating centres of DLI. “New User Interface” link opens a new home page of DLI with improved search facility for books with title, author, subject, year, language and scanning centre. Search for author “Rabindranath Tagore”, “Rabindranath” and “Tagore” matches 0, 21 and 32 results respectively. It is not clear why “Rabindranath Tagore” did not match any entry. Also there are eight entries for the title “Crescent Moon” by “Rabindranath Tagore” with different page numbers. Out of eight only two show the actual book. One entry with 304 pages is not available on the server. Others entries are Thomas Carlyle with 252 pages, Lucretia by Lord Lytton with 429 pages, On Compromise by John Morley with 289 pages, The Art of Literature with Arthur Schopenhauer with 151 pages, Behind the Bungalow by Eha with 207 pages. The search result is more meaningful here as essential metadata is available for each entry. But this type of wrong entry should be avoided. Under “DLI Collections” there are four headings namely “Book Collections”, “Journals”, “News Papers” and “Manuscripts”. There is no link for the items under “Journals” and “Manuscripts”. Under “Book Collections” three scanning centres are hyperlinked. No hyperlink is available for the word “more...” but users generally expect that. Only the metadata for the collection of “Rastrapati Bhavan” is available at present. Alphabetically sorted 7 list of titles of books are available under the heading “Title Beginning With:” “Authors Last Name” for listing of authors with alphabetically sorted form is not working. The second page is always same for any alphabets. One can browse the books year wise under the heading “Year”. Under the heading “Subject” there are nine subjects on the page and a link “More...” with many more subjects. There are nine languages namely English, Bengali, Hindi, Kannada, Marathi, Sanskrit, Tamil, Telugu and Urdu. For Bengali language all entries are appearing and it seems that there is no Bengali collection on DLI. In the search window there are more languages available apart from those available on the page. There is a scope to send feedback by the users that is not available on the original home page. “New User Interface” link gives organised data and easy to use independently. Meta data of the resources should be updated to minimise duplication and wrong entry. 4.4 Navigation The author first used the keyword “Rabindranath Tagore”, the only Nobel Laureate in India in literature. There are 25 records in the list with the headings Serial No. (#), Score, Matches, Creator(s) and Title. Language should be included as one of the heading because many titles of DLI can be in regional languages. The numbers within brackets in Matches indicate mow many times the key word matches with the document text and with the metadata. One can browse any title directly from this list. It opens a separate page with scanned image of the first page of the book with tools like Next Page, Previous Page, Zoom-in, Zoom-out, Format (TXT, PTIFF, HTM) and Go to Page (within the range of the pages of the book). The design of the page is good and easy to use. “‘GORA-RAVINDRA NATH TAGORE” title by Sri Venkata Parvatishwara Kavi was found twice. Probably these are duplicate records. Moreover the scanned images show that the book is written in another Indian language and may be a translation from the original. This poses a serious problem to users because there is no way to know about the subject or language of the books from the list itself. Books available in regional languages are of no use to users who do not know that language unless transliteration of the same book is available in other regional languages as well as in English. It may be convenient to users if each title is represented by summary information that conveys its essence. In a conventional 8 library contents are organized by classification or by ordering. Library professionals help users to search and access materials on the basis of catalogs and index cards. Only PTIFF format is working for regional language resources. The other two formats (TXT and HTML) should not appear where resources on these formats are not available. Most digital library users still print out articles that are judged useful as no one wants to read them online and a printing format such as PDF is popular. But in DLI there is no scope to print out a particular resource in a single mouse click. Probably DLI does not want systematic downloading of its resources. Categorization of resources under different languages and different subjects is a must for effective use. 4.5 Contents DLI Status Report shows all participating centres have scanned total 87,369 books so far. Out of those IISc received 55,570 books and 48,779 books are available on the Web. The remaining 6,791 books are under processing. Home page of DLI shows contents available in six languages including English. But “New User Interface” link shows contents are available in twelve languages. There are some collections under the category “Other Language”. May be these are in some foreign languages. Contents are not selected from all regional Indian languages to justify the name of the portal. Participating organizations are not selected from all states or regions of India. Of course there is a guideline for interested organizations/ persons to participate in DLI project. Till date the National Library of India is not included in this project that is advocated in Universal Library Project. 5. LIMITATIONS a. DLI portal is browser dependent. Users are not warned that the contents cannot be accessed through all browsers. b. Too many items in the home page to choose. Some information are available more than once. c. The databases of digital contents are not easy to use independently. d. All items in a page are not appearing for small monitors. e. The information appearing on screen is not easy to read in small monitors. f. All digital resources are not categorized according to subjects and languages. 9 g. There is no scope to print the digital resources easily. h. Some duplicate and wrong data exist. i. Representation of all Indian languages and states are absent. j. Short notes on digital resources are not available. k. There are items in the page where hyperlinks are expected but not available. l. There are hyperlinks for certain items but the links are not accessible. 6. FURTHER SCOPE Many STM journals are published from India by nonprofit organizations. Majority of them are not in a position to publish the same online mainly due to lack of technical expertise. Also the very existence of these journals is at risk due to tough competition from large commercial publishers. At present only journals published by Indian Academy of Science can be accessed from this portal. Journals published by other Societies and non-profit making organisations can be included for online access through DLI. Approximately 8,000 to 10,000 Ph. D. theses are being generated annually from Indian universities [11]. But majority of them are not accessible. DLI can play a host to archive these electronic theses and dissertations for easy access and dissemination of knowledge. For both cases the contents are born digital and would not be a problem for DLI. 7. CONCLUSIONS Existing archives of paper have many shortcomings but digital information is also very hard to preserve [8]. Sustainability is an important issue for this type of project as it involves huge fund as well as trained manpower. Being aware of these shortcomings DLI can still play an important role in free access to information globally. The “New User Interface” is more users friendly and data available are more organised. We hope DLI will shift to this user interface in near future. Within a short period of time DLI has done a commendable job and we hope many of the limitations mentioned in this paper will be considered to make it more user friendly for effective use in near future. 10 8. ACKNOWLEDGMENTS The author would like to thank Department of Science and Technology (DST), Government of India for using the computer facilities available at Indian Association for the Cultivation of Science, Kolkata under ICOSER project for this study. 9. REFERENCES [1] ARL. (2002). Collections & Access for the 21st-Century Scholar: Changing Roles of Research Libraries. A Report from the ARL Collections & Access Issues Task Force. ARL Bimonthly Report 225 (December 2002). [2] Arms, Y. W. (2000). Digital Libraries. The MIT Press, Cambridge, Massachusetts, London, England, pp 143. [3] Carole A. G. (2003). A Study of Independent Access to Library Resources: Follow-up Study of LibQual+ Survey of Library Service Quality – Personal Control Domain. Carnegie Mellon University Libraries, Digital Library Initiatives. May 13, 2003. URL: www.library.cmu.edu/Libraries/LibQualPersonalControl.pdf (Accessed on 11.11.2004) [4] Covey, D. T. (2003). Understanding and Assessing the Million Book Project. Pennsylvania Library Association, October 5, 2003. URL: http://www.library.cmu.edu/Libraries/MBPfinalNo_PICTS.ppt (Accessed on 11.11.2004) [5] Digital Library of India. A Portal at Ministry of Communications and Information Technology, Government of India. URL: www.dli.ernet.in/ (Accessed on 11.11.2004) [6] DLF. A New Approach to Finding Research Materials on the Web. URL: www.diglib.org/architectures/vision.htm (Accessed on 10.11.2004) [7] Greenstein, D. and Covey, D. T. (2000). Usage, Usability and User Support. Report of a discussion group convened at the DLF Forum on 2 April 2000. 11 [8] Lawrence, G. W., Kehoe, W. R., Rieger, O. Y., Walters, W. H., and Kenney, A. R.( 2000). Risk Management of Digital Information: A File Format Investigation. A Council on Library and Information Resources Publication, (June 2000) [9] Lyman, P. and Hal, R. V. (2003). How Much Information? Regents of the University of California, October 27, 2003. URL: www.sims.berkeley.edu/research/projects/how-much-info-2003. (Accessed on 10.11.2004) [10] Million Book Project FAQ. (2004). URL: www.library.cmu.edu/Libraries/MBP_FAQ.html (Accessed on 10.11.2004). [11] Nandi, A. and Satpathi, J. N. (2003). Electronic Theses and Dissertations (ETD) – New Genera of Documents in Digital Environment. Proceedings of the XXIV All India Conference of IASLIC (Survey of India, Dehradun, India, December 15-18, 2003). IASLIC, Kolkata, 2003 35-39. [12] St. Clair, G. (2003). Million Book Project Today. At OCLC, October 21, 2003. URL: www.library.cmu.edu/Libraries/MillionBook_OCLC.ppt (Accessed on November 10, 2004). [13] St. Clair, G. (2003). Million Book Project in U.S. and India. At International Conference on the Future of the Book, Cairns, Australia, April 22, 2003. URL: www.library.cmu.edu/People/gstclair/presentations.html [14] Tenopir, C. (2003). Use and Users of Electronic Library Resources: An overview and Analysis of Recent Research Studies. A Council on Library and Information Resources Publication, August 2003 [15] Universal Library Project. 2004. URL: www.ulib.org/ (Accessed on 10.11.2004). 12