1.2 Contents / List of Hits

advertisement
1. The User Interface of the Program
The user interface can be divided into three parts (frames), which can be reproportioned on
discretion, with the help of the slider between the frames:
1. Search and Index
2. Contents / List of Hits
3. Details of Hits
1.1 Search and Index
Usage of Search and Index is detailed at chapter 1.4.
1.2 Contents / List of Hits
We can change Contents and Hit Contents at the
bottom of the list.
We can find all issues of all journals uploaded in ADT in the default of the Contents, arranged in
a tree structure. A journal can be opened by clicking on the symbol “+” on the left, and we can
browse among values and issues. We can refine our search with the help of the checkbox next to
the name of the journal.
1
If we change over to search hits,
we can see how many times the
searched expression appears in
the given journal in all (in bold
numbers next to the name of the
journal), and how many times it
appears within a given volume.
If we click on the journal it shows in the Details of Hits the hit which is connected to the chosen
volume of the chosen journal.
1.3 Details of Hits
The information that helps navigation among hits can be found in the top of the Details of Hits:



We can set in the left roll-down menu how many hits are to be shown in the window at
the same time (10, 15, 20, 50, 100, 200).
We can see how many hits there are to the search phrase on the right.
We can turn pages of hits in the middle.
We can see separately the hits in the middle of the Detailed Hits field, for reasons of better
legibility, with alternating white and grey background:
Information given with hits:





number of hit (1)
name of journal including the search phrase (2)
preview (3)
name of bookmark (4), which is usually the name of the article, study, column
context (5)
The previous searches can be seen in the bottom of the Details of Hits:
2
1.4 Index and truncation
Before starting search it is important to check the searchable concepts. WARCTIS program offers
the possibility to have a view of the index during typing in the searched word or phrase, and find
out if the concept we look for is included in the database or not, if yes, how many times.
We can search on parts of words or phrases. It is useful when we do not know if the searched
word is the second part of a compound. In this case it is useful to use the signs of truncation:
*: substitutes any characters
!: substitutes exactly 1 character
?: substitutes 0 or 1 character
The signs of truncation can be used anywhere, at the beginning or ending of words, or inside
words. We use the sign * most often, typically in order to find inflected forms, but the sign ? is
necessary in case of uncertain spelling (e.g. archaic forms of words, or spelling mistakes of
words divided during OCR). The sign ! is important when we search for the compounds of a
given word.
1.5 Search: truncation, operators
We can use the above described truncation signs in our search. If we do not trunk, our hits will
be the full concord. In the case of publications of archives we can search every word (WORDS
index), or bookmarks (TITLE, AUTHOR index). Search words can be connected with the
following operators:



A AND B: the hits are pages containing both words;
A OR B: the hits are the pages containing one of the words;
A NOT B: the hits are the pages containing A, but not B.
The above logic, Boole operators are always valid within one page, which means that the hits are
always pages, and the scopes of AND, NOT, and OR are one page as well.
In order to refine our search we need further operators, first of all proximity operators:


A Nw B: hits are pages where the number of words between A and B are N at the
maximum, and A precedes B. If we write 2 words without the operator, the program
understands A 0w B search question.
A Nn B: hits are pages where the number of words between A and B are N at the
maximum, and the order of A and B is not fixed.
We can freely combine operators and truncation signs. The program generates even 100,000
hits in some seconds.

3
A--B: interval search, we search all the words between A and B.
1.6 Display of hits, other functions
If, after the search, we click on a hit in the Details of Hits, the PDF_View window appears. The
search word / words appear highlighted. The user interface consists of the following:
1.
2.
3.
4.
Pages view / Bookmarks view,
PDF_View navigation toolbar,
Adobe PDF toolbar,
The view of the page.
1.6.1 Views
We can choose between pages and bookmarks views in views panel. We can see the thumbnails
of pages and their page numbers (according to the PDF) in the Pages view; and we can browse
the contents and other bookmarks in the Bookmarks view.
1.6.2 PDF_View toolbar
The essential tools to help navigation among and display setting of hits can be found in PDF View
toolbar:
1.
2.
3.
4.
We can step forward and back among hits with these arrows.
Here we can set the number of pages to appear.
Here we can adjust the page to the window.
We can save the document here. We can decide from which page to which page we want
to save (maximum 20 pages). We can save the actually displayed page (slice) with the
Save button of Adobe toolbar, and it can be maximum 2 pages.
5. We can navigate with these buttons among the pages of the document. These navigation
toolbar has to be used instead of Adobe toolbar navigation buttons, because the program
displays only the necessary pages in the panel for technical reasons (1 or 2, depending
on arrangement).
6. This link can be used to display the page without search later on; we can share the page
in e-mail with it, or save as note for ourselves.
7. We can set in the left part which page to display. We can fix the current page with
clicking the button on the right, which means that the program will open a new window
to the next hit.
4
1.6.3 Adobe PDF toolbar
The Adobe PDF toolbar appearing here is the same as the toolbars of Adobe Reader or other
application for viewing PDF, offering basic functions. Important is the search field: what we type
in here will search words and phrases in the displayed page.
1.6.4 Display of page
The hits will be seen highlighted after clicking on the detailed hits field in the page display panel.
Display can be changed in the PDF View toolbar or Adobe PDF toolbar.
2. Frequently asked questions
What is necessary to open ADT?
One of the following browsers is necessary to the all-in use of ADT:





Internet Explorer 8 or newer
Mozilla Firefox 3.0 or newer
Google Chrome
Opera
Safari
An Adobe Reader or Adobe Acrobat version 8 or 9 is necessary, too. Highlighting the searched
text functions well only with these versions, but not with version 10 where this function has
been deleted (Adobe Reader X – version 10). Highlight after search is disabled in Adobe Reader 9
default, but it can be switched on again.
Why cannot ADT be run correctly on Chrome?
The display of ADT depends primarily on the technology the browser uses to view PDF. Google
Chrome displays PDFs with its own engine, Sandbox in the default, and neither highlighting
function nor PDF View toolbar are compatible with it. The solution is, if Adobe Reader 8 or 9 is
installed, that Chrome registeres Adobe Plugin out-of-date and shows this in the top in the
information bar. Choose Run Now function from the options and highlighting and PDF View
toolbar will operate in Chrome as well.
If this option does not appear, it means that Adobe Reader plug-in is disabled and Chrome builtin PDF viewer plug-in is active. To change it, write “about:plugins” in the title field, and switch off
“Chrome PDF Viewer” on the page; switch on “Adobe Acrobat” plug-in.
If we do not want click Run Now at every page, Chrome has to be started with this argumentum
in the command line interface: --allow-outdated-plug-ins
5
3. Data Structure of ADT
At present, ADT involves primarily journals in two-layer PDF format. One PDF file includes one
volume of the given journal. PDFs have title (e.g.: Magyar Nyelv – 1. évfolyam - 1905 [Hungarian
Language – Volume 1 – 1905]), and sometimes ( in the case of thematic issues) author or editor.
These data can be found in the descriptive header of the PDF. Besides, PDFs include bookmarks.
One type of bookmarks is called “Oldalszám” [Page Number], which helps to open the specific
page of a given issue. Page numbering of journals can be very different, but most frequently
numbering is continuous within a volume. It is very typical of old journals that the cover of an
issue is not bound; consequently the issues create one sequence. In other cases (usually in the
case of new materials) the cover of an issue is digitized as well. We can identify these and pages
without numbers with the help of the previous last numbered page (e.g. 34u_1. 34u_2: 1st page
after p.34; 2nd page after 34). There are journals where page numbering is separated in each
issue, in this case the bookmark involves the identifier of the issue.
The other type of bookmarks is “Tartalomjegyzék” [Contents]. This includes the authors and
titles of articles in the given journal. We differentiate two types of table of contents: that of the
volume and that of the issue. Usually we have the contents of volumes in the case of old volumes;
and the contents of issues in the case of newer volumes. The contents of volumes are edited,
often according to types of articles, within it in the alphabetic order of authors’ names; series of
articles are enlisted next to each other, and just the page numbers refer to the given parts. In the
case of contents of volumes just the comprehensive titles (e.g. Értekezés [Paper],
Könyvismertetés [Book Review] can be found.
Accordingly, contents with page numbers are less edited, in this case the number of the issue is
the first level, and author – title are the next level. In some cases (e.g. bulletins) there is no table
of contents (only page numbers, or number of issue, date).
According to the above, the structure of ADT database is:
Record: one page
Fields:


text of one page
bookmarks of a page
Indices: (all verbal)


6
WORDS: every word
Title, author: the words of bookmarks
Download