LIS618 lecture 5 Thomas Krichel 2003-02-26 structure • operations on any file (database) – connectors – booleans – set handling – display • file-specifics – bluesheet – Introduction to structured queries Use of connectors • Connectors are used to put several words together. • One instance where this is useful is when you have words that on their own mean different things. • For example "mate" is a herbal beverage consumed in South America. Looking for mate on the Internet retrieves a lot of singles' pages. example: terms related to "mate" What other terms to be used? – matear – matero – cebar – cebador – yerba – bombilla (drink mate) (mate drinker) (prepare mate) (mate preparer) (mate herb) (mate straw) connectors I • '(W)' requires terms to appear one after the other next to each other e.g. 'yerba(W)mate?' matches "yerba mate". • '(i W)' where i is an integer, means followed by at most i words, e.g. 'ceba?(3W)mate?' matches "cebar un maravilloso mate" but not "cebador guapo mirando un buen mate" connectors II • '(N)' requires terms to be next to each other e.g. 'yerba(N)mate?' matches "yerba mate" or "mate yerba". • '(i N)' where i is an integer, means proximity by at most i words, e.g. 'ceba?(3N)mate?' matches "cebar mate" or "matear con la cebadora". • '(S)' searches for the occurrence of connected terms in the same paragraph. using Boolean operators • In your query, you can combine several expressions with Boolean operators • Example: "S LIBRARY(W)SCHOOL? AND DISTANCE(W)EDUCATION" • But I usually do not issue such fancy queries. executing several searches • there can be several searches done sequentially, and the results sets are saved by the system. • Each time the system assigns a set number, Si, • These can be combined in Boolean expressions, e.g. 's S1 or S2 and S3' • Remember that Boolean operations are set-theoretic! Boolean operators on sets • when using Booleans, be aware that "and" has higher precedence than "or". • Thus: a or b and c is not the same as (a or b) and c but it is a or (b and c) • use parenthesis when in doubt DS (display sets) • This command can be executed any time to review the sets that have been formed since the last B (begin) command. • This can be useful to review your search history. the target command • "target set" where set is a search result set creates a subset of the "statistically most relevant results" in the original set. • I have not seen details about how this subset is computed. • new result set is being formed. display: the type command type set/format/range • set is a result set • format is a format • range can be – start – end • start is a record number to start • end is a record number to end – all standard delivery formats • • • • • 2 -- full record except abstract 3 or medium – citation 5 or long – full except full text 6 or free – title and dialog number 8 or short – title plus indexing terms – useful to find other indexing terms • 9 or full – everything • KWIC or K – keywords in context options for delivery • I once tried to email results to me, to no avail • You can save the html of the search results in the browser. • You can print the results within the browser. Looking at database structure • Up until now, we have looked at commands that take a full-text view of the database. • Such commands can be executed for every database. • If we want to make more precise queries, we have to take account of database structure. blue sheet • each database name is linked to a blueish pop-up window called the blue sheet for the database • This is called the bluesheet. • It contains the details of the database. closer look at the bluesheet • file description • subject coverage (free vocabulary) • format options, lists all formats – by number (internal) – by dialog web format (external, i.e. crossdatabase) • search options – basic index, i.e. subject contents – additional index, i.e. non-subject basic vs additional index • the basic index – has information that is relevant to the substantive contents of the data – usually is indexed by word, i.e. connectors are required • the additional index – has data that is not relevant to the substantive matter – usually indexed by phrase, i.e. connectors are not required search options: basic index • select without qualifiers searches in all fields in the basic index • bluesheet lists field indicators available for a database • also note if field is indexed by word or phrase. proximity searching only works with word indices. when phrases are indexed you don't need proximity indicators http://openlib.org/home/krichel Thank you for your attention!