Milestone 8.15: Report on existing processing services and

advertisement
Milestone 8.15: Report on existing processing services and analytical services:
An analysis of the meta-data of the services available by the major projects, initiatives, fora and
organizations
By Christos Arvanitidis, Sarah Faulwetter, Alexander Kruppa
Introduction
This field of research development is vast and fast evolving over the last couple of decades.
The scientific community has made such a progress that at the moment it is not possible to know even
the exact number of the facilities existing for the process and analysis of information and data relevant
with the disciplines of taxonomy, biogeography and ecology. For example, the search over google with
the key words “processing services”, “analytical services”, “taxonomy” brings 3,100,000 hits while it's
being reduced down to 130,000 when the additional key words “biogeography” and “ecology” are
added.
Therefore, this analysis must be restricted to a set of services available on the internet which
are manageable in terms of numbers, operational features and use. For this reason only those offered
by the mostly used sites (e.g. WoRMS, FishBase, GBIF, OBIS, uBIO, etc.) have been included in this
report.
The methodological approach followed is simple and effective. The overall objective is to
explore how (dis)similar are these services and where this (dis)similarity is coming from in terms of
operational features and functionality. In order to define an objective and reproducible approach, the
following four steps have been followed:
(a) to identify the processing and analytical tools and services offered by the sites of major projects,
initiatives, fora and organizations, such as those mentioned above;
(b) to use a set of classification identifiers, that is technical and functional features, for the adequate
characterization of their operational features and functionality (see below);
(c) to describe the processing and analytical tools available through the coding of their classification
identifiers (1,0, for presence, absence, respectively);
(d) to provide information on their relationships ((dis)similarities) through standard multivariate
analysis algorithms.
The following processing and analytical services taken into account are (full names in the
presentation section):
1. WoRMS: TaxonMatch
2. FishBase: Fish Class identification
3. FishBase: Keys of fish species identification
4. FishBase: Species Identification Using Morphometric Measurements
5. FishBase: IncoFish
6. FishBase: Aquamaps
7. GBIF: DWCA&Validator
8. GBIF: taxon tagger
9. GBIF: name finder
10. GBIF: GPAAMP
11. GBIF: BIDDSAT
12. uBIO: LinkIT
13. uBIO: FindIT
14. uBIO: Author Abbreviation Resolver
15. uBIO: TNS name mapper
16. uBIO: ParseIT
17. uBIO: CanonizeIT
18. uBIO: x:ID
19. ALA: Spatial Portal
20. OBIS: mapper
21. IBEM: ecological quality index
22. mMWeb: MAMS
23. mMWeb: CSM
24. mMWeb: ENFA.
The established categories of classification identifiers are:
Description
1. Describes data
2. Describes meta-data
3. Dynamical linking out facility with with reference systems
4. File Validating options
5. Freely distributable
6. Open source
7. Includes / applied to terrestrial species
8. Includes / applied to marine species
9. Includes / applied to fresh water species
10. Provides Literature resources
Input functions
11. Use of info and data from external resources (e.g. URL, other databases or file uploading)
12. Provides Glossary
13. Provides (bio-)Geographic area finder
14. Provides Names finder
15. Provides Authority search and match (name+year)
16. Provides Year match facility (with taxon requested)
17. Provides Extra searching facilities (row, column delimiter)
18. Provides Taxa range match facility (from to, or taxon x)
19. Interactive taxonomic key facility
20. Dichotomous/pictorial key facility
Output functions
21. Tool/service by geographic area
22. Highlighting not matched names
23. Taxonomic rank and nomenclature annotation
24. Taxon morphology info
25. Taxon morphometrics
26. Taxon photo match
27. Taxon description
28. Mapping tools (lat-long, FAO, biodiversity, etc)
29. Modeling facility on mapping (climate change, invasive species, ecological niche, etc)
30. Ecological quality indexing facility
31. Output file options facility
32. Output file as input file for other operations
Detailed presentation of the processing and analytical services
The processing and analytical services taken into account in this report are presented below in
detail. Their names, acronyms, URLs, as well as their coding according to the classification identifiers
is provided.
1. World Register of Marine Species – TaxonMatch
(http://www.marinespecies.org/aphia.php?p=match): This is a processing service which checks the
validity of any list of names of marine taxa and provides the user with a list of currently accepted as
valid names, as well as their higher classification. Unmatched taxon names are highlighted and the
user is prompted to thoroughly check the unmatched names.
Classification identifiers
Describes data: YES
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: YES
Freely distributable: NO
Open source: NO
Includes / applied to terrestrial species: NO
Includes / applied to marine species: YES
Includes / applied to fresh water species: NO
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: YES
Provides Authority search and match (name+year): YES
Provides Year match facility (with taxon requested): YES
Provides Extra searching facilities (row, column delimiter): YES
Provides Taxa range match facility (from to, or taxon x): YES
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: YES
Taxonomic rank and nomenclature annotation:YES
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
2. FishBase – Keys to fish classes (http://www.fishbase.org/identification/classlist.php).
Classification identifiers
Describes data: YES
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: NO
Open source: NO
Includes / applied to terrestrial species: NO
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): NO
Provides Glossary: YES
Provides (bio-)Geographic area finder: YES
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: YES
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: YES
Taxon morphology info: YES
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: YES
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO
3. FishBase – FishBase-Keys of fish species identification
(http://www.fishbase.org/keys/allkeys.php): Keys to families and species identification by
geographic area.
Classification identifiers
Describes data: YES
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: NO
Open source: NO
Includes / applied to terrestrial species: NO
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): NO
Provides Glossary: YES
Provides (bio-)Geographic area finder: YES
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: YES
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: YES
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: YES
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO
4. FishBase – Identification Using Morphometric Measurements
(http://www.fishbase.org/Identification/Morphometrics/centimeters/Index.php): It is an advanced
service of species identification by morphometrics which takes geographic, taxonomic and
morphometric information and retrieves possible matched species names along with their descriptions
and photos.
Classification identifiers
Describes data: YES
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: NO
Open source: NO
Includes / applied to terrestrial species: NO
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): NO
Provides Glossary: NO
Provides (bio-)Geographic area finder: YES
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): YES
Provides Taxa range match facility (from to, or taxon x): YES
Interactive taxonomic key facility: YES
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: YES
Taxon morphometrics: YES
Taxon photo match: YES
Taxon description: YES
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO.
5. FishBase – IncoFish (Shifting Baseline DatasetsSpecies)
(http://www.fishbase.org/Identification/Morphometrics/centimeters/Index.php): This is a
biogeographic service which takes information on specific datasets and on areas defined as Large
Marine Ecosystems (LMEs) and produces species distribution record tables and maps.
Classification identifiers
Describes data: YES
Describes meta-data: YES
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: NO
Open source: NO
Includes / applied to terrestrial species: NO
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): NO
Provides Glossary: NO
Provides (bio-)Geographic area finder: YES
Provides Names finder: YES
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): YES
Provides Extra searching facilities (row, column delimiter): YES
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): YES
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO
6. FishBase – AquaMaps
(http://www.fishbase.org/Identification/Morphometrics/centimeters/Index.php): A service with
multiple tools aiming at the large-scale predictions, under certain scenarios, through model-based
approach of the currently known species distributions of marine species. Physico-chemical and
biological processes, such as temperature, salinity and primary productivity, are taken into account by
these models in order to produce the prediction maps.
Classification identifiers
Describes data: YES
Describes meta-data: YES
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: NO
Open source: NO
Includes / applied to terrestrial species: NO
Includes / applied to marine species: YES
Includes / applied to fresh water species: NO
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: YES
Provides Names finder: YES
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): YES
Provides Extra searching facilities (row, column delimiter): YES
Provides Taxa range match facility (from to, or taxon x): YES
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): YES
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): YES
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
7. GBIF – Darwin Core Archive Assistant (Global Biodiversity Information Facility)
(http://tools.gbif.org/dwca-assistant/): DCAA is a web application that provides a simple interface
for describing the data elements a data provider serves to the GBIF network as basic text
files. It composes the appropriate XML descriptor file as defined in the Darwin Core Text
Guidelines to accompany the respective data.
Classification identifiers
Describes data: YES
Describes meta-data: YES
Dynamical linking out facility with with reference systems: NO
File Validating options: YES
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: YES
Provides (bio-)Geographic area finder: NO
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: YES
Taxon morphometrics: YES
Taxon photo match: YES
Taxon description: YES
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
8. GBIF - TaxonTagger (http://tools.gbif.org/taxontagger/): Web application that identifies,
highlights, and extracts scientific names from web pages and PDF documents. It uses the GBIF name
finder web services as it's name-finding engine and highlights names in a document and provides a
limited capacity to annotate the service output such as highlighting missed names, extending a find,
etc. The extracted list of species names can be subsequently exported as a simple DarwinCore list or
can be cross-referenced and mapped to the Catalogue of Life or other GBIF-indexed species lists to
output a complete classification of the taxa retrieved.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: YES
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: YES
Provides (bio-)Geographic area finder: NO
Provides Names finder: YES
Provides Authority search and match (name+year): YES
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): YES
Provides Taxa range match facility (from to, or taxon x): YES
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: YES
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
9. GBIF – ECAT Name Finder (http://tools.gbif.org/namefinder/): An html client to the name finding
web services hosted at GBIF. This service locates and extracts scientific names in a text document. It
is functionally linked to the global name architecture (GNA).
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: YES
File Validating options: NO
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: YES
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: YES
Taxonomic rank and nomenclature annotation: YES
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO
10. GBIF – GPAAMP (Global Protected Areas Assessment and Monitoring Pilot viewer)
(http://tools.gbif.org/gpaamp-demo/): This is a visualizing tool of information from disparate sources
delivered through standard-based web services.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): YES
Provides Extra searching facilities (row, column delimiter): YES
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): YES
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): YES
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
11. GBIF – BIDDSAT (Biodiversity Datasets Assessment Tool)
(http://www.gbif.org/orc/?doc_id=4696): Online tool where the user can analyze datasets published
and registered in the GBIF network. Selection options include metrics based on different variables
(e.g. taxonomy, type of record) and automatically produce maps and relevant diagrams.
Classification identifiers
Describes data: YES
Describes meta-data: YES
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): YES
Provides Extra searching facilities (row, column delimiter): YES
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): YES
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): YES
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: YES
12. uBIO – LinkIT (Universal Biological Indexer and Organizer)
(http://www.ubio.org/tools/linkit.php): This tool allows the user to develop automated dynamic link
outs from names in his/her own pages, databases, and texts to expert systems.
Classification identifiers
Describes data: YES
Describes meta-data: NO
Dynamical linking out facility with with reference systems: YES
File Validating options: YES
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: YES
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
13. uBIO - FindIT (http://www.ubio.org/tools/recognize.php): Scientific name identification in text.
Classification identifiers
Describes data: YES
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: YES
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: YES
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): YES
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: YES
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO
14. uBIO – Author Abbreviation Resolver (http://www.ubio.org/apps/authors/index.php): Web
service and database of variations throughout the recording of author and taxonomic names in the
literature.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: NO
Provides Authority search and match (name+year): YES
Provides Year match facility (with taxon requested): YES
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO
15. uBIO – TNS Name Mapper (http://www.ubio.org/services/mapper/index2.php): This web tool
superimposes lists of names against multiple checklists and classifications.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: YES
Provides Authority search and match (name+year): YES
Provides Year match facility (with taxon requested): YES
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
16. uBIO – ParseIT (http://www.ubio.org/tools/explode.php): It receives complex scientific names and
breaks them down to its component parts. Useful for the name variants.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): NO
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: YES
Provides Authority search and match (name+year): YES
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: YES
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO
17. uBIO – CanonizeIT (http://www.ubio.org/tools/canonize.php): The tool receives any scientific
name and author combination and provides the canonical form of the name, that is the primary
nomenclatural components of a scietific name combination that can be used for comparing lexical
equivalence among name strings.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: YES
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO
18. uBIO – x:ID (http://www.ubio.org/index.php?pagename=XID/key): This tool provides the
opportunity to the users to build their own web-based diagnostic or identification keys and run them
over locally or on the internet.
Classification identifiers
Describes data: YES
Describes meta-data: YES
Dynamical linking out facility with with reference systems: NO
File Validating options: YES
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: YES
Dichotomous/pictorial key facility: YES
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: YES
Taxon morphometrics: YES
Taxon photo match: YES
Taxon description: YES
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: NO
Output file as input file for other operations: NO
19. ALA - Spatial Portal (Atlas of Living Australia) (http://spatial.ala.org.au/): Allows the user to
create various maps on-the-fly, according to specified criteria or uploaded species lists.
Classification identifiers
Describes data: YES
Describes meta-data: YES
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: YES
Provides Names finder: YES
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): YES
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): YES
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
20. OBIS - mapping tool (Ocean Biogeographi Information System) (http://iobis.org/mapper/):
Allows the user to query the database and create various distribution maps on-the-fly (species
distribution datasets).
Classification identifiers
Describes data: YES
Describes meta-data: YES
Dynamical linking out facility with with reference systems: NO
File Validating options: NO
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: NO
Includes / applied to marine species: YES
Includes / applied to fresh water species: NO
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): NO
Provides Glossary: YES
Provides (bio-)Geographic area finder: YES
Provides Names finder: YES
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): YES
Provides Extra searching facilities (row, column delimiter): YES
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): YES
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
21. IBEM-ecological quality index (indice de biodiversité des étangs et mares)
(http://campus.hesge.ch/ibem/welcome.asp): Calculates certain indices on-the-fly according to
parameters chosen by the user, focus on lakes.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: YES
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: NO
Includes / applied to marine species: NO
Includes / applied to fresh water species: YES
Provides Literature resources: NO
Use of info and data from external resources (e.g. URL, other databases or file uploading): YES
Provides Glossary: NO
Provides (bio-)Geographic area finder: NO
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: NO
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): NO
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): NO
Ecological quality indexing facility: YES
Output file options facility: NO
Output file as input file for other operations: NO
22. mMWeb – MAMS (multi-models web - Marble Algorithm Modeling System)
(http://mmweb.animal.net.cn/; http://mmweb.animal.net.cn/alglist.jsp): This service provides an
integrated spatial analysis algorithm for predicting distributions of a given organism. MAMS extracts
and analyzes data from the occurrence localities of a given species and is a web-based application.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: YES
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): NO
Provides Glossary: NO
Provides (bio-)Geographic area finder: YES
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): YES
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): YES
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
23. mMWeb – CSM (multi-models web – Climate Space Model)
(http://mmweb.animal.net.cn/index.jsp): This service is a principle components based algorithm.
The component selection processed in this algorithm implementation is based on the Broken-Stick
cutoff where any component with an eigenvalue less than (n stddevs above a randomised sample) is
discarded.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: YES
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): NO
Provides Glossary: NO
Provides (bio-)Geographic area finder: YES
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): YES
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): YES
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
24. mMWeb – ENFA (multi-models web – Ecological Niche Factor Analysis)
(http://mmweb.animal.net.cn/index.jsp): This service uses a modified principal components
analysis to develop a model based on presence only data. The observed environment is compared to
the background data of the study area. The analysis produces factors similar to a PCA.
Classification identifiers
Describes data: NO
Describes meta-data: NO
Dynamical linking out facility with with reference systems: NO
File Validating options: YES
Freely distributable: YES
Open source: YES
Includes / applied to terrestrial species: YES
Includes / applied to marine species: YES
Includes / applied to fresh water species: YES
Provides Literature resources: YES
Use of info and data from external resources (e.g. URL, other databases or file uploading): NO
Provides Glossary: NO
Provides (bio-)Geographic area finder: YES
Provides Names finder: NO
Provides Authority search and match (name+year): NO
Provides Year match facility (with taxon requested): NO
Provides Extra searching facilities (row, column delimiter): NO
Provides Taxa range match facility (from to, or taxon x): NO
Interactive taxonomic key facility: NO
Dichotomous/pictorial key facility: NO
Tool/service by geographic area: YES
Highlighting not matched names: NO
Taxonomic rank and nomenclature annotation: NO
Taxon morphology info: NO
Taxon morphometrics: NO
Taxon photo match: NO
Taxon description: NO
Mapping tools (lat-long, FAO, biodiversity, etc): YES
Modeling facility on mapping (climate change, invasive species, ecological niche, etc): YES
Ecological quality indexing facility: NO
Output file options facility: YES
Output file as input file for other operations: YES
Multivariate analysis of the coded data
The ultimate purpose of the coding of these services and tools available is to explore any
commonalities and/or complementary aspects in their operational features and functionality. For this
reason, a features-by-service(/tool) matrix was constructed, based on the coded data presented
above. Subsequently, the Jaccard similarity coefficient was applied in order to construct the triangular
similarity matrix which describes similarities between all possible pairs of the web services and tools
taken into account in the context of this report. The standard hierarchical cluster analysis and the nonmetric mutli-dimensional scaling (nMDS) algorithms were applied to this matrix in order to visualize
their relationships as similarity dendrogram and MDS plot.
The assumed hypothesis is that there must be two factors that can affect the relationships
between the studied web sites and tools: (a) the web site development, that is the services and tools
developed under the same site, hence framework, are supposed to appear more similar to each other
than to those from the remainder web sites; (b) the processing and analytical function they carry out,
which means that services and tools developed to carry out similar functions or client needs must be
similar to each other. In order to statistically test the above two assumptions the permutational
mutlivariate analysis of variance (PERMANOVA) was performed to the same similarity matrix.
No clear separation of the services and tools can be seen in the following MDS plot.
However, the cluster analysis dendrogram is showing a much more clear picture of the
interrelations of the various services and tools.
Here one can identify clusters of similar services, such as those included in the mMWeb site or
the ones that offer mapping options (FishBase, OBIS, and ALA). However, the testing of the above
assumptions is only carried out through the application of the PERMANOVA .
Source
df
SS
MS
Website
Function
Website x Function
6
4
11980
9960.4
1996.7
2490.1
2.4312
3.032
0.004
0.004
996
998
1
11
23
3277.5
9034.2
51033
3277.5
821.29
3.9907
0.003
999
Res
Total
Pseudo-F p-Value
Permutation No
In the above table with the results of the PERMANOVA it is clearly shown that both factors, as
well as their combination, seem to have a significant effect on the relationships of the analyzed web
services and tools.
Conclusions
The analysis of this report shows that the existing tools and services in the web sites of the
major projects, initiatives, fora and organizations show many similarities in their features which
describe their operation and their input and output functions.
Both the web site and the function of the services and tools seem to affect their interrelations.
This simply means that there occur many commonalities in their functions which are derived by the
fact that they have been built under the same framework (web site) or to address similar needs of their
clients.
These results should be taken into account in the development of ViBRANT's services and tools
and some elements in their performance, such as standard protocols and speed, should perhaps take a
priority.
Download