NPI-RIVM-QA-flagging..

advertisement
CheckUVSpec
A quality assurance program for measured surface UV spectra
Contact: Ola Engelsen (Norwegian Polar Institute)
Last update: March 31, 2003
The CheckUVSpec QA program performs quality checks of measured downward hemispherical UV
spectra in flexstor file format (Heikkilä et al. 1998). It joins the two quality indicator programs
AtmosphericSignature and SHICrivm outlined below, and provides flagging output information from
either of the programs or both. The CheckUVSpec QA program is meant to supplement other existing
on site quality check procedures, not replace them. It is run operationally on all surface UV spectra
submitted to the EDUCE European UV database. The CheckUVSpec program is freely available at
http://zardoz.nilu.no/~olaeng/CheckUVSpec/CheckUVSpec.html
AtmosphericSignature by Ola Engelsen (Norwegian Polar Institute)
Introduction
The AtmosphericSignature QA program provides checks on the levels of irradiances in measured UV
spectra to flag obviously erroneous spectra or indicate approximately what kind of atmospheric
scenarios each measured spectrum resembles. This may provide important information both to the user
of the UV spectra as well as the data provider. Most users would want to reject any spectra with
obvious instrument errors from their analysis, or may want to focus on particular atmospheric
scenarios. As most UV spectra are measured automatically, little ancillary metadata on atmospheric
and surface conditions will usually be available. Furthermore, automatic unattended measurements of
UV spectra may be influenced by circumstances like ice and snow, birds, dirt, calibration errors, etc.
Most UV instruments are delicate and its calibration can easily be degraded.
Method
The Fastrt simulation tool (http://zardoz.nilu.no/~olaeng/fastrt/fastrt.html) is a central
component in the diagnosis of UV spectra. The Fastrt program is called by the AtmosphericSignature
tool to perform UV simulations in the upper and lower end of UVA of relevant atmospheric scenarios,
when needed. The relevant atmospheric scenarios are denoted MIN, MAX, AERO, CLEA and CLOU
and are described below.
MIN: We simulate the lowest naturally observable radiation levels. These, we assume to exist under
an extremely thick homogenous water cloud. This cloud is here assumed to be four km thick and have
a liquid water column of 4000g m-2. This is equivalent to a cloud optical depth of 650 at a wavelength
of 360nm. The cloud optical properties match that of alto-stratus clouds (Shettle, 1989) and the cloud
water droplets are distributed evenly between 2 and 7 km height above the terrestrial surface. There is
no surface reflection.
MAX: We have a broken clouds scenario where all downwelling radiation is initially transmitted
through the atmosphere as if only a clear atmosphere and a black surface were present. All radiation
reaching the ground is however trapped between a snow-covered ground and an extremely thick
homogenous alto-stratus cloud. This cloud is identical to the one in the MIN scenario above. We
assume that the MAX scenario yields the highest radiation levels obtainable naturally.
MAX_0: This is similar to the MAX scenario, except that now all diffuse downward radiation
transmitted through the clear atmosphere also is absorbed and scattered by the thick homogeneous
cloud.
AERO: This pertains to a cloudless but very turbid aerosol-loaded atmosphere with a visibility of 5km.
There is no surface reflection.
CLEA: This is a clear atmosphere with no aerosols or clouds present, and there is no surface
reflection.
CLOU: Cloudy atmosphere similar to MAX but with variable cloud density. The resulting irradiances
are computed iteratively in the search for a cloud liquid water column which yields the best match
between measurements and simulations.
The AtmosphericSignature QA tool relies on comparisons of measured UV spectra with modelled UV
spectra under identical observation conditions for the various atmospheric scenarios above. The
quality indicator program will print the spectrum ID followed by one of the rows in table 1 for each
measured spectrum.
Output
Spectrum
Flag
Diagnosis
name
FILE
BLACK
Flexstor file read error
CHECK
FILE
BLACK/GREY
Wavelengths
CHECK
inadequately
specified
FILE
BLACK
Irradiances
CHECK
inadequately
specified
Metric,
f
m1
M2
Sim. 1 Sim. 2
Metric,
f
m1
M2
and then followed by one of the lines below:
Spectrum
name
ATMSIG
ATMSIG
Flag
GREY
RED if:
f < 1.15
Diagnosis
Solar zenith angles off
limits [0-84] degrees
Radiation level too low
ATMSIG
BLACK if: Radiation level too high
f > 1.15
RED if:
0.85 < f <= 1.15
ATMSIG YELLOW if:
Radiation
m2 > MAX2_0
enhancement
ATMSIG
GREEN
otherwise
GREEN
Cloudless sky
ATMSIG
GREEN
CL-NCU
Sim. 1 Sim. 2
m2

MIN2
M2
MIN2
m2

MAX2
M2
MAX2
(m2-CLEA2)

(MAX2-CLEA2)
(0 > f >1)
Atmospheric visibility
(5-340 km)
M1/M2-CLEA1/CLEA2
MAX2_0 M2 CLEA2 MAX2
M2 CLEA2 AERO2
m1
M2 CLEA1 CLEA2
m1
M2 MIN1
(f < -0.15)
ATMSIG
GREEN
NCL-CU
ATMSIG
GREEN
Cloudy
M1/M2 - MIN1/MIN2
(f > 0.15)
Cloud liquid water
column (g m-2)
m2
Table 1. Output from NPI’s QA diagnosis tool AtmosphericSignature. m1 and m2 are measurements in
lower and upper part of the UVA spectral region, respectively. Subscripts 1 and 2 associated with
simulated scenarios MIN, MAX, AERO and CLEA indicates the lower and upper part of UVA,
respectively. Empty cells are filled with NaN which indicates that no legitimate number exists for this
MIN2
cell under this indication. The NaN quantity is set by the C compiler, e.g. to -99999.99. Cloud optical
depth at 360nm is the cloud liquid water column multiplied by 650/4000. N is the spectrum number
within the flexstor file. We conservatively estimate an overall joint uncertainty of 15% for
measurements and simulations.
More detailed information on some of the spectrum indicators:
Flexstor file read error: The input flexstor file is missing, has a format not fully recognized by the
program, or essential information cannot be read correctly from the file, e.g. unrecognisable units has
been encountered in the file.
Wavelengths inadequately specified: The spectra may be flagged GREY if there are no data at
wavelengths of interest to the AtmosphericSignature program, BLACK if e.g. the wavelength
units are unrecognisable.
Radiation enhancement: there is most likely a high surface albedo present, and/or a cloud constellation
causing the radiation level to be higher than for a clear atmosphere.
Cloudless sky: there are most likely no clouds present. Aerosols and haze may still be present, but not
necessarily.
CL-NCU (Cloud at Lower band- No Cloud at Upper band): there are most likely clouds in the lower
end of the scanned spectrum and cloudless sky in the upper end.
NCL-CU (No Cloud at Lower band- Cloud at Upper band): there are clouds in the upper end of the
spectrum and possibly no clouds in the lower end. There are at least clouds with very different
thickness in the lower end of the spectrum than in the upper part.
The other indicators should be self-explanatory.
Flagging from SHICrivm package to be included in the EDUCE database
Harry Slaper 29-11-2002
Introduction
Quality indicators will be included in the EDUCE-database. The quality indicators will be
related to several aspects of the spectral quality, including wavelength scale errors, spectral
shape errors and irradiance scale errors. The quality indicators will be referred to as quality
flags. In general GREEN flags meet the highest quality criteria, YELLOW flags do not fully
meet the highest criteria, but do meet the secondary criteria, RED flags identify spectra with a
doubtful quality, not meeting the secondary criteria, but not exceeding the lowest/rejection
criteria. BLACK flags indicate spectra that do not meet the lowest criteria and therefore
normally are not retrieved from the database, unless specifically requested by the user. In
addition a GREY flag is used if the spectra are doubtful but the algorithm is not allowing a
final conclusion.
Several flagging indicators are used at the database to allow a check of the data for different
aspects.
This document describes the flagging contribution from RIVM by means of the SHICrivm
package. In addition NPI has provided a quality checking routine for the irradiance scale,
which is described in a separate document.
Quality reporting by SHICrivm
Each spectral run provides quality-flagging output as shown below in the example. It is
proposed to include the following flags/indicators in the EDUCE data-base:
Shift_1
quality-flag
Shift_2
quality-flag
Start_irradiance
quality-flag
Spike+local_shape
Transmission_2
Scan_variability_2
quality-flag
quality-flag also indicator for possible atmospheric conditions
indicator providing information on the variability of conditions during
the scan
Example standard output for Quality flags obtained using SHIC:
1350801G 2002 sza 48.164 UVA_transmission: GREEN LOW_OR_NO_CLOUDS
scan_variability: STABLE_SCAN
local: GREEN STABLE indicative eff. ozone: 301.17 DU
shift1_flagging
GREEN
-0.009 (nm)
shift2_flagging
GREEN
-0.000 (nm)
start_irradiance_flag
GREEN
0.0000066 maximum_below_first: 0.0000024
Spike+local_shape
GREEN
Transmission_2
GREEN LOW_OR_NO_CLOUDS
0.86454561 2.40 all 0.81811091 12.58
scan_variability_2
STABLE_SCAN
2.40 =sd trans2
Transmission_1
NOT_IMPLEMENTED
0.68294455 18.33
local_shape_flag
GREEN STABLE
2.11 % criterium and nr_spikes 0.250 0 0.500 0 5.000 0
spike_flag_median_based GREEN nr_spikes_above_irr: 0.00050000 0 nr_spikes_above_irr0: 0.00010000 0 detected:
start_wavelength_flag GREEN
295.25 (nm) efuv_below: 0.01 %
last_wavelength_flag
GREEN
400.00 (nm) efuv_above: 0.00 %
Median_irradiance_flag NOT_EXTREME
Median_Irradiance: 0.03711040 at 310.00 nm
0
1
Shift1_flagging
Wavelength range 300-325 nm wavelength shifts of measured spectrum; numeric value
provides shift in nm.
criterium for flagging based on absolute number of shift, which is given in nm: ABS(f)
0 < GREEN < 0.1 < YELLOW < 0.2 < RED < 0.4 < BLACK
values of wavelength shifts: -0.9 up to +0.9 nm in standard analysis
The flag is returned as GREY if the algorithm does not find at least five reliable shift
determinations, or if the flag is BLACK but the median irradiance around 310 nm is
lower than 5e-4 W/(m2 nm)
2
Shift2_flagging
Wavelength range 325-400 nm wavelength shifts of measured spectrum; numeric value
provides shift in nm.
criterium for flagging based on absolute number of shift, which is given in nm: ABS(f)
0 < GREEN < 0.1 < YELLOW < 0.2 < RED < 0.4 < BLACK
values of wavelength shifts: -0.9 up to +0.9 nm in standard analysis; if outside the wavelength
range the values 9.999 is given (together with GREY flag identifier)
The flag is returned as GREY if the algorithm does not find at least five reliable shift
determinations, or if the flag is BLACK but the median irradiance around 310 nm is
lower than 5e-4 W/(m2 nm)
3
start_irradiance_flag
lowest reliable irradiance reading (criterium five subsequent ratios of irradiance readings
require to be within 25% of modelled ratios); two numeric values provide: the irradiance at
the first reliable reading, and the highest irrradiance reading below the first reliable reading;
criterium based on highest of the two numeric values:
GREEN < 5e-4 (W/(m2 nm)) < YELLOW < 1.5e-3 < RED < 5e-3 < BLACK
Values can be in principle any real number
Flag is GREY if the median irradiance level around 310 nm is lower than 5e-4 W/(m2 nm)
4
Spike and local shape flag
This flag combines results from the spike flag and the local shape flag. The worst of those two
flags is taken: the spike_flag_median_based and the local_shape_flag. There is not one
number corresponding to this flag, so it should be identified by the colour only. In addition to
the flag-colour the variability for the local_shape flag could be used as a number that is
included in the database, but it is not possible to link this one number to the flag-colour.
A spike is identified if the ratio of an irradiance measurement with the median of 10 readings
around the measured wavelength deviates more than a factor of two from a similar ratio
obtained from the modelled spectrum. A spike should at the database al ways lead to a
BLACK flag. A RED flag for spikes is identified if the spectral ratio of two subsequent
readings deviates more than 50% from the modelled ratio for all spectral irradiance levels
above the start_irradiance. Similarly a YELLOW flag is identified if a measured ratio deviates
25%.
The local shape flag indicates the variability of the ratio of two subsequent readings versus a
similar modelled ratio and therefore identifies local deviations/variations in the spectral shape.
The flagging is as follows:
GREEN < 10 % < YELLOW < 15 % <RED < 20% < BLACK
If the local shape flag is GREEN a further specification occurs:
STABLE < 3% < VARIABLE < 5% < HIGHLY_VARIABLE < 10%
Reported as GREEN STABLE etc. I propose to not include this differentiation of the GREEN
flag at the database.
5
Transmission_2
Transmission above 325 nm (up to highest wavelength, maximally 400 nm)
The transmission is calculated comparing with cloudless modelled values taking into account
earth sun distances.
Transmission is given in the first numeric value, usually between 0 and indefinite, but an
undetermined value leads to –999.999 (this occurs if the wavelength range does not include
wavelengths above 325 nm).
I now consider the implementation of a transmission for the 320-325 nm range, because many
single Brewers are found at the database for which we now do not have an irradiance check.
This will change the Transmission_2 line from:
Transmission_2
GREEN LOW_OR_NO_CLOUDS
0.86454561 2.40 all
0.81811091 12.58
To:
Transmission_2 GREEN LOW_OR_NO_CLOUDS 0.8645 2.40 region_320_325 0.8543
1.15 all 0.81811091 12.58
At the database just the numbers 0.8645 (transmission from 325-400 nm) and 0.8543
(transmission from 320-325 nm) are required, and the flagging criteria are compared with the
first number unless the value of –999.999 is given. In the latter case the value following the
region_320_325 should be used with similar border settings. The 2.40 and 1.15 in the
example are numbers to identify if large variations occur (percental variation), and are not
required for this flag.
Here the flagging is GREEN, YELLOW, RED and BLACK and in addition an identification
on the transmission is given
>2.0 'BLACK
EXTREMELY_HIGH
';
>1.5;
>1.25;
>0.75;
>0.25;
>0.10;
>0.05;
>0.01;
<0.01;
'RED
'YELLOW
'GREEN
’GREEN
'GREEN
'YELLOW
‘RED
‘BLACK
VERY_VERY_HIGH
';
VERY_HIGH
';
LOW_OR_NO_CLOUDS
';
CLOUDS
';
THICK_CLOUDS
';
VERY_THICK_CLOUDS
';
VERY_VERY_THICK_CLOUDS ';
EXTREMELY_LOW_TRANSMISS';
GREY is used instead of BLACK flags if the irradiance levels are below 5e-4 W/(m2 nm);
NOT_DETERMINED is provided if the spectral range does not cover wavelengths above 325
nm. GREY NOT_DETERMINED if irradiance level below 5e-4 W/(m2 nm) and the spectral
range does not cover wavelengths above 325 nm.
At the EDUCE meeting comments were received, implying that only BLACK and GREEN
flagging should be used as quality indicator at the database for the irradiance scale. I propose
to take over only the margins for BLACK and GREEN for the quality indication and in
addition the other transmission remarks can be relevant for data-selections, not so much as a
quality flag but rather an identifier of the conditions.
I think we still need to look at the exact borders to be used. Gunther Seckmeyer argued that
the lower limit at 0.01 is still too high. This is under consideration. The too high flag was not
disputed and makes sense when looking at the data from the database. I note that Ola
Engelsens high scenario corresponds to a factor of nearly five. It is my impression that this is
quite an extreme scenario which I have not seen in data tested so far, except for some obvious
errors.
Suggestion just take the two numbers, both transmissions as indicated above. In the testing
phase you could use a similar approach but taking the transmission following the string “all “.
For the rest this will remain identical.
6
Scan_variability_2
(NOT primarily a quality flag, but important to identify variability of conditions during the
scan)
This identifies the variability in the transmission over the scan for the wavelength range above
325 nm (up to 400 nm). It is not so much a direct quality indicator, but it is a diagnostic
identifier for large variations occurring during a scan.
Indicator is numerical real number.
> 30 %
> 20 %
>15 %
>10 %
> 5%
> 3%
>2%
>0
corresponds to
‘EXTREME_VARIATION_SCAN
';
'VERY_LARGE_VARIATION_SCAN ';
'LARGE_VARIATION_SCAN
';
'CONSIDERABLE_VARIATION_SCAN ';
'SOME_VARIATION_SCAN
';
'LOW_VARIATION_SCAN
';
'STABLE_SCAN
';
'VERY_STABLE_SCAN
';
NOT_DETERMINED is provided if the wavelength range above 325 nm is not covered in the
scan. Number of 100.00 is then given.
Acknowledgment
The development of CheckUVSpec, SHICrivm and AtmosphericSignature tools has been funded by
the European Commission (EC) through the CEC project "European Database for UV Climatology
and Evaluation (EDUCE)".
References
Shettle E P, "Models of aerosols, clouds and precipitation for atmospheric propagation
studies, In Atmospheric propagation in the UV, visible, IR and MM-region and related system
aspects, AGARD Conf. Proc. pp. 15-1-15-13, 1989
A. Heikkilä, A. Albold, G. Bernhard, B. Gardiner, K. Hurtta, P. Kirsch, A. Kylling, B. Mayer,
H. Slaper, P. Taalas, B. Walravens, A. Webb, G. Seckmeyer, SUVDAMA Data Storage,
Edition 4, 1998, Documentation for the European UV Database, http://www.muk.unihannover.de/EDUCE
Download