scholarly journals MIPAS Level 2 Processor Prototype: from validation to operation

2014 ◽  
Author(s):  
Marc Bernau ◽  
Heidrun Weber ◽  
Michael Schmitt ◽  
Sven Bartha ◽  
Roland Gessner

In the research field of atmospheric chemistry a central question for acquired data sets is about validation. Have the data been validated to be useful for science? Has the data set been compared to other data sets? If deviations occur, which cause could be identified? Ultimately, two causes are possible when the same scene is observed: either the acquired raw data set is erroneous (hardware problem) or the data processing infers erroneous information (software problem). In order to make sure that the software works as expected, software validation plays a key role in the overall data set validation campaigns. This paper deals with operational software validation, which is an important component of the entire scientific validation chain. [...]

2020 ◽  
Vol 20 (11) ◽  
pp. 6991-7019
Author(s):  
Markus Kunze ◽  
Tim Kruschke ◽  
Ulrike Langematz ◽  
Miriam Sinnhuber ◽  
Thomas Reddmann ◽  
...  

Abstract. Variations in the solar spectral irradiance (SSI) with the 11-year sunspot cycle have been shown to have a significant impact on temperatures and the mixing ratios of atmospheric constituents in the stratosphere and mesosphere. Uncertainties in modelling the effects of SSI variations arise from uncertainties in the empirical models reconstructing the prescribed SSI data set as well as from uncertainties in the chemistry–climate model (CCM) formulation. In this study CCM simulations with the ECHAM/MESSy Atmospheric Chemistry (EMAC) model and the Community Earth System Model 1 (CESM1)–Whole Atmosphere Chemistry Climate Model (WACCM) have been performed to quantify the uncertainties of the solar responses in chemistry and dynamics that are due to the usage of five different SSI data sets or the two CCMs. We apply a two-way analysis of variance (ANOVA) to separate the influence of the SSI data sets and the CCMs on the variability of the solar response in shortwave heating rates, temperature, and ozone. The solar response is derived from climatological differences of time slice simulations prescribing SSI for the solar maximum in 1989 and near the solar minimum in 1994. The SSI values for the solar maximum of each SSI data set are created by adding the SSI differences between November 1994 and November 1989 to a common SSI reference spectrum for near-solar-minimum conditions based on ATLAS-3 (Atmospheric Laboratory of Applications and Science-3). The ANOVA identifies the SSI data set with the strongest influence on the variability of the solar response in shortwave heating rates in the upper mesosphere and in the upper stratosphere–lower mesosphere. The strongest influence on the variability of the solar response in ozone and temperature is identified in the upper stratosphere–lower mesosphere. However, in the region of the largest ozone mixing ratio, in the stratosphere from 50 to 10 hPa, the SSI data sets do not contribute much to the variability of the solar response when the Spectral And Total Irradiance REconstructions-T (SATIRE-T) SSI data set is omitted. The largest influence of the CCMs on variability of the solar responses can be identified in the upper mesosphere. The solar response in the lower stratosphere also depends on the CCM used, especially in the tropics and northern hemispheric subtropics and mid-latitudes, where the model dynamics modulate the solar responses. Apart from the upper mesosphere, there are also regions where the largest fraction of the variability of the solar response is explained by randomness, especially for the solar response in temperature.


2020 ◽  
Vol 13 (1) ◽  
pp. 287-308
Author(s):  
Stefan Lossow ◽  
Charlotta Högberg ◽  
Farahnaz Khosrawi ◽  
Gabriele P. Stiller ◽  
Ralf Bauer ◽  
...  

Abstract. The annual variation of δD in the tropical lower stratosphere is a critical indicator for the relative importance of different processes contributing to the transport of water vapour through the cold tropical tropopause region into the stratosphere. Distinct observational discrepancies of the δD annual variation were visible in the works of Steinwagner et al. (2010) and Randel et al. (2012). Steinwagner et al. (2010) analysed MIPAS (Michelson Interferometer for Passive Atmospheric Sounding) observations retrieved with the IMK/IAA (Institut für Meteorologie und Klimaforschung in Karlsruhe, Germany, in collaboration with the Instituto de Astrofísica de Andalucía in Granada, Spain) processor, while Randel et al. (2012) focused on ACE-FTS (Atmospheric Chemistry Experiment Fourier Transform Spectrometer) observations. Here we reassess the discrepancies based on newer MIPAS (IMK/IAA) and ACE-FTS data sets, also showing for completeness results from SMR (Sub-Millimetre Radiometer) observations and a ECHAM/MESSy (European Centre for Medium-Range Weather Forecasts Hamburg and Modular Earth Submodel System) Atmospheric Chemistry (EMAC) simulation (Eichinger et al., 2015b). Similar to the old analyses, the MIPAS data set yields a pronounced annual variation (maximum about 75 ‰), while that derived from the ACE-FTS data set is rather weak (maximum about 25 ‰). While all data sets exhibit the phase progression typical for the tape recorder, the annual maximum in the ACE-FTS data set precedes that in the MIPAS data set by 2 to 3 months. We critically consider several possible reasons for the observed discrepancies, focusing primarily on the MIPAS data set. We show that the δD annual variation in the MIPAS data up to an altitude of 40 hPa is substantially impacted by a “start altitude effect”, i.e. dependency between the lowermost altitude where MIPAS retrievals are possible and retrieved data at higher altitudes. In itself this effect does not explain the differences with the ACE-FTS data. In addition, there is a mismatch in the vertical resolution of the MIPAS HDO and H2O data (being consistently better for HDO), which actually results in an artificial tape-recorder-like signal in δD. Considering these MIPAS characteristics largely removes any discrepancies between the MIPAS and ACE-FTS data sets and shows that the MIPAS data are consistent with a δD tape recorder signal with an amplitude of about 25 ‰ in the lowermost stratosphere.


F1000Research ◽  
2017 ◽  
Vol 6 ◽  
pp. 967 ◽  
Author(s):  
Ting-Li Han ◽  
Yang Yang ◽  
Hua Zhang ◽  
Kai P. Law

Background: A challenge of metabolomics is data processing the enormous amount of information generated by sophisticated analytical techniques. The raw data of an untargeted metabolomic experiment are composited with unwanted biological and technical variations that confound the biological variations of interest. The art of data normalisation to offset these variations and/or eliminate experimental or biological biases has made significant progress recently. However, published comparative studies are often biased or have omissions. Methods: We investigated the issues with our own data set, using five different representative methods of internal standard-based, model-based, and pooled quality control-based approaches, and examined the performance of these methods against each other in an epidemiological study of gestational diabetes using plasma. Results: Our results demonstrated that the quality control-based approaches gave the highest data precision in all methods tested, and would be the method of choice for controlled experimental conditions. But for our epidemiological study, the model-based approaches were able to classify the clinical groups more effectively than the quality control-based approaches because of their ability to minimise not only technical variations, but also biological biases from the raw data. Conclusions: We suggest that metabolomic researchers should optimise and justify the method they have chosen for their experimental condition in order to obtain an optimal biological outcome.


2018 ◽  
Author(s):  
Charlotta Högberg ◽  
Stefan Lossow ◽  
Ralf Bauer ◽  
Kaley A. Walker ◽  
Patrick Eriksson ◽  
...  

Abstract. Within the framework of the second SPARC (Stratosphere-troposphere Processes And their Role in Climate) water vapour assessment (WAVAS-II), we have evaluated five data sets of δD(H2O) obtained from observations of Odin/SMR (Sub-Millimetre Radiometer), Envisat/MIPAS (Environmental Satellite/Michelson Interferometer for Passive Atmospheric Sounding) and SCISAT/ACE-FTS (Science Satellite/Atmospheric Chemistry Experiment-Fourier Transform Spectrometer) using profile-to-profile and climatological comparisons. Our focus is on stratospheric altitudes, but results from the upper troposphere to the lower mesosphere are provided. There are clear quantitative differences in the measurements of the isotopic ratio, which primarily concerns the comparisons to the SMR data set. In the lower stratosphere, this data set shows a higher depletion than the MIPAS and ACE-FTS data sets. The differences maximise close to 50 hPa and exceed 200 per mille. With increasing altitude, the biases typically decrease. Above 4 hPa, the SMR data set shows a lower depletion than the MIPAS data sets, on occasion exceeding 100 per mille. Overall, the δD biases of the SMR data set are driven by HDO biases in the lower stratosphere and by H2O biases in the upper stratosphere and lower mesosphere. In between, in the middle stratosphere, the biases in δD are a combination of deviations in both HDO and H2O. These biases are attributed to issues with the calibration, in particular in terms of the sideband filtering for H2O, and uncertainties in spectroscopic parameters. The MIPAS and ACE-FTS data sets agree rather well between about 100 hPa and 10 hPa. The MIPAS data sets show less depletion below about 15 hPa (up to about 30 per mille), due to differences in both HDO and H2O. Higher up the picture is reversed, and towards the upper stratosphere the biases typically increase. This is driven by increasing biases in H2O and on occasion the differences in δD exceed 80 per mille. Below 100 hPa, the differences between the MIPAS and ACE-FTS data sets are even larger. In the climatological comparisons, the MIPAS data sets continue to show less depletion than the ACE-FTS data sets below 15 hPa during all seasons, with some variations in magnitude. The differences between the MIPAS and ACE-FTS data come from different aspects, such as differences in the temporal and spatial sampling (except for the profile-to-profile comparisons), cloud influence, vertical resolution, and the microwindows and spectroscopic database chosen. Differences between data sets from the same instrument are typically small in the stratosphere.


2020 ◽  
Author(s):  
Oleg Skrynyk ◽  
Enric Aguilar ◽  
José A. Guijarro ◽  
Sergiy Bubin

<p>Before using climatological time series in research studies, it is necessary to perform their quality control and homogenization in order to remove possible artefacts (inhomogeneities) usually present in the raw data sets. In the vast majority of cases, the homogenization procedure allows to improve the consistency of the data, which then can be verified by means of the statistical comparison of the raw and homogenized time series. However, a new question then arises: how far are the homogenized data from the true climate signal or, in other words, what errors could still be present in homogenized data?</p><p>The main objective of our work is to estimate the uncertainty produced by the adjustment algorithm of the widely used Climatol homogenization software when homogenizing daily time series of the additive climate variables. We focused our efforts on the minimum and maximum air temperature. In order to achieve our goal we used a benchmark data set created by the INDECIS<sup>*</sup> project. The benchmark contains clean data, extracted from an output of the Royal Netherlands Meteorological Institute Regional Atmospheric Climate Model (version 2) driven by Hadley Global Environment Model 2 - Earth System, and inhomogeneous data, created by introducing realistic breaks and errors.</p><p>The statistical evaluation of discrepancies between the homogenized (by means of Climatol with predefined break points) and clean data sets was performed using both a set of standard parameters and a metrics introduced in our work. All metrics used clearly identifies the main features of errors (systematic and random) present in the homogenized time series. We calculated the metrics for every time series (only over adjusted segments) as well as their averaged values as measures of uncertainties in the whole data set.</p><p>In order to determine how the two key parameters of the raw data collection, namely the length of time series and station density, influence the calculated measures of the adjustment error we gradually decreased the length of the period and number of stations in the area under study. The total number of cases considered was 56, including 7 time periods (1950-2005, 1954-2005, …, 1974-2005) and 8 different quantities of stations (100, 90, …, 30). Additionally, in order to find out how stable are the calculated metrics for each of the 56 cases and determine their confidence intervals we performed 100 random permutations in the introduced inhomogeneity time series and repeated our calculations With that the total number of homogenization exercises performed was 5600 for each of two climate variables.</p><p>Lastly, the calculated metrics were compared with the corresponding values, obtained for raw time series. The comparison showed some substantial improvement of the metric values after homogenization in each of the 56 cases considered (for the both variables).</p><p>-------------------</p><p><sup>*</sup>INDECIS is a part of ERA4CS, an ERA-NET initiated by JPI Climate, and funded by FORMAS (SE), DLR (DE), BMWFW (AT), IFD (DK), MINECO (ES), ANR (FR) with co-funding by the European Union (Grant 690462). The work has been partially supported by the Ministry of Education and Science of Kazakhstan (Grant BR05236454) and Nazarbayev University (Grant 090118FD5345).</p>


2009 ◽  
Vol 9 (1) ◽  
pp. 1977-2020
Author(s):  
F. Khosrawi ◽  
R. Müller ◽  
M. H. Proffitt ◽  
R. Ruhnke ◽  
O. Kirner ◽  
...  

Abstract. 1-year data sets of monthly averaged nitrous oxide (N2O) and ozone (O3) derived from satellite measurements were used as a tool for the evaluation of atmospheric photochemical models. Two 1-year data sets, one derived from the Improved Limb Atmospheric Spectrometer (ILAS and ILAS-II) and one from the Odin Sub-Millimetre Radiometer (Odin/SMR) were employed. Here, these data sets are used for the evaluation of two Chemical Transport Models (CTMs), the Karlsruhe Simulation Model of the Middle Atmosphere (KASIMA) and the Chemical Lagrangian Model of the Stratosphere (CLaMS) as well as for one Chemistry-Climate Model (CCM), the atmospheric chemistry general circulation model ECHAM5/MESSy1 (E5M1) in the lower stratosphere with focus on the Northern Hemisphere. Since the Odin/SMR measurements cover the entire hemisphere, the evaluation is performed for the entire hemisphere as well as for the low latitudes, midlatitudes and high latitudes using the Odin/SMR 1-year data set as reference. To assess the impact of using different data sets for such an evaluation study we repeat the evaluation for the polar lower stratosphere using the ILAS/ILAS-II data set. Only small differences were found using ILAS/ILAS-II instead of Odin/SMR as a reference, thus, showing that the results are not influenced by the particular satellite data set used for the evaluation. The evaluation of CLaMS, KASIMA and E5M1 shows that all models are in good agreement with Odin/SMR and ILAS/ILAS-II. Differences are generally in the range of ±20%. Larger differences (up to −40%) are found in all models at 500±25 K for N2O mixing ratios greater than 200 ppb. Generally, the largest differences were found for the tropics and the lowest for the polar regions. However, an underestimation of polar winter ozone loss was found both in KASIMA and E5M1 both in the Northern and Southern Hemisphere.


2021 ◽  
Author(s):  
Francesco Grieco ◽  
Kristell Pérot ◽  
Donal Murtagh ◽  
Patrick Eriksson ◽  
Bengt Rydberg ◽  
...  

Abstract. Its long photochemical lifetime makes H2O a good tracer for mesospheric dynamics. Temperature is also an important tracer of seasonal circulation as well as multi-year trends. In this study we present the reprocessing of 18 years of mesospheric H2O and temperature measurements from the Sub-Millimetre Radiometer (SMR) on board the Odin satellite, resulting in a part of the SMR version 3.0 level 2 data set. The previous version of the dataset showed poor accordance with measurements from other instruments, which suggested that the retrieved concentrations and temperature were subject to instrumental artifacts. Different hypotheses have been explored, and the idea of an underestimation of the single sideband leakage turned out to be the most reasonable one. The value of the lowest transmission achievable has therefore been raised to account for greater sideband leakage, and new retrievals have been performed with the new settings. The retrieved profiles extend between 40–100 km altitude and cover the whole globe to reach 85° latitudes. A validation study has been carried out, revealing an overall better accordance with the compared instruments. In particular, relative differences in H2O concentration are always in the ±20 % range between 40 and 70 km and diverge at higher altitudes, while temperature absolute differences are within ± 5 K between 40–80 km (with the exception of FM13 SMR–MLS difference reaching almost 10 K) and also diverge at higher altitudes.


2019 ◽  
Vol 19 (4) ◽  
pp. 2497-2526 ◽  
Author(s):  
Charlotta Högberg ◽  
Stefan Lossow ◽  
Farahnaz Khosrawi ◽  
Ralf Bauer ◽  
Kaley A. Walker ◽  
...  

Abstract. Within the framework of the second SPARC (Stratosphere-troposphere Processes And their Role in Climate) water vapour assessment (WAVAS-II), we evaluated five data sets of δD(H2O) obtained from observations by Odin/SMR (Sub-Millimetre Radiometer), Envisat/MIPAS (Environmental Satellite/Michelson Interferometer for Passive Atmospheric Sounding), and SCISAT/ACE-FTS (Science Satellite/Atmospheric Chemistry Experiment – Fourier Transform Spectrometer) using profile-to-profile and climatological comparisons. These comparisons aimed to provide a comprehensive overview of typical uncertainties in the observational database that could be considered in the future in observational and modelling studies. Our primary focus is on stratospheric altitudes, but results for the upper troposphere and lower mesosphere are also shown. There are clear quantitative differences in the measurements of the isotopic ratio, mainly with regard to comparisons between the SMR data set and both the MIPAS and ACE-FTS data sets. In the lower stratosphere, the SMR data set shows a higher depletion in δD than the MIPAS and ACE-FTS data sets. The differences maximise close to 50 hPa and exceed 200 ‰. With increasing altitude, the biases decrease. Above 4 hPa, the SMR data set shows a lower δD depletion than the MIPAS data sets, occasionally exceeding 100 ‰. Overall, the δD biases of the SMR data set are driven by HDO biases in the lower stratosphere and by H2O biases in the upper stratosphere and lower mesosphere. In between, in the middle stratosphere, the biases in δD are the result of deviations in both HDO and H2O. These biases are attributed to issues with the calibration, in particular in terms of the sideband filtering, and uncertainties in spectroscopic parameters. The MIPAS and ACE-FTS data sets agree rather well between about 100 and 10 hPa. The MIPAS data sets show less depletion below approximately 15 hPa (up to about 30 ‰), due to differences in both HDO and H2O. Higher up this behaviour is reversed, and towards the upper stratosphere the biases increase. This is driven by increasing biases in H2O, and on occasion the differences in δD exceed 80 ‰. Below 100 hPa, the differences between the MIPAS and ACE-FTS data sets are even larger. In the climatological comparisons, the MIPAS data sets continue to show less depletion in δD than the ACE-FTS data sets below 15 hPa during all seasons, with some variations in magnitude. The differences between the MIPAS and ACE-FTS data have multiple causes, such as differences in the temporal and spatial sampling (except for the profile-to-profile comparisons), cloud influence, vertical resolution, and the microwindows and spectroscopic database chosen. Differences between data sets from the same instrument are typically small in the stratosphere. Overall, if the data sets are considered together, the differences in δD among them in key areas of scientific interest (e.g. tropical and polar lower stratosphere, lower mesosphere, and upper troposphere) are too large to draw robust conclusions on atmospheric processes affecting the water vapour budget and distribution, e.g. the relative importance of different mechanisms transporting water vapour into the stratosphere.


2020 ◽  
Author(s):  
Markus Kunze ◽  
Tim Kruschke ◽  
Ulrike Langematz ◽  
Miriam Sinnhuber ◽  
Thomas Reddmann ◽  
...  

Abstract. Variations of the solar spectral irradiance (SSI) with the 11-year sunspot cycle have been shown to have a significant impact on temperatures and the mixing ratios of atmospheric constituents in the stratosphere and mesosphere. Uncertainties in modelling the effects of SSI variations arise from uncertainties in the empirical models reconstructing the prescribed SSI data set as well as from uncertainties in the chemistry-climate model (CCM) formulation. In this study CCM simulations with the ECHAM MESSy Atmospheric Chemistry (EMAC) model and the Community Earth System Model 1 (CESM1) – Whole Atmosphere Chemistry Climate Model (WACCM) have been performed to quantify the uncertainties of the solar responses in chemistry and dynamics that are due to the usage of five different SSI data sets or the two CCMs. We apply a two-way analysis of variance (ANOVA) to separate the influence of the SSI data sets and the CCMs on the variability of the solar response in shortwave heating rates, temperature and ozone. The ANOVA identifies the SSI data set with the strongest influence on the variability of the solar signal in shortwave heating rates in the upper mesosphere and in the upper stratosphere/lower mesosphere. The strongest influence on the variability of the solar signal in ozone and temperature is identified in the upper stratosphere/lower mesosphere. The largest influence of the CCMs on variability of the solar responses can be identified in the upper mesosphere. The solar response in the lower stratosphere also depends on the CCM used, especially in the tropics and northern hemispheric subtropics and mid latitudes, where the model dynamics modulate the solar responses.


2021 ◽  
Vol 39 (15_suppl) ◽  
pp. e18725-e18725
Author(s):  
Ravit Geva ◽  
Barliz Waissengrin ◽  
Dan Mirelman ◽  
Felix Bokstein ◽  
Deborah T. Blumenthal ◽  
...  

e18725 Background: Healthcare data sharing is important for the creation of diverse and large data sets, supporting clinical decision making, and accelerating efficient research to improve patient outcomes. This is especially vital in the case of real world data analysis. However, stakeholders are reluctant to share their data without ensuring patients’ privacy, proper protection of their data sets and the ways they are being used. Homomorphic encryption is a cryptographic capability that can address these issues by enabling computation on encrypted data without ever decrypting it, so the analytics results are obtained without revealing the raw data. The aim of this study is to prove the accuracy of analytics results and the practical efficiency of the technology. Methods: A real-world data set of colorectal cancer patients’ survival data, following two different treatment interventions, including 623 patients and 24 variables, amounting to 14,952 items of data, was encrypted using leveled homomorphic encryption implemented in the PALISADE software library. Statistical analysis of key oncological endpoints was blindly performed on both the raw data and the homomorphically-encrypted data using descriptive statistics and survival analysis with Kaplan-Meier curves. Results were then compared with an accuracy goal of two decimals. Results: The difference between the raw data and the homomorphically encrypted data results, regarding all variables analyzed was within the pre-determined accuracy range goal, as well as the practical efficiency of the encrypted computation measured by run time, are presented in table. Conclusions: This study demonstrates that data encrypted with Homomorphic Encryption can be statistical analyzed with a precision of at least two decimal places, allowing safe clinical conclusions drawing while preserving patients’ privacy and protecting data owners’ data assets. Homomorphic encryption allows performing efficient computation on encrypted data non-interactively and without requiring decryption during computation time. Utilizing the technology will empower large-scale cross-institution and cross- stakeholder collaboration, allowing safe international collaborations. Clinical trial information: 0048-19-TLV. [Table: see text]


Sign in / Sign up

Export Citation Format

Share Document