Catalogue Search | MBRL

Curating 62 Years of Walnut Gulch Experimental Watershed Data: Improving the Quality of Long-Term Rainfall and Runoff Datasets

by Kautz, Mark A. , Wei, Haiyan , Perumal, Anandraj Thiyagaraja in data quality , Datasets , Gauges

2022

The curation of hydrologic data includes quality control, documentation, database development, and provisions for public access. This article describes the development of new quality control procedures for experimental watersheds like the Walnut Gulch Experimental Watersheds (WGEW). WGEW is a 149 km2 watershed outdoor hydrologic laboratory equipped with a dense network of hydro-climatic instruments since the 1950s. To improve data accuracy from the constantly growing instrumentation networks in numerous experimental watersheds, we developed five new QAQC tools based on fundamental hydrologic principles. The tools include visual analysis of interpolated rainfall maps and evaluating temporal, spatial, and quantitative relationships between paired rainfall-runoff events, including runoff lag time, runoff coefficients, multiple regression, and association methods. The methods identified questionable rainfall and runoff observations in the WGEW database that were not usually captured by the existing QAQC procedures. The new tools were evaluated and confirmed using existing metadata, paper charts, and graphical visualization tools. It was found that 13% of the days (n = 780) with rainfall and 7% of the runoff events sampled had errors. Omitting these events improved the quality and reliability of the WGEW dataset for hydrologic modeling and analyses. This indicated the effectiveness of application of conventional hydrologic relations to improve the QAQC strategy for experimental watershed datasets.

Journal Article

Share this book

Add to My Shelf

HYDROLOGICAL FORECASTS AND PROJECTIONS FOR IMPROVED DECISION-MAKING IN THE WATER SECTOR IN EUROPE

by Wanders, Niko , Watts, Glenn , Samaniego, Luis in Adaptation , Applications programs , Bias

2019

Simulations of water fluxes at high spatial resolution that consistently cover historical observations, seasonal forecasts, and future climate projections are key to providing climate services aimed at supporting operational and strategic planning, and developing mitigation and adaptation policies. The End-to-end Demonstrator for improved decision-making in the water sector in Europe (EDgE) is a proof-of-concept project funded by the Copernicus Climate Change Service program that addresses these requirements by combining a multimodel ensemble of state-of-the-art climate model outputs and hydrological models to deliver sectoral climate impact indicators (SCIIs) codesigned with private and public water sector stakeholders from three contrasting European countries. The final product of EDgE is a water-oriented information system implemented through a web application. Here, we present the underlying structure of the EDgE modeling chain, which is composed of four phases: 1) climate data processing, 2) hydrological modeling, 3) stakeholder codesign and SCII estimation, and 4) uncertainty and skill assessments. Daily temperature and precipitation from observational datasets, four climate models for seasonal forecasts, and five climate models under two emission scenarios are consistently downscaled to 5-km spatial resolution to ensure locally relevant simulations based on four hydrological models. The consistency of the hydrological models is guaranteed by using identical input data for land surface parameterizations. The multimodel outputs are composed of 65 years of historical observations, a 19-yr ensemble of seasonal hindcasts, and a century-long ensemble of climate impact projections. These unique, high-resolution hydroclimatic simulations and SCIIs provide an unprecedented information system for decision-making over Europe and can serve as a template for water-related climate services in other regions.

Journal Article

Share this book

Add to My Shelf

Harnessing Generative Deep Learning for Enhanced Ensemble Data Assimilation

by Moradkhani, Hamid , oumandi, Ehsan in Artificial intelligence , Data assimilation , Data collection

2025

Hydrologic modeling faces challenges due to various sources of uncertainty, the inherent nonlinearity, and high dimensionality of Earth systems. Data assimilation (DA) methods are known to improve the accuracy and account for uncertainties in modeling; however, they may be limited by restrictive assumptions about error distributions and challenges associated with updating model prognostic variables, hence, representing the posterior distributions. To address these challenges, we present a new hydrologic DA method inspired by the similarities in theoretical backgrounds of DA and generative deep learning. The proposed Hydrologic Generative Ensemble Data Assimilation (HydroGEnDA) leverages deep learning‐based autoencoders, and deep generative modeling to perform DA in a unified latent space and finally a resampling method in physical space. The HydroGEnDA benefits from an autoencoder that transforms data to a latent space and a generative model that learns the underlying distribution of model states, then conditioning the sampling from this distribution to the observed data. Finally, resampling in physical space further improves the performance of the DA method. The HydroGEnDA involves an offline training stage without relying on observations, utilizing hydrologic model outputs instead to train the deep learning models. Following the training stage, the inference stage assimilates observed data to update the states. The method is tested through several synthetic experiments with varying observation noise levels with the Lorenz‐63 model, as well as real hydrologic case studies using the coupled SNOW‐17 and SAC‐SMA models across diverse watersheds. The results demonstrate that the HydroGEnDA outperforms previous DA methods in both experiments.

Journal Article

Share this book

Add to My Shelf

TPHiPr: a long-term (1979–2020) high-accuracy precipitation dataset (1∕30°, daily) for the Third Pole region based on high-resolution atmospheric modeling and dense observations

by He, Jie , Zhou, Bingrong , Jiang, Yaozhi in Accuracy , Atmospheric models , Datasets

2023

Reliable precipitation data are highly necessary for geoscience research in the Third Pole (TP) region but still lacking, due to the complex terrain and high spatial variability of precipitation here. Accordingly, this study produces a long-term (1979–2020) high-resolution (1/30∘, daily) precipitation dataset (TPHiPr) for the TP by merging the atmospheric simulation-based ERA5_CNN with gauge observations from more than 9000 rain gauges, using the climatologically aided interpolation and random forest methods. Validation shows that TPHiPr is generally unbiased and has a root mean square error of 5.0 mm d−1, a correlation of 0.76 and a critical success index of 0.61 with respect to 197 independent rain gauges in the TP, demonstrating that this dataset is remarkably better than the widely used datasets, including the latest generation of reanalysis (ERA5-Land), the state-of-the-art satellite-based dataset (IMERG) and the multi-source merging datasets (MSWEP v2 and AERA5-Asia). Moreover, TPHiPr can better detect precipitation extremes compared with these widely used datasets. Overall, this study provides a new precipitation dataset with high accuracy for the TP, which may have broad applications in meteorological, hydrological and ecological studies. The produced dataset can be accessed via https://doi.org/10.11888/Atmos.tpdc.272763 (Yang and Jiang, 2022).

Journal Article

Share this book

Add to My Shelf

CNRD v1.0

by Gou, Jiaojiao , Guo, Xiaoying , Samaniego, Luis in Analysis , Archives & records , Catchments

2021

Reliable, spatiotemporally continuous runoff records are necessary for identifying climate change impacts and planning effective water management strategies. Existing Chinese runoff data to date have been produced from sparse, poor-quality gauge measurements at different time scales. We have developed a new, quality-controlled gridded runoff dataset, the China Natural Runoff Dataset version 1.0 (CNRD v1.0), which provides daily, monthly, and annual 0.25° runoff estimates for the period 1961–2018 over China. CNRD v1.0 was generated using the Variable Infiltration Capacity (VIC) model. A comprehensive parameter uncertainty analysis framework incorporating parameter sensitivity analysis, optimization, and regionalization with 200 natural or near-natural gauge catchments was used to train the VIC model. Overall, the results show well-calibrated parameters for most gauged catchments except arid and semiarid areas, and the skill scores present high values for all catchments. For the pseudo-/test-ungauged catchments, the model parameters estimated by the multiscale parameter regionalization technique offer the best regionalization solution. CNRD v1.0 is the first free public dataset of gridded natural runoff estimated using a comprehensive model parameter uncertainty analysis framework for China. These results indicate that CNRD v1.0 has high potential for application to long-term hydrological and climate studies in China and to improve international runoff databases for global-scale studies.

Journal Article

Share this book

Add to My Shelf

The New Version 3.2 Global Precipitation Climatology Project (GPCP) Monthly and Daily Precipitation Products

by Adler, Robert F. , Huffman, George J. , Bolvin, David T. in Algorithms , Atmospheric precipitations , Climate

2023

The Global Precipitation Climatology Project (GPCP) Version 3.2 Precipitation Analysis provides globally complete analyses of surface precipitation on a 0.5° × 0.5° latitude–longitude grid at both monthly and daily time scales, covering from 1983 to the present and from June 2000 to the present, respectively. These merged products continue the GPCP heritage of incorporating precipitation estimates from low-orbit satellite microwave data, geosynchronous-orbit satellite infrared data, sounder-based estimates, and surface rain gauge observations emphasizing the strengths of various inputs and striving for time and space homogeneity. Furthermore, these analyses incorporate modern algorithms, refined intercalibrations among sensors, climatologies of recent high-quality satellite precipitation data, and fine-scale multisatellite estimates. New data fields have been introduced to better characterize the precipitation, including the fraction of the precipitation that is liquid (rain) in both the monthly and daily products, and a quality index for the monthly product. Compared to the operational GPCP Version 2.3 Monthly, the Version 3.2 Monthly product provides a more reasonable climatology in the Southern Ocean and increases the estimated global average precipitation by about 4.5%, which is similar to estimates from recent global water budget assessments. Global and regional trends for 1983–2020 with this new Monthly dataset are very similar to those computed from Version 2.3. Compared to the operational One-Degree Daily (Version 1.3) product, the new Version 3.2 Daily is designed to better represent the histogram of precipitation rates, particularly at high values and shifts the start of less-certain high-latitude estimates from 40° to 58° latitude in each hemisphere.

Journal Article

Share this book

Add to My Shelf

Global prediction of extreme floods in ungauged watersheds

by Nearing, Grey , Cohen, Deborah , Harrigan, Shaun in 704/106/242 , 704/172/4081 , 704/242

2024

Floods are one of the most common natural disasters, with a disproportionate impact in developing countries that often lack dense streamflow gauge networks 1 . Accurate and timely warnings are critical for mitigating flood risks 2 , but hydrological simulation models typically must be calibrated to long data records in each watershed. Here we show that artificial intelligence-based forecasting achieves reliability in predicting extreme riverine events in ungauged watersheds at up to a five-day lead time that is similar to or better than the reliability of nowcasts (zero-day lead time) from a current state-of-the-art global modelling system (the Copernicus Emergency Management Service Global Flood Awareness System). In addition, we achieve accuracies over five-year return period events that are similar to or better than current accuracies over one-year return period events. This means that artificial intelligence can provide flood warnings earlier and over larger and more impactful events in ungauged basins. The model developed here was incorporated into an operational early warning system that produces publicly available (free and open) forecasts in real time in over 80 countries. This work highlights a need for increasing the availability of hydrological data to continue to improve global access to reliable flood warnings. Artificial intelligence-based forecasting improves the reliability of predicting extreme flood events in ungauged watersheds, with predictions at five days lead time that are as good as current systems are for same-day predictions.

Journal Article

Share this book

Add to My Shelf

Development of a Distributed Physics‐Informed Deep Learning Hydrological Model for Data‐Scarce Regions

by Lei, Huimin , Yang, Jingjing , Zhong, Liangjin in Basins , Climate change , Climate models

2024

Climate change has exacerbated water stress and water‐related disasters, necessitating more precise streamflow simulations. However, in the majority of global regions, a deficiency of streamflow data constitutes a significant constraint on modeling endeavors. Traditional distributed hydrological models and regionalization approaches have shown suboptimal performance. While current deep learning (DL)‐related models trained on large data sets excel in spatial generalization, the direct applicability of these models in certain regions with unique hydrological processes can be challenging due to the limited representativeness within the training data set. Furthermore, transfer learning DL models pre‐trained on large data sets still necessitate local data for retraining, thereby constraining their applicability. To address these challenges, we present a physics‐informed DL model based on a distributed framework. It involves spatial discretization and the establishment of differentiable hydrological models for discrete sub‐basins, coupled with a differentiable Muskingum method for channel routing. By introducing upstream‐downstream relationships, model errors in sub‐basins propagate through the river network to the watershed outlet, enabling the optimization using limited downstream streamflow data, thereby achieving spatial simulation of ungauged internal sub‐basins. The model, when trained solely on the downstream‐most station, outperforms the distributed hydrological model in streamflow simulation at both the training station and upstream held‐out stations. Additionally, in comparison to transfer learning models, our model requires fewer gauge stations for training, but achieves higher precision in simulating streamflow on spatially held‐out stations, indicating better spatial generalization ability. Consequently, this model offers a novel approach to hydrological simulation in data‐scarce regions, especially those with poor hydrological representativeness. Plain Language Summary Climate change leads to more water shortages and disasters, requiring better streamflow predictions. Yet, a big hurdle in dealing with this issue is the lack of streamflow data across many parts of the world. Traditional physics‐based distributed hydrological models and current deep learning (DL) models have their limitations, especially for regions with unique hydrological processes and limited observations. To address these challenges, we developed a new tool combining physics‐informed DL and a traditional river routing model based on the distributed framework. The model divides the region into sub‐basins, where a physics‐informed DL rainfall‐runoff model calculates runoff generation, and a physics‐informed DL routing model computes the movement of water within each subunit toward the river. Model errors propagate downstream through the river network, thus requiring only a small amount of downstream data to optimize all sub‐basin models and effectively simulate internal unmonitored sub‐basins. When solely using the downstream‐most discharge stations for training, our model outperforms the traditional physics‐based distributed hydrological model. In addition, our approach requires less training data than transfer learning, while achieving higher spatial generalization accuracy. In summary, our model provides a new way to simulate streamflow in data‐scarce regions with unique processes. Key Points A distributed physics‐informed deep learning hydrological model was proposed for data‐scarce regions The new model outperforms the traditional distributed hydrologic model in simulating streamflow in upstream held‐out stations Our model requires less data for training but performs better than the transfer learning model in spatial generalization

Journal Article

Share this book

Add to My Shelf

Large-sample hydrology – a few camels or a whole caravan?

by Neri, Mattia , Seibert, Jan , van Meerveld, Ilja in Aridity , Atmospheric forcing , Calibration

2024

Large-sample datasets containing hydrometeorological time series and catchment attributes for hundreds of catchments in a country, many of them known as “CAMELS” (Catchment Attributes and MEteorology for Large-sample Studies), have revolutionized hydrological modelling and have enabled comparative analyses. The Caravan dataset is a compilation of several (CAMELS and other) large-sample datasets with uniform attribute names and data structures. This simplifies large-sample hydrology across regions, continents, or the globe. However, the use of the Caravan dataset instead of the original CAMELS or other large-sample datasets may affect model results and the conclusions derived thereof. For the Caravan dataset, the meteorological forcing data are based on ERA5-Land reanalysis data. Here, we describe the differences between the original precipitation, temperature, and potential evapotranspiration (Epot) data for 1252 catchments in the CAMELS-US, CAMELS-BR, and CAMELS-GB datasets and the forcing data for these catchments in the Caravan dataset. The Epot in the Caravan dataset is unrealistically high for many catchments, but there are, unsurprisingly, also considerable differences in the precipitation data. We show that the use of the forcing data from the Caravan dataset impairs hydrological model calibration for the vast majority of catchments; i.e. there is a drop in the calibration performance when using the forcing data from the Caravan dataset compared to the original CAMELS datasets. This drop is mainly due to the differences in the precipitation data. Therefore, we suggest extending the Caravan dataset with the forcing data included in the original CAMELS datasets wherever possible so that users can choose which forcing data they want to use or at least indicating clearly that the forcing data in Caravan come with a data quality loss and that using the original datasets is recommended. Moreover, we suggest not using the Epot data (and derived catchment attributes, such as the aridity index) from the Caravan dataset and instead recommend that these should be replaced with (or based on) alternative Epot estimates.

Journal Article

Share this book

Add to My Shelf

Why does a conceptual hydrological model fail to correctly predict discharge changes in response to climate change?

by Parajka, Juraj , Duethmann, Doris , Blöschl, Günter in Air temperature , Analysis , Calibration

2020

Several studies have shown that hydrological models do not perform well when applied to periods with climate conditions that differ from those during model calibration. This has important implications for the application of these models in climate change impact studies. The causes of the low transferability to changed climate conditions have, however, only been investigated in a few studies. Here we revisit a study in Austria that demonstrated the inability of a conceptual semi-distributed HBV-type model to simulate the observed discharge response to increases in precipitation and air temperature. The aim of the paper is to shed light on the reasons for these model problems. We set up hypotheses for the possible causes of the mismatch between the observed and simulated changes in discharge and evaluate these using simulations with modifications of the model. In the baseline model, trends of simulated and observed discharge over 1978–2013 differ, on average over all 156 catchments, by 95±50 mm yr−1 per 35 years. Accounting for variations in vegetation dynamics, as derived from a satellite-based vegetation index, in the calculation of reference evaporation explains 36±9 mm yr−1 per 35 years of the differences between the trends in simulated and observed discharge. Inhomogeneities in the precipitation data, caused by a variable number of stations, explain 39±26 mm yr−1 per 35 years of this difference. Extending the calibration period from 5 to 25 years, including annually aggregated discharge data or snow cover data in the objective function, or estimating evaporation with the Penman–Monteith instead of the Blaney–Criddle approach has little influence on the simulated discharge trends (5 mm yr−1 per 35 years or less). The precipitation data problem highlights the importance of using precipitation data based on a stationary input station network when studying hydrologic changes. The model structure problem with respect to vegetation dynamics is likely relevant for a wide spectrum of regions in a transient climate and has important implications for climate change impact studies.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter