Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
128,615
result(s) for
"evaluation model"
Sort by:
Performance evaluation of global hydrological models in six large Pan-Arctic watersheds
by
Satoh Yusuke
,
Müller, Schmied Hannes
,
Krysanova Valentina
in
Algorithms
,
Climate and vegetation
,
Climate change
2020
Global Water Models (GWMs), which include Global Hydrological, Land Surface, and Dynamic Global Vegetation Models, present valuable tools for quantifying climate change impacts on hydrological processes in the data scarce high latitudes. Here we performed a systematic model performance evaluation in six major Pan-Arctic watersheds for different hydrological indicators (monthly and seasonal discharge, extremes, trends (or lack of), and snow water equivalent (SWE)) via a novel Aggregated Performance Index (API) that is based on commonly used statistical evaluation metrics. The machine learning Boruta feature selection algorithm was used to evaluate the explanatory power of the API attributes. Our results show that the majority of the nine GWMs included in the study exhibit considerable difficulties in realistically representing Pan-Arctic hydrological processes. Average APIdischarge (monthly and seasonal discharge) over nine GWMs is > 50% only in the Kolyma basin (55%), as low as 30% in the Yukon basin and averaged over all watersheds APIdischarge is 43%. WATERGAP2 and MATSIRO present the highest (APIdischarge > 55%) while ORCHIDEE and JULES-W1 the lowest (APIdischarge ≤ 25%) performing GWMs over all watersheds. For the high and low flows, average APIextreme is 35% and 26%, respectively, and over six GWMs APISWE is 57%. The Boruta algorithm suggests that using different observation-based climate data sets does not influence the total score of the APIs in all watersheds. Ultimately, only satisfactory to good performing GWMs that effectively represent cold-region hydrological processes (including snow-related processes, permafrost) should be included in multi-model climate change impact assessments in Pan-Arctic watersheds.
Journal Article
Scientific and Human Errors in a Snow Model Intercomparison
2021
Twenty-seven models participated in the Earth System Model–Snow Model Intercomparison Project (ESM-SnowMIP), the most data-rich MIP dedicated to snow modeling. Our findings do not support the hypothesis advanced by previous snow MIPs: evaluating models against more variables and providing evaluation datasets extended temporally and spatially does not facilitate identification of key new processes requiring improvement to model snow mass and energy budgets, even at point scales. In fact, the same modeling issues identified by previous snow MIPs arose: albedo is a major source of uncertainty, surface exchange parameterizations are problematic, and individual model performance is inconsistent. This lack of progress is attributed partly to the large number of human errors that led to anomalous model behavior and to numerous resubmissions. It is unclear how widespread such errors are in our field and others; dedicated time and resources will be needed to tackle this issue to prevent highly sophisticated models and their research outputs from being vulnerable because of avoidable human mistakes. The design of and the data available to successive snow MIPs were also questioned. Evaluation of models against bulk snow properties was found to be sufficient for some but inappropriate for more complex snow models whose skills at simulating internal snow properties remained untested. Discussions between the authors of this paper on the purpose of MIPs revealed varied, and sometimes contradictory, motivations behind their participation. These findings started a collaborative effort to adapt future snow MIPs to respond to the diverse needs of the community.
Journal Article
Linking Large-Scale Double-ITCZ Bias to Local-Scale Drizzling Bias in Climate Models
2022
Tropical precipitation in climate models presents significant biases in both the large-scale pattern (i.e., double intertropical convergence zone bias) and local-scale characteristics (i.e., drizzling bias with too frequent drizzle/convection and reduced occurrences of no and heavy precipitation). By untangling the coupled system and analyzing the biases in precipitation, cloud, and radiation, this study shows that local-scale drizzling bias in atmospheric models can lead to large-scale double-ITCZ bias in coupled models by inducing convective-regime-dependent biases in precipitation and cloud radiative effects (CRE). The double-ITCZ bias consists of a hemispherically asymmetric component that arises from the asymmetric SST bias and a nearly symmetric component that exists in atmospheric models without the SST bias. By increasing light rain but reducing heavy rain, local-scale drizzling bias induces positive (negative) precipitation bias in the moderate (strong) convective regime, leading to the nearly symmetric wet bias in atmospheric models. By affecting the cloud profile, local-scale drizzling bias induces positive (negative) CRE bias in the stratocumulus (convective) regime in atmospheric models. Because the stratocumulus (convective) region is climatologically more pronounced in the southern (northern) tropics, the CRE bias is deemed to be hemispherically asymmetric and drives warm and wet (cold and dry) biases in the southern (northern) tropics when coupled to ocean. Our results suggest that correcting local-scale drizzling bias is critical for fixing large-scale double-ITCZ bias. The drizzling and double-ITCZ biases are not alleviated in models with mesoscale (0.25°–0.5°) or even storm-resolving (∼3 km) resolution, implying that either large-eddy simulation or fundamental improvement in small-scale subgrid parameterizations is needed.
Journal Article
Process-Oriented Diagnostics
by
Gettelman, Andrew
,
Ullrich, Paul
,
Ming, Yi
in
Atmosphere-ocean interaction
,
Best practice
,
Best practices
2023
Process-oriented diagnostics (PODs) aim to provide feedback for model developers through model analysis based on physical hypotheses. However, the step from a diagnostic based on relationships among variables, even when hypothesis driven, to specific guidance for revising model formulation or parameterizations can be substantial. The POD may provide more information than a purely performance-based metric, but a gap between POD principles and providing actionable information for specific model revisions can remain. Furthermore, in coordinating diagnostics development, there is a trade-off between freedom for the developer, aiming to capture innovation, and near-term utility to the modeling center. Best practices that allow for the former, while conforming to specifications that aid the latter, are important for community diagnostics development that leads to tangible model improvements. Promising directions to close the gap between principles and practice include the interaction of PODs with perturbed physics experiments and with more quantitative process models as well as the inclusion of personnel from modeling centers in diagnostics development groups for immediate feedback during climate model revisions. Examples are provided, along with best-practice recommendations, based on practical experience from the NOAA Model Diagnostics Task Force (MDTF). Common standards for metrics and diagnostics that have arisen from a collaboration between the MDTF and the Department of Energy’s Coordinated Model Evaluation Capability are advocated as a means of uniting community diagnostics efforts.
Journal Article
Benchmarking Performance Changes in the Simulation of Extratropical Modes of Variability across CMIP Generations
2021
We evaluate extratropical modes of variability in the three most recent phases of the Coupled Model Intercomparison Project (CMIP3, CMIP5, and CMIP6) to gauge improvement of climate models over time. A suite of high-level metrics is employed to objectively evaluate how well climate models simulate the observed northern annular mode (NAM), North Atlantic Oscillation (NAO), Pacific–North America pattern (PNA), southern annular mode (SAM), Pacific decadal oscillation (PDO), North Pacific Oscillation (NPO), and North Pacific Gyre Oscillation (NPGO). We apply a common basis function (CBF) approach that projects model anomalies onto observed empirical orthogonal functions (EOFs), together with the traditional EOF approach, to CMIP Historical and AMIP models. We find simulated spatial patterns of those modes have been significantly improved in the newer models, although the skill improvement is sensitive to the mode and season considered. We identify some potential contributions to the pattern improvement of certain modes (e.g., the Southern Hemisphere jet and high-top vertical coordinate); however, the performance changes are likely attributed to gradual improvement of the base climate and multiple relevant processes. Less performance improvement is evident in the mode amplitude of these modes and systematic overestimation of the mode amplitude in spring remains in the newer climate models. We find that the postdominant season amplitude errors in atmospheric modes are not limited to coupled runs but are often already evident in AMIP simulations. This suggests that rectifying the egregious postdominant season amplitude errors found in many models can be addressed in an atmospheric-only framework, making it more tractable to address in the model development process.
Journal Article
Climate Impacts of Convective Cloud Microphysics in NCAR CAM5
by
Shan, Yunpeng
,
Lin, Lin
,
Fu, Qiang
in
Aerosols
,
Atmospheric models
,
Atmospheric precipitations
2023
We improved the treatments of convective cloud microphysics in the NCAR Community Atmosphere Model version 5.3 (CAM5.3) by 1) implementing new terminal velocity parameterizations for convective ice and snow particles, 2) adding graupel microphysics, 3) considering convective snow detrainment, and 4) enhancing rain initiation and generation rate in warm clouds. We evaluated the impacts of improved microphysics on simulated global climate, focusing on simulated cloud radiative forcing, graupel microphysics, convective cloud ice amount, and tropical precipitation. Compared to CAM5.3 with the default convective microphysics, the too-strong cloud shortwave radiative forcing due primarily to excessive convective cloud liquid is largely alleviated over the tropics and midlatitudes after rain initiation and generation rate is enhanced, in better agreement with the CERES-EBAF estimates. Geographic distributions of graupel occurrence are reasonably simulated over continents; whereas the graupel occurrence remains highly uncertain over the oceanic storm-track regions. When evaluated against the CloudSat–CALIPSO estimates, the overestimation of convective ice mass is alleviated with the improved convective ice microphysics, among which adding graupel microphysics and the accompanying increase in hydrometeor fall speed play the most important role. The probability distribution function (PDF) of rainfall intensity is sensitive to warm rain processes in convective clouds, and enhancement in warm rain production shifts the PDF toward heavier precipitation, which agrees better with the TRMM observations. Common biases of overestimating the light rain frequency and underestimating the heavy rain frequency in GCMs are mitigated.
Journal Article
Convection–Kelvin Wave Coupling in a Global Convection-Permitting Model
by
Mass, Clifford F.
,
Weber, Nicholas J.
,
Kim, Daehyun
in
Chemical precipitation
,
cloud resolving models
,
Convection
2021
A convectively coupled equatorial Kelvin wave (CCKW) was observed over the equatorial Indian Ocean in early November 2011 during the DYNAMO field campaign. This study examines the structure of the CCKW event using two simulations made using the MPAS model: one with 3-km grid spacing without convective parameterization and another with a 15-km grid and parameterized convection. Both simulations qualitatively capture the observed structure of the CCKW, including its vertical tilt and progression of cloud/precipitation structures. The two simulations, however, differ substantially in the amplitude of the CCKW-associated precipitation. While the 3-km run realistically captures the observed modulation of precipitation by the CCKW, the 15-km simulation severely underestimates its magnitude. To understand the difference between the two MPAS simulations regarding wave–convection coupling within the CCKW, the relationship of precipitation with convective inhibition, saturation fraction, and surface turbulent fluxes is investigated. Results show that the 15-km simulation underestimates the magnitude of the CCKW precipitation peak in association with its unrealistically linear relationship between moisture and precipitation. Precipitation, both in observations and the 3-km run, is predominantly controlled by saturation fraction and this relationship is exponential. In contrast, the parameterized convection in the 15-km run is overly sensitive to convective inhibition and not sensitive enough to environmental moisture. The implications of these results on CCKW theories are discussed.
Journal Article
How to execute Context, Input, Process, and Product evaluation model in medical health education
by
Lee, So young
,
Lee, Seung-Hee
,
Shin, Jwa-Seop
in
context, input, process, and product evaluation model
,
context, input, process, and product model
,
educational evaluation
2019
Improvements to education are necessary in order to keep up with the education requirements of today. The Context, Input, Process, and Product (CIPP) evaluation model was created for the decision-making towards education improvement, so this model is appropriate in this regard. However, application of this model in the actual context of medical health education is considered difficult in the education environment. Thus, in this study, literature survey of previous studies was investigated to examine the execution procedure of how the CIPP model can be actually applied. For the execution procedure utilizing the CIPP model, the criteria and indicators were determined from analysis results and material was collected after setting the material collection method. Afterwards, the collected material was analyzed for each CIPP element, and finally, the relationship of each CIPP element was analyzed for the final improvement decision-making. In this study, these steps were followed and the methods employed in previous studies were organized. Particularly, the process of determining the criteria and indicators was important and required a significant effort. Literature survey was carried out to analyze the most widely used criteria through content analysis and obtained a total of 12 criteria. Additional emphasis is necessary in the importance of the criteria selection for the actual application of the CIPP model. Also, a diverse range of information can be obtained through qualitative as well as quantitative methods. Above all, since the CIPP evaluation model execution result becomes the basis for the execution of further improved evaluations, the first attempt of performing without hesitation is essential.
Journal Article
Semi-supervised approaches to efficient evaluation of model prediction performance
2018
In many modern machine learning applications, the outcome is expensive or time consuming to collect whereas the predictor information is easy to obtain. Semi-supervised (SS) learning aims at utilizing large amounts of ‘unlabelled’ data along with small amounts of ‘labelled’ data to improve the efficiency of a classical supervised approach. Though numerous SS learning classification and prediction procedures have been proposed in recent years, no methods currently exist to evaluate the prediction performance of a working regression model. In the context of developing phenotyping algorithms derived from electronic medical records, we present an efficient two-step estimation procedure for evaluating a binary classifier based on various prediction performance measures in the SS setting. In step I, the labelled data are used to obtain a non-parametrically calibrated estimate of the conditional risk function. In step II, SS estimates of the prediction accuracy parameters are constructed based on the estimated conditional risk function and the unlabelled data. We demonstrate that, under mild regularity conditions, the estimators proposed are consistent and asymptotically normal. Importantly, the asymptotic variance of the SS estimators is always smaller than that of the supervised counterparts under correct model specification. We also correct for potential overfitting bias in the SS estimators in finite samples with cross-validation and we develop a perturbation resampling procedure to approximate their distributions. Our proposals are evaluated through extensive simulation studies and illustrated with two real electronic medical record studies aiming to develop phenotyping algorithms for rheumatoid arthritis and multiple sclerosis.
Journal Article
Examining the Impacts of Great Lakes Temperature Perturbations on Simulated Precipitation in the Northeastern United States
by
Langlois, Jessica
,
Huang, Huanping
,
Clemins, Patrick J.
in
Boundary conditions
,
climate models
,
ENVIRONMENTAL SCIENCES
2021
Most inland water bodies are not resolved by general circulation models, requiring that lake surface temperatures be estimated. Given the large spatial and temporal variability of the surface temperatures of the North American Great Lakes, such estimations can introduce errors when used as lower boundary conditions for dynamical downscaling. Lake surface temperatures (LSTs) influence moisture and heat fluxes, thus impacting precipitation within the immediate region and potentially in regions downwind of the lakes. For this study, the Advanced Research version of the Weather Research and Forecasting Model (WRF-ARW) was used to simulate precipitation over the six New England states during a 5-yr historical period. The model simulation was repeated with perturbed LSTs, ranging from 10°C below to 10°C above baseline values obtained from reanalysis data, to determine whether the inclusion of erroneous LST values has an impact on simulated precipitation and synoptic-scale features. Results show that simulated precipitation in New England is statistically correlated with LST perturbations, but this region falls on a wet–dry line of a larger bimodal distribution. Wetter conditions occur to the north and drier conditions occur to the south with increasing LSTs, particularly during the warm season. The precipitation differences coincide with large-scale anomalous temperature, pressure, and moisture patterns. Care must therefore be taken to ensure reasonably accurate Great Lakes surface temperatures when simulating precipitation, especially in southeastern Canada, Maine, and the mid-Atlantic region.
Journal Article