Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Discipline
      Discipline
      Clear All
      Discipline
  • Is Peer Reviewed
      Is Peer Reviewed
      Clear All
      Is Peer Reviewed
  • Item Type
      Item Type
      Clear All
      Item Type
  • Subject
      Subject
      Clear All
      Subject
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
15 result(s) for "stochastic dynamic programming (SDP)"
Sort by:
SDP Policy Iteration-Based Energy Management Strategy Using Traffic Information for Commuter Hybrid Electric Vehicles
This paper demonstrates an energy management method using traffic information for commuter hybrid electric vehicles. A control strategy based on stochastic dynamic programming (SDP) is developed, which minimizes on average the equivalent fuel consumption, while satisfying the battery charge-sustaining constraints and the overall vehicle power demand for drivability. First, according to the sample information of the traffic speed profiles, the regular route is divided into several segments and the statistic characteristics in the different segments are constructed from gathered data on the averaged vehicle speeds. And then, the energy management problem is formulated as a stochastic nonlinear and constrained optimal control problem and a modified policy iteration algorithm is utilized to generate a time-invariant state-dependent power split strategy. Finally, simulation results over some driving cycles are presented to demonstrate the effectiveness of the proposed energy management strategy.
Reinforcement Learning–Based Energy Management Strategy for a Hybrid Electric Tracked Vehicle
This paper presents a reinforcement learning (RL)–based energy management strategy for a hybrid electric tracked vehicle. A control-oriented model of the powertrain and vehicle dynamics is first established. According to the sample information of the experimental driving schedule, statistical characteristics at various velocities are determined by extracting the transition probability matrix of the power request. Two RL-based algorithms, namely Q-learning and Dyna algorithms, are applied to generate optimal control solutions. The two algorithms are simulated on the same driving schedule, and the simulation results are compared to clarify the merits and demerits of these algorithms. Although the Q-learning algorithm is faster (3 h) than the Dyna algorithm (7 h), its fuel consumption is 1.7% higher than that of the Dyna algorithm. Furthermore, the Dyna algorithm registers approximately the same fuel consumption as the dynamic programming–based global optimal solution. The computational cost of the Dyna algorithm is substantially lower than that of the stochastic dynamic programming.
Energy Management Strategy in Consideration of Battery Health for PHEV via Stochastic Control and Particle Swarm Optimization Algorithm
This paper presents an energy management strategy for plug-in hybrid electric vehicles (PHEVs) that not only tries to minimize the energy consumption, but also considers the battery health. First, a battery model that can be applied to energy management optimization is given. In this model, battery health damage can be estimated in the different states of charge (SOC) and temperature of the battery pack. Then, because of the inevitability that limiting the battery health degradation will increase energy consumption, a Pareto energy management optimization problem is formed. This multi-objective optimal control problem is solved numerically by using stochastic dynamic programming (SDP) and particle swarm optimization (PSO) for satisfying the vehicle power demand and considering the tradeoff between energy consumption and battery health at the same time. The optimization solution is obtained offline by utilizing real historical traffic data and formed as mappings on the system operating states so as to implement online in the actual driving conditions. Finally, the simulation results carried out on the GT-SUITE-based PHEV test platform are illustrated to demonstrate that the proposed multi-objective optimal control strategy would effectively yield benefits.
Impacts of Yield and Seasonal Prices on the Operation of Lancang Cascaded Reservoirs
This work formulates a stochastic dynamic programming (SDP) model that incorporates seasonal electricity prices and can handle a constraint on power yield, which is assumed to be satisfied at any time it is possible, thus allowing for an analysis of their impacts on the operational performances of cascaded reservoirs. The model is applied to the Lancang Cascade, specifically its two largest reservoirs, Xiaowan and Nuozhadu. The results show that increasing the power yield of the cascade will reduce energy production unfavorably but will impact water spillage favorably, with a power yield of 2000 MW and with a 91% reliability suggested as being a satisfactory operational target. The case study also suggests that using seasonal electricity prices makes the power generation very unstable during weeks 12–20, which is a period of time that is critical to transferring from dry to flooding seasons.
Improving Dam and Reservoir Operation Rules Using Stochastic Dynamic Programming and Artificial Neural Network Integration Model
The simulation elevation-surface area-storage interrelationship of a reservoir is a crucial task in developing ideal water release policies for reservoir and dam operations. In this study, an inclusive (stochastic dynamic programming-artificial neural network (SDP-ANN)) model was established and applied to obtain an ideal reservoir operation strategy for Sg. Langat reservoir in Malaysia. The problems associated with the management of water resources mostly relate to uncertainty and the stochastic nature of the reservoir inflow, and the SDP-ANN model is meant to consider uncertainty in the input parameters such as reservoir inflow and reservoir evaporation losses. The performance of the SDP-ANN model was compared to that of the stochastic dynamic programming-autoregression (AR) model. The primary aim of the model is to decrease the squared deviation from the desired water release, which we determined by comparing the SDP-AR and SDP-ANN model performances. The results indicate that the SDP-ANN model demonstrated greater resilience and reliability with a lower supply deficit. Consequently, the case study results confirm that the SDP-ANN model performs better than the SDP-AR model in obtaining the best parameters for the reservoir operation. Specifically, a comparison of the models shows that the proposed Model 2 increased the reliability and resilience of the system by 7.5% and 6.3%, respectively.
Optimizing search strategies for invasive pests: learn before you leap
1. Strategic searching for invasive pests presents a formidable challenge for conservation managers. Limited funding can necessitate choosing between surveying many sites cursorily, or focussing intensively on fewer sites. While existing knowledge may help to target more likely sites, e.g. with species distribution models (maps), this knowledge is not flawless and improving it also requires management investment. 2. In a rare example of trading-off action against knowledge gain, we combine search coverage and accuracy, and its future improvement, within a single optimisation framework. More specifically we examine under which circumstances managers should adopt one of two search-and-control strategies (cursory or focussed), and when they should divert funding to improving knowledge, making better predictive maps that benefit future searches. 3. We use a family of Receiver Operating Characteristic curves to reflect the quality of maps that direct search efforts. We demonstrate our framework by linking these to a logistic model of invasive spread such as that for the red imported fire ant Solenopsis invicta in south-east Queensland, Australia. 4. Cursory widespread searching is only optimal if the pest is already widespread or knowledge is poor, otherwise focussed searching exploiting the map is preferable. For longer management timeframes, eradication is more likely if funds are initially devoted to improving knowledge, even if this results in a short-term explosion of the pest population. 5. Synthesis and applications. By combining trade-offs between knowledge acquisition and utilization, managers can better focus - and justify - their spending to achieve optimal results in invasive control efforts. This framework can improve the efficiency of any ecological management that relies on predicting occurrence.
Optimal Rangeland Stocking Decisions Under Stochastic and Climate-Impacted Weather
A Stochastic Dynamic Programming (SDP) model is developed to analyze optimal stocking rates in the face of weather uncertainty and potential climate change projections. The model extends previous work modeling grazing as a predator-prey relationship. Attention is given to profit maximizing decisions when growing season precipitation is unknown. Comparisons are made across results from a model that utilizes constant growing season precipitation in all years. Results suggest that optimal stocking rates and profitability decrease in climate change scenarios with increased precipitation variability as compared to the historical stochastic weather scenario.
Real-time energy management based on ECMS with stochastic optimized adaptive equivalence factor for HEVs
For both globally suboptimal solution and implementable strategy, a real-time energy management strategy, based on equivalent consumption minimization strategy (ECMS), is proposed for commuter hybrid electric vehicles (HEVs) running on fixed routes. The determination of the adaptive equivalence factor is a focus. By the statistical characteristics deriving from historical driving data, the infinite-horizon stochastic dynamic programming (SDP) optimization with a discount factor is first formulated for finding proper equivalence factor according to uncertain driving cycles on a fixed route. And then, a mapping of equivalent factor on the system state is established off-line by stochastic optimal solution deriving from SDP policy iteration algorithm. In the power splits online, the equivalence factor of the implemented adaptive ECMS is obtained from the mapping according to the real time driving condition to achieve the near global optimal control objective that fuel consumption is minimized and the battery state of charge (SOC) is maintained within the boundaries over the whole driving route. Based on the HEV test platform established by specialized GT-Suite, simulation results and comparisons in some real driving cycles are presented to verify the effectiveness of the proposed strategy and to evaluate the advantages over other strategies.
A State-Dependent Model for the Optimal Management of an Invasive Metapopulation
Management of invasive species involves choosing between different management strategy options, but often the best strategy for a particular scenario is not obvious. We illustrate the use of optimization methods to determine the most efficient management strategy using one of the most devastating invasive forest pests in North America, the gypsy moth (Lymantria dispar), as a case study. The optimization approach involves the application of stochastic dynamic programming (SDP) to a metapopulation framework with different infestation patch sizes, with the goal of minimizing infestation spread. We use a novel \"moving window\" approach as a way to address a spatially explicit problem without being explicitly spatial. We examine results for two cases in order to develop general rules of thumb for management. We explore a model with limited parameter information and then assess how strategies change with specific parameterization for the gypsy moth. The model results in a complex but stable, state-dependent management strategy for a multiyear management program that is robust even under situations of uncertainty. The general rule of thumb for the basic model consists of three strategies: eradicating medium-density infestations, reducing large-density infestations, and reducing the colonization rate from the main infestation, depending on the state of the system. With specific gypsy moth parameterization, reducing colonization decreases in importance relative to the other two strategies. The application of this model to gypsy moth management emphasizes the importance of managing based on the state of the system, and if applied to a specific geographic area, has the potential to substantially improve the efficiency and cost-effectiveness of current gypsy moth eradication programs, helping to slow the spread of this pest. Additionally, the approach used for this particular invasive species can be extended to the optimization of management programs for the spread of other invasive and problem species exhibiting metapopulation dynamics.
Integrated Artificial Neural Network (ANN) and Stochastic Dynamic Programming (SDP) Model for Optimal Release Policy
Complexicity in reservoir operation poses serious challenges to water resources planners and managers. These challenges of water reservoir operation are illustrated using a simulation to aid the development of an optimal operation policy for dam and reservoir. To achieve this, a Comprehensive Stochastic Dynamic Programming with Artificial Neural Network (SDP-ANN) model were developed and tested at Sg. Langat Reservoir in Malaysia. The nonlinearity of the natural physical processes was a major problem in determining the simulation of the reservoir parameters (elevation, surface-area, storage). To overcome water shortages resulting from uncertainty, the SDP-ANN model was used to evaluate the input variable and the performance outcome of the Model were compared with the Stochastic Dynamic Programming integrated with auto-regression (SDP-AR) model. The objective function of the models was set to minimize the sum of squared deviation from the desired targeted supply. Comparison result on the performance between SDP-AR model policy with SDP-ANN model found that the SDP-ANN model is a reliable and resilience model with a lesser supply deficit. The study concludes that the SDP-ANN model performs better than the SDP-AR model in deriving an optimal operating policy for the reservoir.