Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
22,021
result(s) for
"Utility functions"
Sort by:
Value creation through stakeholder synergy
by
Priem, Richard L.
,
Tantalo, Caterina
in
multi-attribute utility functions
,
stakeholders
,
strategy
2016
Our \"stakeholder synergy\" perspective identifies new value creation opportunities that are especially effective strategically because a single strategic action (1) increases different types of value for two or more essential stakeholder groups simultaneously, and (2) does not reduce the value already received by any other essential stakeholder group. This result is obtainable because multiple potential sources of value creation exist for each essential stakeholder group. Actions that meet these criteria increase the size of the value \"pie\" available for essential stakeholder groups, and thereby serve to attract exceptional stakeholders and obtain their increasing effort and commitment. The stakeholder synergy perspective extends stakeholder theory further into the strategy realm, and offers insights for realizing broader value creation that is more likely to produce sustainable competitive advantage.
Journal Article
Actor-critic multi-objective reinforcement learning for non-linear utility functions
by
Hayes, Conor F.
,
Roijers, Diederik M.
,
Reymond, Mathieu
in
Algorithms
,
Artificial Intelligence
,
Computer Science
2023
We propose a novel multi-objective reinforcement learning algorithm that successfully learns the optimal policy even for non-linear utility functions. Non-linear utility functions pose a challenge for SOTA approaches, both in terms of learning efficiency as well as the solution concept. A key insight is that, by proposing a critic that learns a multi-variate distribution over the returns, which is then combined with accumulated rewards, we can directly optimize on the utility function, even if it is non-linear. This allows us to vastly increase the range of problems that can be solved compared to those which can be handled by single-objective methods or multi-objective methods requiring linear utility functions, yet avoiding the need to learn the full Pareto front. We demonstrate our method on multiple multi-objective benchmarks, and show that it learns effectively where baseline approaches fail.
Journal Article
Guilt in Games
by
Dufwenberg, Martin
,
Battigalli, Pierpaolo
in
Beliefs in the Utility Function
,
Blame
,
Conditional probabilities
2007
Journal Article
Environmental Policy Selection Based on Linear-Times-Exponential One-Switch Utility Function and ELECTRE I Method
This paper examines how utility functions perform in tackling the multicriteria decision-making problem, especially one-switch utility function. Linear-times-exponential one-switch, exponential, and linear utility functions are implemented, which transforms corresponding criteria into utilities with ELECTRE I method. The detailed formulation of the decision model is presented. A numerical example about environmental policy selection is introduced to illustrate the use of the new decision model. With different wealth levels and utility functions for a policymaker, the inconsistent outranking policies illustrate the special characteristic of linear-times-exponential one-switch utility function whose initial wealth level has a significant impact on the outranking environmental policy. This study is also the first study applying one-switch utility function in address/ing multicriteria decision-making problem.
Journal Article
Portfolio Choice Under Cumulative Prospect Theory: An Analytical Treatment
by
He, Xue Dong
,
Zhou, Xun Yu
in
Applied sciences
,
cumulative prospect theory
,
Decision making models
2011
We formulate and carry out an analytical treatment of a single-period portfolio choice model featuring a reference point in wealth, S-shaped utility (value) functions with loss aversion, and probability weighting under Kahneman and Tversky's
cumulative prospect theory
(CPT). We introduce a new measure of loss aversion for large payoffs, called the
large-loss aversion degree
(LLAD), and show that it is a critical determinant of the well-posedness of the model. The sensitivity of the CPT value function with respect to the stock allocation is then investigated, which, as a by-product, demonstrates that this function is neither concave nor convex. We finally derive optimal solutions explicitly for the cases in which the reference point is the risk-free return and those in which it is not (while the utility function is piecewise linear), and we employ these results to investigate comparative statics of optimal risky exposures with respect to the reference point, the LLAD, and the curvature of the probability weighting.
This paper was accepted by Wei Xiong, finance.
Journal Article
ARROVIAN AGGREGATION OF CONVEX PREFERENCES
2020
We consider social welfare functions that satisfy Arrow’s classic axioms of independence of irrelevant alternatives and Pareto optimality when the outcome space is the convex hull of some finite set of alternatives. Individual and collective preferences are assumed to be continuous and convex, which guarantees the existence of maximal elements and the consistency of choice functions that return these elements, even without insisting on transitivity. We provide characterizations of both the domains of preferences and the social welfare functions that allow for anonymous Arrovian aggregation. The domains admit arbitrary preferences over alternatives, which completely determine an agent’s preferences over all mixed outcomes. On these domains, Arrow’s impossibility turns into a complete characterization of a unique social welfare function, which can be readily applied in settings involving divisible resources such as probability, time, or money.
Journal Article
Linking Mode Choice with Travel Behavior by Using Logit Model Based on Utility Function
by
Al-Salih, Wissam Qassim
,
Esztergár-Kiss, Domokos
in
Behavior
,
Choice of transportation
,
Households
2021
The currently available transport modeling tools are used to evaluate the effects of behavior change. The aim of this study is to analyze the interaction between the transport mode choice and travel behavior of an individual—more specifically, to identify which of the variables has the greatest effect on mode choice. This is realized by using a multinomial logit model (MNL) and a nested logit model (NL) based on a utility function. The utility function contains activity characteristics, trip characteristics including travel cost, travel time, the distance between activity place, and the individual characteristics to calculate the maximum utility of the mode choice. The variables in the proposed model are tested by using real observations in Budapest, Hungary as a case study. When analyzing the results, it was found that “Trip distance” variable was the most significant, followed by “Travel time” and “Activity purpose”. These parameters have to be mainly considered when elaborating urban traffic models and travel plans. The advantage of using the proposed logit models and utility function is the ability to identify the relationship among the travel behavior of an individual and the mode choice. With the results, it is possible to estimate the influence of the various variables on mode choice and identify the best mode based on the utility function.
Journal Article
Strictly Proper Scoring Rules, Prediction, and Estimation
2007
Scoring rules assess the quality of probabilistic forecasts, by assigning a numerical score based on the predictive distribution and on the event or value that materializes. A scoring rule is proper if the forecaster maximizes the expected score for an observation drawn from the distributionF if he or she issues the probabilistic forecast F, rather than G ≠ F. It is strictly proper if the maximum is unique. In prediction problems, proper scoring rules encourage the forecaster to make careful assessments and to be honest. In estimation problems, strictly proper scoring rules provide attractive loss and utility functions that can be tailored to the problem at hand. This article reviews and develops the theory of proper scoring rules on general probability spaces, and proposes and discusses examples thereof. Proper scoring rules derive from convex functions and relate to information measures, entropy functions, and Bregman divergences. In the case of categorical variables, we prove a rigorous version of the Savage representation. Examples of scoring rules for probabilistic forecasts in the form of predictive densities include the logarithmic, spherical, pseudospherical, and quadratic scores. The continuous ranked probability score applies to probabilistic forecasts that take the form of predictive cumulative distribution functions. It generalizes the absolute error and forms a special case of a new and very general type of score, the energy score. Like many other scoring rules, the energy score admits a kernel representation in terms of negative definite functions, with links to inequalities of Hoeffding type, in both univariate and multivariate settings. Proper scoring rules for quantile and interval forecasts are also discussed. We relate proper scoring rules to Bayes factors and to cross-validation, and propose a novel form of cross-validation known as random-fold cross-validation. A case study on probabilistic weather forecasts in the North American Pacific Northwest illustrates the importance of propriety. We note optimum score approaches to point and quantile estimation, and propose the intuitively appealing interval score as a utility function in interval estimation that addresses width as well as coverage.
Journal Article
Relative risk aversion must be close to 1
2025
Any utility function that is unbounded either from below or from above implies paradoxical behavior. However, these paradoxes may be regarded as irrelevant if they involve wealth levels that are realistically meaningless. Employing real-world constraints on wealth reveals that CRRA utility with relative risk aversion outside of the range 0.75–1.15 yields paradoxical choices that very few individuals, if any, would ever make. Thus, relative risk aversion must be close to 1, the value corresponding to log preferences. These results shed new light on the longstanding debate about the geometric-mean criterion and the argument of stocks for the long-run.
Journal Article
Measuring the suboptimality of dividend controls in a Brownian risk model
2023
We consider an insurance company modelling its surplus process by a Brownian motion with drift. Our target is to maximise the expected exponential utility of discounted dividend payments, given that the dividend rates are bounded by some constant. The utility function destroys the linearity and the time-homogeneity of the problem considered. The value function depends not only on the surplus, but also on time. Numerical considerations suggest that the optimal strategy, if it exists, is of a barrier type with a nonlinear barrier. In the related article of Grandits et al. (Scand. Actuarial J. 2, 2007), it has been observed that standard numerical methods break down in certain parameter cases, and no closed-form solution has been found. For these reasons, we offer a new method allowing one to estimate the distance from an arbitrary smooth-enough function to the value function. Applying this method, we investigate the goodness of the most obvious suboptimal strategies—payout on the maximal rate, and constant barrier strategies—by measuring the distance from their performance functions to the value function.
Journal Article