This series of essays explores the optimization of portfolio weights to maximize a Constant Relative Risk Aversion (CRRA) utility function over an agent's wealth. We use classic stochastic calculus techniques to model price processes as Geometric Brownian Motion (GBM). In Part I, we derive the optimal allocation between two risky assets and find our solution is an extension of the famous Merton Share. In Part II, we extend the analysis to three assets and then to an n-asset model. In Part III, we examine what happens when we change the numeraire from cash to a risky asset.
Part I: The Binary Asset Model
Model Definition
Suppose we have a universe of two stocks, A and B, modeled as independent GBM processes with parameters μA, μB, σA, and σB. We also define λA and λB to be our portfolio weights for assets A and B, such that λA+λB=1. Finally, we have a CRRA utility function over possible wealth states W such that U(W)=1−γW1−γ−1 and γ is our relative risk aversion parameter. In what follows, we attempt to find the optimal portfolio weights λ which maximize the expected utility of our future wealth.
Deriving a Closed Form Expected Utility Function
We intend to find the portfolio allocation [λA,λB] which maximizes the expected utility of our wealth in the next period, that is:
λA,λBmaxE[U(Wt+dt)].
Incorporating our CRRA utility function yields E[U(Wt+dt)]=E[1−γ(Wt+dWt)1−γ−1].
We can now define our wealth dynamic dWt as evolving according to the chosen portfolio weights λA and λB
dWt=λAWtSA,tdSA,t+λBWtSB,tdSB,t.
Similarly, we note that each stock's price follows a GBM, defined by the stochastic differential equations (SDEs)
We now consider the second-order Taylor series expansion of E[(1+x)1−γ] around 1 because we know x will be very small since we're dealing with an infinitesimally small time increment dt. In this case, x=λA(μAdt+σAdNA,t)+λB(μBdt+σBdNB,t).
We remember that the Taylor series of a function f(x) around a point a is given by
f(x)=f(a)+f′(a)(x−a)+2!f′′(a)(x−a)2+…
and we are careful to make sure to include the second order term which includes our volatility parameters.
This implies that E[(1+x)1−γ]≈1+(1−γ)E[x]+2(1−γ)(−γ)E[x2].
Using x=λA(μAdt+σAdNA,t)+λB(μBdt+σBdNB,t), we see that E[x]=(λAμA+λBμB)dt because E[dNi,t]=0.
Furthermore, to solve for E[x2], we substitute in x which gives us
E[x2]=E[(λA(μAdt+σAdNA,t)+λB(μBdt+σBdNB,t))2].
At this point, some properties of Brownian motion come to our aid, particularly that E[dNi,t]=0, E[dNi,t2]=dt, and E[dNA,tdNB,t]=0 (since A and B are independent processes).
We can make the following simplifications:
(λA(μAdt+σAdNA,t))2=λA2σA2dt
(λB(μBdt+σBdNB,t))2=λB2σB2dt
2λAλB(μAdt+σAdNA,t)(μBdt+σBdNB,t)=0
Putting it all together, the simplified expression for E[x2]=λA2σA2dt+λB2σB2dt.
Returning back to the earlier expectation we've been trying to solve with these new results in hand, we see that
Quick aside, my hunch is that if we were to extend to a multi-asset model this becomes:
E[U(Wt+dt)]=1−γWt1−γ(1+(1−γ)(∑i=1nλiμi)dt+2(1−γ)(−γ)(∑i=1nλi2σi2)dt)−1
Now, because we're in a dual-asset model where the weights sum to 100%, λB is determined to be 1−λA. We can subsitute this into the expression above which gives us:
Optimizing Portfolio Weights to Maximize Expected Utility
In order to maximize E[U(Wt+dt)], we can follow the classic method of differentiating the function with respect to λA and setting this partial derivative equal to zero.
First, we notice that the term 1−γW1−γ−1 is a constant with respect to λA, so we can focus on differentiating only the bracketed expression, which we can denote as f(λA):
We can see that (1−γ)dt is a common factor in both terms, and since dt is an infinitesimal time increment (which importantly is not zero), we can simplify the equation by dividing through by (1−γ)dt:
Given two assets modeled as independent GBM processes, wealth Wt, and a CRRA utility function, we have found that the optimal allocation to asset A is λA=γ(σA2+σB2)μA−μB+γσB2 and the optimal allocation to asset B is λB=γ(σA2+σB2)μB−μA+γσA2.
You might have noticed that the portfolio allocations λA and λB don't have a subscript t. This is because, given that the price processes are stationary and that our risk aversion parameter γ does not change, they are time and wealth independent! This implies a constant fractional allocation to each stock in our portfolio.
Notably, in the case where B is a risk-free investment, implying that σB2=0, our optimal allocation reduces to λA=γσA2μA−μB, which is the famous Merton Share!
While all models are lossy, I take issue with the idea of a risk-free rate. In particular, the real returns on a nation's treasuries are sensitive to interest rate changes, inflation, and currency fluctuations. It also shouldn't be overlooked that big debt crises occur fairly regularly and nations do default. With this in mind, I think extending the dual-asset Merton Share model to two risky assets is an improvement toward realism.
Part II: Extension to Three Assets
In Part I, we derived the optimal allocations under an independent binary asset model where the two stocks follow geometric Brownian motion processes. We now extend the analysis to three assets and then to an n-asset model.
Model Definition
We have three assets A, B, and C whose price processes follow GBM processeses with parameters (μA,σA), (μB,σB), and (μC,σC), respectively. We allocate our wealth W between A, B, and C in proportion λA, λB, and λC, respectively, such that λA+λB+λC=1. We again maintain a Constant Relative Risk Aversion (CRRA) utility function U(W)=1−γW1−γ−1 where Wt is our wealth at time t and γ is our relative risk aversion parameter.
Deriving Our Expected Utility Function
We intend to find the portfolio allocation [λA,λB,λC] which maximizes the expected utility of our wealth in the next period, that is:
λA,λB,λCmaxE[U(Wt+dt)].
After incorporating our CRRA utility function, we see E[U(Wt+dt)]=E[1−γ(Wt+dWt)1−γ−1].
We can now define our wealth dynamic dWt as evolving according to the chosen portfolio weights λA, λB, and λC.
We now consider the second-order Taylor series expansion of E[(1+x)1−γ] around 1 because we know x will be very small since we're dealing with an infinitesimally small time increment dt. In this case, x=λA(μAdt+σAdNA,t)+λB(μBdt+σBdNB,t)+λC(μCdt+σCdNC,t).
We remember that the Taylor series of a function f(x) around a point a is given by
f(x)=f(a)+f′(a)(x−a)+2!f′′(a)(x−a)2+…
and we are careful to make sure to include the second order term which includes our volatility parameters.
This implies that E[(1+x)1−γ]≈1+(1−γ)E[x]+2(1−γ)(−γ)E[x2].
Using x=λA(μAdt+σAdNA,t)+λB(μBdt+σBdNB,t)+λC(μCdt+σCdNC,t), we see that E[x]=(λAμA+λBμB+λCμC)dt because E[dNi,t]=0.
Furthermore, to solve for E[x2], we substitute in x which gives us
Though it seems a bit unwieldy, we can simplify the expression E[(λA(μAdt+σAdNA,t)+λB(μBdt+σBdNB,t)+λC(μCdt+σCdNC,t))2], we need to first expand the square and use the properties of Wiener processes, notably that E[dNi,t]=0 and E[dNi,t2]=dt.
We can now simplify each term by considering the properties of dNi,t noted before:
For terms like λA2(μAdt+σAdNA,t)2, the expansion will give λA2μA2dt2+2λA2μAσAdtdNA,t+λA2σA2dNA,t2. When taking the expected value of this, the dtdNA,t term disappears, and dNA,t2 becomes dt, leaving λA2σA2dt.
When taking the expected value of this, the dt,dNA,t term disappears, and dNA,t2 becomes dt, leaving λA2σA2dt.
The cross terms like 2λAλB(μAdt+σAdNA,t)(μBdt+σBdNB,t) expand out to
2λAλBμAμBdt2+2λAλBμAσBdtdNB,t+2λAλBσAμBdtdNA,t+2λAλBσAσBdNA,tdNB,t.
Each term in this expression goes to zero because E[dt2]=0 and E[Ni,t]=0.
After applying these simplifications, the expected value expression becomes:
We make one final adjustment by including our λA+λB+λC=1 constraint to reduce a degree of freedom our model.
We first substitute λC=1−λA−λB into the linear term λAμA+λBμB+λCμC=λAμA+λBμB+(1−λA−λB)μC=λA(μA−μC)+λB(μB−μC)+μC. This also makes intuitive sense because initially the expression was the sum of all of our allocation percentages times the average return of those investments which is the expected return of our portfolio. The final expression is the same expected return of our portfolio, except we can conceptualize this as 100% of our portfolio returning μC, and then for each non-C asset we compute how much more or less we'd make on that fraction of our portfolio against a C-based benchmark.
Now we need to handle the quadratic term, λA2σA2+λB2σB2+λC2σC2. We substitute out λC which yields λA2σA2+λB2σB2+(1−λA−λB)2σC2. We note that generally (1−∑i=1nxi)2=1−2∑i=1nxi+2∑1≤i<j≤nxixj+∑i=1nxi2, this generalization will help us when we extend to the n-asset framework, but we can use it in our three-asset model too.
Now we expand and simplify the quadratic term for λC:
This expression for our expected marginal utility incorporates all of the constraints of our model now that λC has been eliminated and replaced by solely λA and λB, which crucially reduces a degree of freedome from our model and allows the matrix inversion technique which follows to succeed.
Optimizing Portfolio Weights to Maximize Expected Utility (Take Three)
We know that the maximium of E[U(Wt+dt)] w.r.t. our λis will have a tangent plane with zero gradient in the λA and λB directions. That is, dλidE[U(Wt+dt)]=0 for i∈{1,2}. From this we will get a system of equations which we can then solve to get our optimal portfolio allocations. We start by solving for dλAdE[U(Wt+dt)].
We can apply a symmetry argument to find dλBdE[U] because λB can be interchanged with λA without changing E[U]. Setting each of these partial derivative to equal zero gives us the system of equations we're looking for.
In Part I, we made the case that there may be no such thing as a risk-free asset. In the case of treasuries, the typical example of the risk-free asset, the holder is exposed to inflation risks, dollar fluctuations, interest rate changes, and other factors. Given this, we constructed a model for the optimal allocation between two risky assets.
Every investor has their own basket of goods under which they estimate changes in their real purchasing power.1 This subjective basket is not exactly cash, which was the motivation for the first essay. But what if it were somehow a known tradable asset? Or even more simply, perhaps an investor wants to denominate their returns in ETH, SPY, or some known liquid asset. Does our previous analysis still hold if we change the currency units? In this essay, I examine the case where, instead of using cash as a base for both assets A and B, we designate asset A as the numeraire, expressing asset B in terms of A.
Let us begin with the optimal allocations derived from the previous cash-denominated model:
Both assets A and B are initially denominated in cash. We now shift our perspective by setting asset A as the numeraire, effectively redefining all quantities in relation to A. This transition moves us from a cash-denominated framework to one in which asset A is the central reference point.
Key Definitions in the A-Denominated Model:
SB/A: Price of asset B relative to A.
SA/A: Price of asset A relative to itself.
μB/A, σB/A: Drift and volatility of B with respect to A.
λA/A, λB/A: Portfolio weights for A and B in the A-denominated framework.
WA,t: Wealth expressed in terms of A.
dWA,t: Wealth dynamics in terms of A.
We start by solving for the simplest expressions.
SA/A is the price of asset A relative to itself, so SA/A=SA/ASA/A=1.
Since SA/A=1, its drift and volatility are zero: μA/A=0, σA/A=0.
The price of B in terms of A is given by SB/A=SASB.
In this model, the drift and volatility of B relative to A are defined as:
μB/A=μB−μA,σB/A=σA2+σB2.
The transition from the cash-denominated optimal B allocation, λB=γ(σA2+σB2)μB−μA+γσA2, to the A-denominated model can be achieved by the following transformations:
Replace the cash-denominated drift difference μB−μA with μB/A:
Next, replace the combined variance σA2+σB2 with σB/A2:
γ(σA2+σB2)μB/A+γσA2=γσB/A2μB/A+γσA2
Finally, note that the numeraire asset A/A has zero drift and zero volatility: μA/A=0 and σA/A=0:
γσB/A2μB/A+γσA2=γσB/A2μB/A
Thus, the optimal weight for B in the A-denominated framework becomes:
λB/A=γσB/A2μB/A
The derivation above is perfectly legitamate, though if we don't want to take as axiom the results of my previous essay, we can start again from scratch.
Fully Deriving the Optimal Portfolio Weights in the A-Denominated Model
We start with maximizing our expected CRRA utility as before:
We note that in the A-denominated model the only risky asset is B/A, so the wealth dynamics are driven solely by B/A. The optimization problem simplifies to maximizing expected utility with respect to λB/A.
Since SA/A=1, asset A contributes no differential to wealth dynamics in A-terms. Thus, wealth dynamics in this A-denominated framework are driven solely by B/A:
We can now express the optimal weights in the A-denominated model directly in terms of μB/A and σB/A2.
λB/A=γσB/A2μB/A,λA/A=1−λB/A.
Thus, after transforming the cash-denominated model's optimal weights to the A-denominated model's optimal weights, we see that we've derived the famous Merton share.
Footnotes
I am skeptical of standard national CPI measures, as discussed in Chapter 5 of Keynes' Treatise on Money. I think exact CPI calculation seems like a fundamentally futile task, though it's nuanced so I haven't made up my mind yet. ↩