5. Information and Consumption Smoothing#
In addition to what’s in Anaconda, this lecture employs the following libraries:
!pip install --upgrade quantecon
Show code cell output
Requirement already satisfied: quantecon in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (0.7.2)
Requirement already satisfied: numba>=0.49.0 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from quantecon) (0.60.0)
Requirement already satisfied: numpy>=1.17.0 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from quantecon) (1.26.4)
Requirement already satisfied: requests in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from quantecon) (2.31.0)
Requirement already satisfied: scipy>=1.5.0 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from quantecon) (1.11.4)
Requirement already satisfied: sympy in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from quantecon) (1.12)
Requirement already satisfied: llvmlite<0.44,>=0.43.0dev0 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from numba>=0.49.0->quantecon) (0.43.0)
Requirement already satisfied: charset-normalizer<4,>=2 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from requests->quantecon) (2.0.4)
Requirement already satisfied: idna<4,>=2.5 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from requests->quantecon) (3.4)
Requirement already satisfied: urllib3<3,>=1.21.1 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from requests->quantecon) (2.0.7)
Requirement already satisfied: certifi>=2017.4.17 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from requests->quantecon) (2024.2.2)
Requirement already satisfied: mpmath>=0.19 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/site-packages (from sympy->quantecon) (1.3.0)
5.1. Overview#
In the linear-quadratic permanent income of consumption smoothing model described in this quantecon lecture, a scalar parameter \(\beta \in (0,1)\) plays two roles:
it is a discount factor that the consumer applies to future utilities from consumption
it is the reciprocal of the gross interest rate on risk-free one-period loans
That \(\beta\) plays these two roles is essential in delivering the outcome that, regardless of the stochastic process that describes his non-financial income, the consumer chooses to make consumption follow a random walk (see [Hall, 1978]).
In this lecture, we assign a third role to \(\beta\):
it describes a first-order moving average process for the growth in non-financial income
5.1.1. Same non-financial incomes, different information#
We study two consumers who have exactly the same nonfinancial income process and who both conform to the linear-quadratic permanent income of consumption smoothing model described here.
The two consumers have different information about their future nonfinancial incomes.
A better informed consumer each period receives news in the form of a shock that simultaneously affects both today’s nonfinancial income and the present value of future nonfinancial incomes in a particular way.
A less informed consumer each period receives a shock that equals the part of today’s nonfinancial income that could not be forecast from past values of nonfinancial income.
Even though they receive exactly the same nonfinancial incomes each period, our two consumers behave differently because they have different information about their future nonfinancial incomes.
The second consumer receives less information about future nonfinancial incomes in a sense that we shall make precise.
This difference in their information sets manifests itself in their responding differently to what they regard as time \(t\) information shocks.
Thus, although at each date they receive exactly the same histories of nonfinancial income, our two consumers receive different shocks or news about their future nonfinancial incomes.
We use the different behaviors of our consumers as a way to learn about
operating characteristics of a linear-quadratic permanent income model
how the Kalman filter introduced in this lecture and/or another representation of the theory of optimal forecasting introduced in this lecture embody lessons that can be applied to the news and noise literature
ways of representing and computing optimal decision rules in the linear-quadratic permanent income model
a Ricardian equivalence outcome that describes effects on optimal consumption of a tax cut at time \(t\) accompanied by a foreseen permanent increases in taxes that is just sufficient to cover the interest payments used to service the risk-free government bonds that are issued to finance the tax cut
a simple application of alternative ways to factor a covariance generating function along lines described in this lecture
This lecture can be regarded as an introduction to invertibility issues that take center stage in the analysis of fiscal foresight by Eric Leeper, Todd Walker, and Susan Yang [Leeper et al., 2013], as well as in chapter 4 of [Sargent et al., 1991].
5.2. Two Representations of One Nonfinancial Income Process#
We study consequences of endowing a consumer with one of two alternative representations for the change in the consumer’s nonfinancial income \(y_{t+1} - y_t\).
For both types of consumer, a parameter \(\beta \in (0,1)\) plays three roles.
It appears
as a discount factor applied to future expected one-period utilities,
as the reciprocal of a gross interest rate on one-period loans, and
as a parameter in a first-order moving average that equals the increment in a consumer’s non-financial income
The first representation, which we shall sometimes refer to as the more informative representation, is
where \(\{\epsilon_t\}\) is an i.i.d. normally distributed scalar process with means of zero and contemporaneous variances \(\sigma_\epsilon^2\).
This representation of the process is used by a consumer who at time \(t\) knows both \(y_t\) and the shock \(\epsilon_t\) and can use both of them to forecast future \(y_{t+j}\)’s.
As we’ll see below, representation (5.1) has the peculiar property that a positive shock \(\epsilon_{t+1}\) leaves the discounted present value of the consumer’s financial income at time \(t+1\) unaltered.
The second representation of the same \(\{y_t\}\) process is
where \(\{a_t\}\) is another i.i.d. normally distributed scalar process, with means of zero and now variances \(\sigma_a^2 > \sigma_\epsilon^2\).
The i.i.d. shock variances are related by
so that the variance of the innovation exceeds the variance of the original shock by a multiplicative factor \(\beta^{-2}\).
Representation (5.2) is the innovations representation of equation (5.1) associated with Kalman filtering theory.
To see how this works, note that equating representations (5.1) and (5.2) for \(y_{t+1} - y_t\) implies \(\epsilon_{t+1} - \beta^{-1} \epsilon_t = a_{t+1} - \beta a_t\), which in turn implies
Solving this difference equation backwards for \(a_{t+1}\) gives, after a few lines of algebra,
which we can also write as
where \(L\) is the one-period lag operator, \(h(L) = \sum_{j=0}^\infty h_j L^j\), \(I\) is the identity operator, and
Let \(g_j \equiv E z_t z_{t-j}\) be the \(j\)th autocovariance of the \(\{y_t - y_{t-1}\}\) process.
Using calculations in the quantecon lecture, where \(z \in C\) is a complex variable, the covariance generating function \(g (z) = \sum_{j=-\infty}^\infty g_j z^j\) of the \(\{y_t - y_{t-1}\}\) process equals
which confirms that \(\{a_t\}\) is a serially uncorrelated process with variance
To verify these claims, just notice that \(g(z) = \beta^{-2} \sigma_\epsilon^2\) implies that
\(g_0 = \beta^{-2} \sigma_\epsilon^2\), and
\(g_j = 0\) for \(j \neq 0\).
Alternatively, if you are uncomfortable with covariance generating functions, note that we can directly calculate \(\sigma_a^2\) from formula (5.3) according to
5.3. Application of Kalman filter#
We can also use the the Kalman filter to obtain representation (5.2) from representation (5.1).
Thus, from equations associated with the Kalman filter, it can be verified that the steady-state Kalman gain \(K = \beta^2\) and the steady state conditional covariance
In a little more detail, let \(z_t = y_t - y_{t-1}\) and form the state-space representation
and assume that \(\sigma_\epsilon = 1\) for convenience
Let’s compute the steady-state Kalman filter for this system.
Let \(K\) be the steady-state gain and \(a_{t+1}\) the one-step ahead innovation.
The steady-state innovations representation is
By applying formulas for the steady-state Kalman filter, by hand it is possible to verify that \(K = \beta^2, \sigma_a^2 = \beta^{-2} \sigma_\epsilon^2 = \beta^{-2},\) and \(\Sigma = (1-\beta^2) \sigma_\epsilon^2\).
Alternatively, we can obtain these formulas via the classical filtering theory described in this lecture.
5.4. News Shocks and Less Informative Shocks#
Representation (5.1) is cast in terms of a news shock \(\epsilon_{t+1}\) that represents a shock to nonfinancial income coming from taxes, transfers, and other random sources of income changes known to a well-informed person who perhaps has all sorts of information about the income process.
Representation (5.2) for the same income process is driven by shocks \(a_t\) that contain less information than the news shock \(\epsilon_t\).
Representation (5.2) is called the innovations representation for the \(\{y_t - y_{t-1}\}\) process.
It is cast in terms of what time series statisticians call the innovation or fundamental shock that emerges from applying the theory of optimally predicting nonfinancial income based solely on the information in past levels of growth in nonfinancial income.
Fundamental for the \({y_t}\) process means that the shock \(a_t\) can be expressed as a square-summable linear combination of \(y_t, y_{t-1}, \ldots\).
The shock \(\epsilon_t\) is not fundamental because it has more information about the future of the \(\{y_t - y_{t-1}\}\) process than is contained in \(a_t\).
Representation (5.3) reveals the important fact that the original shock \(\epsilon_t\) contains more information about future \(y\)’s than is contained in the semi-infinite history \(y^t = [y_t, y_{t-1}, \ldots ]\).
Staring at representation (5.3) for \(a_{t+1}\) shows that it consists both of new news \(\epsilon_{t+1}\) as well as a long moving average \((\beta - \beta^{-1})\sum_{j=0}^\infty \beta^j\epsilon_{t-j}\) of old news.
The more information representation (5.1) asserts that a shock \(\epsilon_{t}\) results in an impulse response to nonfinancial income of \(\epsilon_t\) times the sequence
so that a shock that increases nonfinancial income \(y_t\) by \(\epsilon_t\) at time \(t\) is followed by a change in future \(y\) of \(\epsilon_t\) times \(1 - \beta^{-1} < 0\) in all subsequent periods.
Because \(1 - \beta^{-1} < 0\), this means that a positive shock of \(\epsilon_t\) today raises income at time \(t\) by \(\epsilon_t\) and then permanently decreases all future incomes by \((\beta^{-1} -1)\epsilon_t\).
This pattern precisely describes the following mental experiment:
The consumer receives a government transfer of \(\epsilon_t\) at time \(t\).
The government finances the transfer by issuing a one-period bond on which it pays a gross one-period risk-free interest rate equal to \(\beta^{-1}\).
In each future period, the government rolls over the one-period bond and so continues to borrow \(\epsilon_t\) forever.
The government imposes a lump-sum tax on the consumer in order to pay just the current interest on the original bond and its rolled over successors.
Thus, in periods \(t+1, t+2, \ldots\), the government levies a lump-sum tax on the consumer of \(\beta^{-1} -1\) that is just enough to pay the interest on the bond.
The present value of the impulse response or moving average coefficients equals \(d_\epsilon(L) = \frac{0}{1 -\beta } =0\), a fact that we’ll see again below.
Representation (5.2), i.e., the innovations representation, asserts that a shock \(a_{t}\) results in an impulse response to nonfinancial income of \(a_t\) times
so that a shock that increases income \(y_t\) by \(a_t\) at time \(t\) can be expected to be followed by an increase in \(y_{t+j}\) of \(a_t\) times \(1 - \beta > 0\) in all future periods \(j=1, 2, \ldots\).
The present value of the impulse response or moving average coefficients for representation (5.2) is \(d_a(\beta) = \frac{1 -\beta^2}{1 -\beta } = (1 + \beta)\), another fact that will be important below.
5.5. Representation of \(\epsilon_t\) Shock in Terms of Future \(y_t\)#
Notice that reprentation (5.1), namely, \(y_{t+1} - y_t = -\beta^{-1} \epsilon_t + \epsilon_{t+1}\) implies the linear difference equation
Solving forward we obtain
This equation shows that \(\epsilon_t\) equals \(\beta\) times the one-step-backwards error in optimally backcasting \(y_t\) based on the semi-infinite future \(y^t_+ \equiv [y_{t+1}, y_{t+2}, \ldots ]\) via the optimal backcasting formula
Thus, \(\epsilon_t\) exactly reveals the gap between \(y_t\) and \(E [ y_t | y^t_+]\).
5.6. Representation in Terms of \(a_t\) Shocks#
Next notice that representation (5.2), namely, \(y_{t+1} - y_t = - \beta a_t + a_{t+1}\) implies the linear difference equation
Solving this equation backward establishes that the one-step-prediction error \(a_{t+1}\) is
Here the information set is \(y^t = [y_t, y_{t-1}, \ldots ]\) and a one step-ahead optimal prediction is
5.7. Permanent Income Consumption-Smoothing Model#
When we computed optimal consumption-saving policies for our two representations (5.1) and (5.2) by using formulas obtained with the difference equation approach described in quantecon lecture, we obtained:
for a consumer having the information assumed in the news representation (5.1):
for a consumer having the more limited information associated with the innovations representation (5.2):
These formulas agree with outcomes from Python programs below that deploy state-space representations and dynamic programming.
Evidently, although they receive exactly the same histories of nonfinancial incomethe two consumers behave differently.
The better informed consumer who has the information sets associated with representation (5.1) responds to each shock \(\epsilon_{t+1}\) by leaving his consumption unaltered and saving all of \(\epsilon_{t+1}\) in anticipation of the permanently increased taxes that he will bear in order to service the permanent interest payments on the risk-free bonds that the government has presumably issued to pay for the one-time addition \(\epsilon_{t+1}\) to his time \(t+1\) nonfinancial income.
The less well informed consumer who has information sets associated with representation (5.2) responds to a shock \(a_{t+1}\) by increasing his consumption by what he perceives to be the permanent part of the increase in consumption and by increasing his saving by what he perceives to be the temporary part.
The behavior of the better informed consumer sharply illustrates the behavior predicted in a classic Ricardian equivalence experiment.
5.8. State Space Representations#
We now cast our representations (5.1) and (5.2), respectively, in terms of the following two state space systems:
and
where \(\{v_t\}\) and \(\{u_t\}\) are both i.i.d. sequences of univariate standardized normal random variables.
These two alternative income processes are ready to be used in the framework presented in the section “Comparison with the Difference Equation Approach” in thid quantecon lecture.
All the code that we shall use below is presented in that lecture.
5.9. Computations#
We shall use Python to form two state-space representations (5.4) and (5.5).
We set the following parameter values \(\sigma_\epsilon = 1, \sigma_a = \beta^{-1} \sigma_\epsilon = \beta^{-1}\) where \(\beta\) is the same value as the discount factor in the household’s problem in the LQ savings problem in the lecture.
For these two representations, we use the code in this lecture to
compute optimal decision rules for \(c_t, b_t\) for the two types of consumers associated with our two representations of nonfinancial income
use the value function objects \(P, d\) returned by the code to compute optimal values for the two representations when evaluated at the initial condition
for each representation.
create instances of the LinearStateSpace class for the two representations of the \(\{y_t\}\) process and use them to obtain impulse response functions of \(c_t\) and \(b_t\) to the respective shocks \(\epsilon_t\) and \(a_t\) for the two representations.
run simulations of \(\{y_t, c_t, b_t\}\) of length \(T\) under both of the representations
We formulae the problem:
subject to a sequence of constraints
where \(y_t\) follows one of the representations defined above.
Define the control as \(u_t \equiv c_t - \gamma\).
(For simplicity we can assume \(\gamma=0\) below because \(\gamma\) has no effect on the impulse response functions that interest us.)
The state transition equations under our two representations for the nonfinancial income process \(\{y_t\}\) can be written as
and
As usual, we start by importing packages.
import numpy as np
import quantecon as qe
import matplotlib.pyplot as plt
# Set parameters
β, σϵ = 0.95, 1
σa = σϵ / β
R = 1 / β
# Payoff matrices are the same for two representations
RLQ = np.array([[0, 0, 0],
[0, 0, 0],
[0, 0, 1e-12]]) # put penalty on debt
QLQ = np.array([1.])
# More informative representation state transition matrices
ALQ1 = np.array([[1, -R, 0],
[0, 0, 0],
[-R, 0, R]])
BLQ1 = np.array([[0, 0, R]]).T
CLQ1 = np.array([[σϵ, σϵ, 0]]).T
# Construct and solve the LQ problem
LQ1 = qe.LQ(QLQ, RLQ, ALQ1, BLQ1, C=CLQ1, beta=β)
P1, F1, d1 = LQ1.stationary_values()
# The optimal decision rule for c
-F1
array([[ 1. , -1. , -0.05]])
Evidently, optimal consumption and debt decision rules for the consumer having news representation (5.1) are
# Innovations representation
ALQ2 = np.array([[1, -β, 0],
[0, 0, 0],
[-R, 0, R]])
BLQ2 = np.array([[0, 0, R]]).T
CLQ2 = np.array([[σa, σa, 0]]).T
LQ2 = qe.LQ(QLQ, RLQ, ALQ2, BLQ2, C=CLQ2, beta=β)
P2, F2, d2 = LQ2.stationary_values()
-F2
array([[ 1. , -0.9025, -0.05 ]])
For a consumer having access only to the information associated with the innovations representation (5.2), the optimal decision rules are
Now we construct two Linear State Space models that emerge from using optimal policies of the form \(u_t =- F x_t\).
Take the more informative original representation (5.1) as an example:
To have the Linear State Space model be of an innovations representation form (5.2), we can simply replace the corresponding matrices.
# Construct two Linear State Space models
Sb = np.array([0, 0, 1])
ABF1 = ALQ1 - BLQ1 @ F1
G1 = np.vstack([-F1, Sb])
LSS1 = qe.LinearStateSpace(ABF1, CLQ1, G1)
ABF2 = ALQ2 - BLQ2 @ F2
G2 = np.vstack([-F2, Sb])
LSS2 = qe.LinearStateSpace(ABF2, CLQ2, G2)
The following code computes impulse response functions of \(c_t\) and \(b_t\).
J = 5 # Number of coefficients that we want
x_res1, y_res1 = LSS1.impulse_response(j=J)
b_res1 = np.array([x_res1[i][2, 0] for i in range(J)])
c_res1 = np.array([y_res1[i][0, 0] for i in range(J)])
x_res2, y_res2 = LSS2.impulse_response(j=J)
b_res2 = np.array([x_res2[i][2, 0] for i in range(J)])
c_res2 = np.array([y_res2[i][0, 0] for i in range(J)])
c_res1 / σϵ, b_res1 / σϵ
(array([1.99998906e-11, 1.89473923e-11, 1.78947621e-11, 1.68421319e-11,
1.57895017e-11]),
array([ 0. , -1.05263158, -1.05263158, -1.05263158, -1.05263158]))
plt.title("more informative representation")
plt.plot(range(J), c_res1 / σϵ, label="c impulse response function")
plt.plot(range(J), b_res1 / σϵ, label="b impulse response function")
plt.legend()
<matplotlib.legend.Legend at 0x7f2a6616df10>
The above two impulse response functions show that when the consumer has the information assumed in the more informative representation (5.1), his response to receiving a positive shock of \(\epsilon_t\) is to leave his consumption unchanged and to save the entire amount of his extra income and then forever roll over the extra bonds that he holds.
To see this notice, that starting from next period on, his debt permanently decreases by \(\beta^{-1}\)
c_res2 / σa, b_res2 / σa
(array([0.0975, 0.0975, 0.0975, 0.0975, 0.0975]),
array([ 0. , -0.95, -0.95, -0.95, -0.95]))
plt.title("innovations representation")
plt.plot(range(J), c_res2 / σa, label="c impulse response function")
plt.plot(range(J), b_res2 / σa, label="b impulse response function")
plt.plot([0, J-1], [0, 0], '--', color='k')
plt.legend()
<matplotlib.legend.Legend at 0x7f2a65db3fd0>
The above impulse responses show that when the consumer has only the information that is assumed to be available under the innovations representation (5.2) for \(\{y_t - y_{t-1} \}\), he responds to a positive \(a_t\) by permanently increasing his consumption.
He accomplishes this by consuming a fraction \((1 - \beta^2)\) of the increment \(a_t\) to his nonfinancial income and saving the rest, thereby lowering \(b_{t+1}\) in order to finance the permanent increment in his consumption.
The preceding computations confirm what we had derived earlier using paper and pencil.
Now let’s simulate some paths of consumption and debt for our two types of consumers while always presenting both types with the same \(\{y_t\}\) path.
# Set time length for simulation
T = 100
x1, y1 = LSS1.simulate(ts_length=T)
plt.plot(range(T), y1[0, :], label="c")
plt.plot(range(T), x1[2, :], label="b")
plt.plot(range(T), x1[0, :], label="y")
plt.title("more informative representation")
plt.legend()
<matplotlib.legend.Legend at 0x7f2a657ff390>
x2, y2 = LSS2.simulate(ts_length=T)
plt.plot(range(T), y2[0, :], label="c")
plt.plot(range(T), x2[2, :], label="b")
plt.plot(range(T), x2[0, :], label="y")
plt.title("innovations representation")
plt.legend()
<matplotlib.legend.Legend at 0x7f2a6563af10>
5.10. Simulating Income Process and Two Associated Shock Processes#
We now form a single \(\{y_t\}_{t=0}^T\) realization that we will use to simulate decisions associated with our two types of consumer.
We accomplish this in the following steps.
We form a \(\{y_t, \epsilon_t\}\) realization by drawing a long simulation of \(\{\epsilon_t\}_{t=0}^T\), where \(T\) is a big integer, \(\epsilon_t = \sigma_\epsilon v_t\), \(v_t\) is a standard normal scalar, \(y_0 =100\), and
\[ y_{t+1} - y_t =-\beta^{-1} \epsilon_t + \epsilon_{t+1} . \]We take the \(\{y_t\}\) realization generated in step 1 and form an innovation process \(\{a_t\}\) from the formulas
\[ \begin{aligned} a_0 & = 0 \cr a_t & = \sum_{j=0}^{t-1} \beta^j (y_{t-j} - y_{t-j-1}) + \beta^t a_0, \quad t \geq 1 \end{aligned} \]We throw away the first \(S\) observations and form a sample \(\{y_t, \epsilon_t, a_t\}_{S+1}^T\) as the realization that we’ll use in the following steps.
We use the step 3 realization to evaluate and simulate the decision rules for \(c_t, b_t\) that Python has computed for us above.
The above steps implement the experiment of comparing decisions made by two consumers having identical incomes at each date but at each date having different information about their future incomes.
5.11. Calculating Innovations in Another Way#
Here we use formula (5.3) above to compute \(a_{t+1}\) as a function of the history \(\epsilon_{t+1}, \epsilon_t, \epsilon_{t-1}, \ldots\)
Thus, we compute
We can verify that we recover the same \(\{a_t\}\) sequence computed earlier.
5.12. Another Invertibility Issue#
This quantecon lecture contains another example of a shock-invertibility issue that is endemic to the LQ permanent income or consumption smoothing model.
The technical issue discussed there is ultimately the source of the shock-invertibility issues discussed by Eric Leeper, Todd Walker, and Susan Yang [Leeper et al., 2013] in their analysis of fiscal foresight.