33. Knowing the Forecasts of Others#
Contents
In addition to what’s in Anaconda, this lecture will need the following libraries:
!pip install upgrade quantecon
!conda install y c plotly plotly plotlyorca
Show code cell output
Requirement already satisfied: quantecon in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (0.7.2)
Requirement already satisfied: numba>=0.49.0 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from quantecon) (0.59.1)
Requirement already satisfied: numpy>=1.17.0 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from quantecon) (1.26.4)
Requirement already satisfied: requests in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from quantecon) (2.31.0)
Requirement already satisfied: scipy>=1.5.0 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from quantecon) (1.11.4)
Requirement already satisfied: sympy in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from quantecon) (1.12)
Requirement already satisfied: llvmlite<0.43,>=0.42.0dev0 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from numba>=0.49.0>quantecon) (0.42.0)
Requirement already satisfied: charsetnormalizer<4,>=2 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from requests>quantecon) (2.0.4)
Requirement already satisfied: idna<4,>=2.5 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from requests>quantecon) (3.4)
Requirement already satisfied: urllib3<3,>=1.21.1 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from requests>quantecon) (2.0.7)
Requirement already satisfied: certifi>=2017.4.17 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from requests>quantecon) (2024.2.2)
Requirement already satisfied: mpmath>=0.19 in /home/runner/miniconda3/envs/quantecon/lib/python3.11/sitepackages (from sympy>quantecon) (1.3.0)
Channels:
 plotly
 default
 defaults
Platform: linux64
Collecting package metadata (repodata.json): 
\

/

\

/

\

/

\

/

\

/

\

/

done
Solving environment: 
/

\

/

\

/

\

/

\

done
# All requested packages already installed.
33.1. Introduction#
Robert E. Lucas, Jr. [Robert E. Lucas, 1975], Kenneth Kasa [Kasa, 2000], and Robert Townsend [Townsend, 1983] showed that putting decision makers into environments in which they want to infer persistent hidden state variables from equilibrium prices and quantities can elongate and amplify impulse responses to aggregate shocks.
This provides a promising way to think about amplification mechanisms in business cycle models.
Townsend [Townsend, 1983] noted that living in such environments makes decision makers want to forecast forecasts of others.
This theme has been pursued for situations in which decision makers’ imperfect information forces them to pursue an infinite recursion that involves forming beliefs about the beliefs of others (e.g., [Allen et al., 2002]).
Lucas [Robert E. Lucas, 1975] side stepped having decision makers forecast the forecasts of other decision makers by assuming that they simply pool their information before forecasting.
A pooling equilibrium like Lucas’s plays a prominent role in this lecture.
Because he didn’t assume such pooling, [Townsend, 1983] confronted the forecasting the forecasts of others problem.
To formulate the problem recursively required that Townsend define a decision maker’s state vector.
Townsend concluded that his original model required an intractable infinite dimensional state space.
Therefore, he constructed a more manageable approximating model in which a hidden Markov component of a demand shock is revealed to all firms after a fixed, finite number of periods.
In this lecture, we illustrate again the theme that finding the state is an art by showing how to formulate Townsend’s original model in terms of a lowdimensional state space.
We show that Townsend’s model shares equilibrium prices and quantities with those that prevail in a pooling equilibrium.
That finding emerged from a line of research about Townsend’s model that built on [Pearlman et al., 1986] and that culminated in [Pearlman and Sargent, 2005] .
Rather than directly deploying the [Pearlman et al., 1986] machinery here, we shall instead implement a sneaky guessandverify tactic.
We first compute a pooling equilibrium and represent it as an instance of a linear statespace system provided by the Python class
quantecon.LinearStateSpace
.Leaving the statetransition equation for the pooling equilibrium unaltered, we alter the observation vector for a firm to match what it is in Townsend’s original model. So rather than directly observing the signal received by firms in the other industry, a firm sees the equilibrium price of the good produced by the other industry.
We compute a population linear least squares regression of the noisy signal at time \(t\) that firms in the other industry would receive in a pooling equilibrium on time \(t\) information that a firm receives in Townsend’s original model.
The \(R^2\) in this regression equals \(1\).
That verifies that a firm’s information set in Townsend’s original model equals its information set in a pooling equilibrium.
Therefore, equilibrium prices and quantities in Townsend’s original model equal those in a pooling equilibrium.
33.1.1. A Sequence of Models#
We proceed by describing a sequence of models of two industries that are linked in a single way:
shocks to the demand curves for their products have a common component.
The models are simplified versions of Townsend’s [Townsend, 1983].
Townsend’s is a model of a rational expectations equilibrium in which firms want to forecast forecasts of others.
In Townsend’s model, firms condition their forecasts on observed endogenous variables whose equilibrium laws of motion are determined by their own forecasting functions.
We shall assemble model components progressively in ways that can help us to appreciate the structure of the pooling equilibrium that ultimately interests us.
While keeping all other aspects of the model the same, we shall study consequences of alternative assumptions about what decision makers observe.
Technically, this lecture deploys concepts and tools that appear in First Look at Kalman Filter and Rational Expectations Equilibrium.
33.2. The Setting#
We cast all variables in terms of deviations from means.
Therefore, we omit constants from inverse demand curves and other functions.
Firms in industry \(i=1,2\) use a single factor of production, capital \(k_t^i\), to produce output of a single good, \(y_t^i\).
Firms bear quadratic costs of adjusting their capital stocks.
A representative firm in industry \(i\) has production function \(y_t^i = f k_t^i\), \(f >0\).
The firm acts as a price taker with respect to output price \(P_t^i\), and maximizes
Demand in industry \(i\) is described by the inverse demand curve
where \(P_t^i\) is the price of good \(i\) at \(t\), \(Y_t^i = f K_t^i\) is output in market \(i\), \(\theta_t\) is a persistent component of a demand shock that is common across the two industries, and \(\epsilon_t^i\) is an industry specific component of the demand shock that is i.i.d. and whose time \(t\) marginal distribution is \({\mathcal N}(0, \sigma_{\epsilon}^2)\).
We assume that \(\theta_t\) is governed by
where \(\{v_{t}\}\) is an i.i.d. sequence of Gaussian shocks, each with mean zero and variance \(\sigma_v^2\).
To simplify notation, we’ll study a special case by setting \(h=f=1\).
Costs of adjusting their capital stocks impart to firms an incentive to forecast the price of the good that they sell.
Throughout, we use the rational expectations equilibrium concept presented in this lecture Rational Expectations Equilibrium.
We let capital letters denote market wide objects and lower case letters denote objects chosen by a representative firm.
In each industry, a competitive equilibrium prevails.
To rationalize the big \(K\), little \(k\) connection, we can think of there being a continuum of firms in industry \(i\), with each firm being indexed by \(\omega \in [0,1]\) and \(K^i = \int_0^1 k^i(\omega) d \omega\).
In equilibrium, \(k_t^i = K_t^i\), but we must distinguish between \(k_t^i\) and \(K_t^i\) when we pose the firm’s optimization problem.
33.3. Tactics#
We shall compute equilibrium laws of motion for capital in industry \(i\) under a sequence of assumptions about what a representative firm observes.
Successive members of this sequence make a representative firm’s information more and more obscure.
We begin with the most information, then gradually withdraw information in a way that approaches and eventually reaches the Townsendlike information structure that we are ultimately interested in.
Thus, we shall compute equilibria under the following alternative information structures:
Perfect foresight: future values of \(\theta_t, \epsilon_{t}^i\) are observed in industry \(i\).
Observed history of stochastic \(\theta_t\): \(\{\theta_t, \epsilon_{t}^i\}\) are realizations from a stochastic process; current and past values of each are observed at time \(t\) but future values are not.
One noiseridden observation on \(\theta_t\): values of \(\{\theta_t, \epsilon_{t}^i\}\) separately are never observed. However, at time \(t\), a history \(w^t\) of scalar noiseridden observations on \(\theta_t\) is observed at time \(t\).
Two noiseridden observations on \(\theta_t\): values of \(\{\theta_t, \epsilon_{t}^i\}\) separately are never observed. However, at time \(t\), a history \(w^t\) of two noiseridden observations on \(\theta_t\) is observed at time \(t\).
Successive computations build one on previous ones.
We proceed by first finding an equilibrium under perfect foresight.
To compute an equilibrium with current and past but not future values of \(\theta_t\) observed, we use a certainty equivalence principle to justify modifying the perfect foresight equilibrium by replacing future values of \(\theta_s, \epsilon_{s}^i, s \geq t\) with mathematical expectations conditioned on \(\theta_t\).
This provides the equilibrium when \(\theta_t\) is observed at \(t\) but future \(\theta_{t+j}\) and \(\epsilon_{t+j}^i\) are not observed.
To find an equilibrium when a history \(w^t\) observations of a single noiseridden \(\theta_t\) is observed, we again apply a certainty equivalence principle and replace future values of the random variables \(\theta_s, \epsilon_{s}^i, s \geq t\) with their mathematical expectations conditioned on \(w^t\).
To find an equilibrium when a history \(w^t\) of two noisy signals on \(\theta_t\) is observed, we replace future values of the random variables \(\theta_s, \epsilon_{s}^i, s \geq t\) with their mathematical expectations conditioned on history \(w^t\).
We call the equilibrium with two noiseridden observations on \(\theta_t\) a pooling equilibrium.
It corresponds to an arrangement in which at the beginning of each period firms in industries \(1\) and \(2\) somehow get together and share information about current values of their noisy signals on \(\theta\).
We want ultimately to compare outcomes in a pooling equilibrium with an equilibrium under the following alternative information structure for a firm in industry \(i\) that originally interested Townsend [Townsend, 1983]:
Firm \(i\)’s noiseridden signal on \(\theta_t\) and the price in industry \(i\), a firm in industry \(i\) observes a history \(w^t\) of one noiseridden signal on \(\theta_t\) and a history of industry \(i\)’s price is observed. (Here \(i\) means ``not \(i\)’’.)
With this information structure, a representative firm \(i\) sees the price as well as the aggregate endogenous state variable \(Y_t^i\) in its own industry.
That allows it to infer the total demand shock \(\theta_t + \epsilon_{t}^i\).
However, at time \(t\), the firm sees only \(P_t^{i}\) and does not see \(Y_t^{i}\), so that a firm in industry \(i\) does not directly observe \(\theta_t + \epsilon_t^{i}\).
Nevertheless, it will turn out that equilibrium prices and quantities in this equilibrium equal their counterparts in a pooling equilibrium because firms in industry \(i\) are able to infer the noisy signal about the demand shock received by firms in industry \(i\).
We shall verify this assertion by using a guess and verify tactic that involves running a least squares regression and inspecting its \(R^2\). 1
33.4. Equilibrium Conditions#
It is convenient to solve a firm’s problem without uncertainty by forming the Lagrangian:
where \(\{\phi_t^i\}\) is a sequence of Lagrange multipliers on the transition law \(k_{t+1}^i = k_{t}^i + \mu_t^i\).
First order conditions for the nonstochastic problem are
Substituting the demand function (33.2) for \(P_t^i\), imposing the condition \(k_t^i = K_t^i\) that makes representative firm be representative, and using definition (33.6) of \(g_t^i\), the Euler equation (33.4) lagged by one period can be expressed as \( b k_t^i + \theta_t + \epsilon_t^i + (k_{t+1}^i  k_t^i)  g_t^i =0\) or
where we define \(g_t^i\) by
We can write Euler equation (33.4) as:
In addition, we have the law of motion for \(\theta_t\), (33.3), and the demand equation (33.2).
In summary, with perfect foresight, equilibrium conditions for industry \(i\) comprise the following system of difference equations:
Without perfect foresight, the same system prevails except that the following equation replaces the third equation of (33.8):
where \(x_{t+1,t}\) denotes the mathematical expectation of \(x_{t+1}\) conditional on information at time \(t\).
33.4.1. Equilibrium under perfect foresight#
Our first step is to compute the equilibrium law of motion for \(k_t^i\) under perfect foresight.
Let \(L\) be the lag operator. 2
Equations (33.7) and (33.5) imply the second order difference equation in \(k_t^i\): 3
Factor the polynomial in \(L\) on the left side as:
where \(\tilde \lambda  < 1\) is the smaller root and \(\lambda\) is the larger root of \((\lambda1)(\lambda1/\beta)=b\lambda\).
Therefore, (33.9) can be expressed as
Solving the stable root backwards and the unstable root forwards gives
Recall that we have already set \(k^i = K^i\) at the appropriate point in the argument, namely, after having derived the firstorder necessary conditions for a representative firm in industry \(i\).
Thus, under perfect foresight the equilibrium capital stock in industry \(i\) satisfies
Next, we shall investigate consequences of replacing future values of \((\epsilon_{t+j}^i + \theta_{t+j})\) in equation (33.10) with alternative forecasting schemes.
In particular, we shall compute equilibrium laws of motion for capital under alternative assumptions about information available to firms in market \(i\).
33.5. Equilibrium with \(\theta_t\) stochastic but observed at \(t\)#
If future \(\theta\)’s are unknown at \(t\), it is appropriate to replace all random variables on the right side of (33.10) with their conditional expectations based on the information available to decision makers in market \(i\).
For now, we assume that this information set is \(I_t^p = \begin{bmatrix} \theta^t & \epsilon^{it} \end{bmatrix}\), where \(z^t\) represents the semiinfinite history of variable \(z_s\) up to time \(t\).
Later we shall give firms less information.
To obtain an appropriate counterpart to (33.10) under our current assumption about information, we apply a certainty equivalence principle.
In particular, it is appropriate to take (33.10) and replace each term \(( \epsilon_{t+j}^i+ \theta_{t+j} )\) on the right side with \(E[ (\epsilon_{t+j}^i+ \theta_{t+j}) \vert \theta^t ]\).
After using (33.3) and the i.i.d. assumption about \(\{\epsilon_t^i\}\), this gives
or
where \(\lambda \equiv (\beta \tilde \lambda)^{1}\).
For our purposes, it is convenient to represent the equilibrium \(\{k_t^i\}_t\) process recursively as
33.5.1. Filtering#
33.5.1.1. One noisy signal#
We get closer to the original Townsend model that interests us by now assuming that firms in market \(i\) do not observe \(\theta_t\).
Instead they observe a history \(w^t\) of noisy signals at time \(t\).
In particular, assume that
where \(e_t\) and \(v_t\) are mutually independent i.i.d. Gaussian shock processes with means of zero and variances \(\sigma_e^2\) and \(\sigma_v^2\), respectively.
Define
where \(w^t = [w_t, w_{t1}, \ldots, w_0]\) denotes the history of the \(w_s\) process up to and including \(t\).
Associated with the statespace representation (33.13) is the timeinvariant innovations representation
where \(a_t \equiv w_t  E(w_t  w^{t1})\) is the innovations process in \(w_t\) and the Kalman gain \(\kappa\) is
and where \(p\) satisfies the Riccati equation
33.5.1.2. Statereconstruction error#
Define the state reconstruction error \(\tilde \theta_t\) by
Then \(p = E \tilde \theta_t^2\).
Equations (33.13) and (33.14) imply
Notice that we can express \(\hat \theta_{t+1}\) as
where the first term in braces equals \(\theta_{t+1}\) and the second term in braces equals \(\tilde \theta_{t+1}\).
We can express (33.11) as
An application of a certainty equivalence principle asserts that when only \(w^t\) is observed, a corresponding equilibrium \(\{k_t^i\}\) process can be found by replacing the information set \(\theta^t\) with \(w^t\) in (33.19).
Making this substitution and using (33.18) leads to
Simplifying equation (33.18), we also have
Equations (33.20), (33.21) describe an equilibrium when \(w^t\) is observed.
33.5.2. A new state variable#
Relative to (33.11), the equilibrium acquires a new state variable, namely, the \(\theta\)–reconstruction error, \(\tilde \theta_t\).
For a subsequent argument, by using (33.15), it is convenient to write (33.20) as
In summary, when decision makers in market \(i\) observe a semiinfinite history \(w^t\) of noisy signals \(w_t\) on \(\theta_t\) at \(t\), we an equilibrium law of motion for \(k_t^i\) can be represented as
33.5.3. Two Noisy Signals#
We now construct a pooling equilibrium by assuming that at time \(t\) a firm in industry \(i\) receives a vector \(w_t\) of two noisy signals on \(\theta_t\):
To justify that we are constructing is a pooling equilibrium we can assume that
so that a firm in industry \(i\) observes the noisy signals on that \(\theta_t\) presented to firms in both industries \(i\) and \(i\).
The pertinent innovations representation now becomes
where \(a_t \equiv w_t  E [w_t  w^{t1}]\) is a \((2 \times 1)\) vector of innovations in \(w_t\) and \(\kappa\) is now a \((1 \times 2)\) vector of Kalman gains.
Formulas for the Kalman filter imply that
where \(p = E \tilde \theta_t \tilde \theta_t^T\) now satisfies the Riccati equation
Thus, when a representative firm in industry \(i\) observes two noisy signals on \(\theta_t\), we can express the equilibrium law of motion for capital recursively as
Below, by using a guessandverify tactic, we shall show that outcomes in this pooling equilibrium equal those in an equilibrium under the alternative information structure that interested Townsend [Townsend, 1983] but that originally seemed too challenging to compute. 4
33.6. GuessandVerify Tactic#
As a preliminary step we shall take our recursive representation (33.23) of an equilibrium in industry \(i\) with one noisy signal on \(\theta_t\) and perform the following steps:
Compute \(\lambda\) and \(\tilde{\lambda}\) by posing a rootfinding problem and solving it with
numpy.roots
Compute \(p\) by forming the appropriate discrete Riccati equation and then solving it using
quantecon.solve_discrete_riccati
Add a measurement equation for \(P_t^i = b k_t^i + \theta_t + e_t\), \(\theta_t + e_t\), and \(e_t\) to system (33.23).
Write the resulting system in statespace form and encode it using
quantecon.LinearStateSpace
Use methods of the
quantecon.LinearStateSpace
to compute impulse response functions of \(k_t^i\) with respect to shocks \(v_t, e_t\).
After analyzing the onenoisysignal structure in this way, by making appropriate modifications we shall analyze the twonoisysignal structure.
We proceed to analyze first the onenoisysignal structure and then the twonoisysignal structure.
33.7. Equilibrium with One Noisy Signal on \(\theta_t\)#
33.7.1. Step 1: Solve for \(\tilde{\lambda}\) and \(\lambda\)#
Cast \(\left(\lambda1\right)\left(\lambda\frac{1}{\beta}\right)=b\lambda\) as \(p\left(\lambda\right)=0\) where \(p\) is a polynomial function of \(\lambda\).
Use
numpy.roots
to solve for the roots of \(p\)Verify \(\lambda \approx \frac{1}{\beta\tilde{\lambda}}\)
Note that \(p\left(\lambda\right)=\lambda^{2}\left(1+b+\frac{1}{\beta}\right)\lambda+\frac{1}{\beta}\).
33.7.2. Step 2: Solve for \(p\)#
Cast \(p=\sigma_{v}^{2}+\frac{p\rho^{2}\sigma_{e}^{2}}{2p+\sigma_{e}^{2}}\) as a discrete matrix Riccati equation.
Use
quantecon.solve_discrete_riccati
to solve for \(p\)Verify \(p \approx\sigma_{v}^{2}+\frac{p\rho^{2}\sigma_{e}^{2}}{2p+\sigma_{e}^{2}}\)
Note that:
33.7.3. Step 3: Represent the system using quantecon.LinearStateSpace
#
We use the following representation for constructing the
quantecon.LinearStateSpace
instance.
This representation includes extraneous variables such as \(P_{t}\) in the state vector.
We formulate things in this way because
it allows us easily to compute covariances of these variables with other
components of the state vector (step 5 above) by using the stationary_distributions
method of the LinearStateSpace
class.
import numpy as np
import quantecon as qe
import plotly.graph_objects as go
import plotly.offline as pyo
from statsmodels.regression.linear_model import OLS
from IPython.display import display, Latex, Image
pyo.init_notebook_mode(connected=True)
β = 0.9 # Discount factor
ρ = 0.8 # Persistence parameter for the hidden state
b = 1.5 # Demand curve parameter
σ_v = 0.5 # Standard deviation of shock to θ_t
σ_e = 0.6 # Standard deviation of shocks to w_t
# Compute λ
poly = np.array([1, (1 + β + b) / β, 1 / β])
roots_poly = np.roots(poly)
λ_tilde = roots_poly.min()
λ = roots_poly.max()
# Verify that λ = (βλ_tilde) ^ (1)
tol = 1e12
np.max(np.abs(λ  1 / (β * λ_tilde))) < tol
True
A_ricc = np.array([[ρ]])
B_ricc = np.array([[1.]])
R_ricc = np.array([[σ_e ** 2]])
Q_ricc = np.array([[σ_v ** 2]])
N_ricc = np.zeros((1, 1))
p = qe.solve_discrete_riccati(A_ricc, B_ricc, Q_ricc, R_ricc, N_ricc).item()
p_one = p # Save for comparison later
# Verify that p = σ_v ^ 2 + p * ρ ^ 2  (ρ * p) ^ 2 / (p + σ_e ** 2)
tol = 1e12
np.abs(p  (σ_v ** 2 + p * ρ ** 2  (ρ * p) ** 2 / (p + σ_e ** 2))) < tol
True
κ = ρ * p / (p + σ_e ** 2)
κ_prod = κ * σ_e ** 2 / p
κ_one = κ # Save for comparison later
A_lss = np.array([[0., 0., 0., 0., 0., 0.],
[κ / (λ  ρ), λ_tilde, κ_prod / (λ  ρ), 0., ρ / (λ  ρ), 0.],
[κ, 0., κ_prod, 0., 0., 1.],
[b * κ / (λ  ρ) , b * λ_tilde, b * κ_prod / (λ  ρ), 0., b * ρ / (λ  ρ) + ρ, 1.],
[0., 0., 0., 0., ρ, 1.],
[0., 0., 0., 0., 0., 0.]])
C_lss = np.array([[σ_e, 0.],
[0., 0.],
[0., 0.],
[σ_e, 0.],
[0., 0.],
[0., σ_v]])
G_lss = np.array([[0., 0., 0., 1., 0., 0.],
[1., 0., 0., 0., 1., 0.],
[1., 0., 0., 0., 0., 0.]])
mu_0 = np.array([0., 0., 0., 0., 0., 0.])
lss = qe.LinearStateSpace(A_lss, C_lss, G_lss, mu_0=mu_0)
ts_length = 100_000
x, y = lss.simulate(ts_length, random_state=1)
# Verify that two ways of computing P_t match
np.max(np.abs(np.array([[1., b, 0., 0., 1., 0.]]) @ x  x[3])) < 1e12
True
33.7.4. Step 4: Compute impulse response functions#
To compute impulse response functions of \(k_t^i\), we use the impulse_response
method of the
quantecon.LinearStateSpace
class and plot outcomes.
xcoef, ycoef = lss.impulse_response(j=21)
data = np.array([xcoef])[0, :, 1, :]
fig = go.Figure(data=go.Scatter(y=data[:1, 0], name=r'$e_{t+1}$'))
fig.add_trace(go.Scatter(y=data[1:, 1], name=r'$v_{t+1}$'))
fig.update_layout(title=r'Impulse Response Function',
xaxis_title='Time',
yaxis_title=r'$k^{i}_{t}$')
fig1 = fig
# Export to PNG file
Image(fig1.to_image(format="png"))
# fig1.show() will provide interactive plot when running
# notebook locally
33.7.5. Step 5: Compute stationary covariance matrices and population regressions#
We compute stationary covariance matrices by
calling the stationary_distributions
method of
the quantecon.LinearStateSpace
class.
By appropriately decomposing the covariance matrix of the state vector, we obtain ingredients of pertinent population regression coefficients.
Define
where \(\Sigma_{11}\) is the covariance matrix of dependent variables and \(\Sigma_{22}\) is the covariance matrix of independent variables.
Regression coefficients are \(\beta=\Sigma_{21}\Sigma_{22}^{1}\).
To verify an instance of a law of large numbers computation, we construct a long simulation of the state vector and for the resulting sample compute the ordinary leastsquares estimator of \(\beta\) that we shall compare with corresponding population regression coefficients.
_, _, Σ_x, Σ_y, Σ_yx = lss.stationary_distributions()
Σ_11 = Σ_x[0, 0]
Σ_12 = Σ_x[0, 1:4]
Σ_21 = Σ_x[1:4, 0]
Σ_22 = Σ_x[1:4, 1:4]
reg_coeffs = Σ_12 @ np.linalg.inv(Σ_22)
print('Regression coefficients (e_t on k_t, P_t, \\tilde{\\theta_t})')
print('')
print(r'k_t:', reg_coeffs[0])
print(r'\tilde{\theta_t}:', reg_coeffs[1])
print(r'P_t:', reg_coeffs[2])
Regression coefficients (e_t on k_t, P_t, \tilde{\theta_t})

k_t: 3.275556845219769
\tilde{\theta_t}: 0.9649461170475457
P_t: 0.9649461170475457
# Compute R squared
R_squared = reg_coeffs @ Σ_x[1:4, 1:4] @ reg_coeffs / Σ_x[0, 0]
R_squared
0.9649461170475461
# Verify that the computed coefficients are close to least squares estimates
model = OLS(x[0], x[1:4].T)
reg_res = model.fit()
np.max(np.abs(reg_coeffs  reg_res.params)) < 1e2
True
# Verify that R_squared matches least squares estimate
np.abs(reg_res.rsquared  R_squared) < 1e2
True
# Verify that θ_t + e_t can be recovered
model = OLS(y[1], x[1:4].T)
reg_res = model.fit()
np.abs(reg_res.rsquared  1.) < 1e6
True
33.8. Equilibrium with Two Noisy Signals on \(\theta_t\)#
Steps 1, 4, and 5 are identical to those for the onenoisysignal structure.
Step 2 requires a straightforward modification.
For step 3, we construct the following statespace representation so that we can get our hands on all of the random processes that we require in order to compute a regression of the noisy signal about \(\theta\) from the other industry that a firm receives directly in a pooling equilibrium against information that a firm would receive in Townsend’s original model.
For this purpose, we include equilibrium goods prices from both industries in the state vector:
A_ricc = np.array([[ρ]])
B_ricc = np.array([[np.sqrt(2)]])
R_ricc = np.array([[σ_e ** 2]])
Q_ricc = np.array([[σ_v ** 2]])
N_ricc = np.zeros((1, 1))
p = qe.solve_discrete_riccati(A_ricc, B_ricc, Q_ricc, R_ricc, N_ricc).item()
p_two = p # Save for comparison later
# Verify that p = σ_v^2 + (pρ^2σ_e^2) / (2p + σ_e^2)
tol = 1e12
np.abs(p  (σ_v ** 2 + p * ρ ** 2 * σ_e ** 2 / (2 * p + σ_e ** 2))) < tol
True
κ = ρ * p / (2 * p + σ_e ** 2)
κ_prod = κ * σ_e ** 2 / p
κ_two = κ # Save for comparison later
A_lss = np.array([[0., 0., 0., 0., 0., 0., 0., 0.],
[0., 0., 0., 0., 0., 0., 0., 0.],
[κ / (λ  ρ), κ / (λ  ρ), λ_tilde, κ_prod / (λ  ρ), 0., 0., ρ / (λ  ρ), 0.],
[κ, κ, 0., κ_prod, 0., 0., 0., 1.],
[b * κ / (λ  ρ), b * κ / (λ  ρ), b * λ_tilde, b * κ_prod / (λ  ρ), 0., 0., b * ρ / (λ  ρ) + ρ, 1.],
[b * κ / (λ  ρ), b * κ / (λ  ρ), b * λ_tilde, b * κ_prod / (λ  ρ), 0., 0., b * ρ / (λ  ρ) + ρ, 1.],
[0., 0., 0., 0., 0., 0., ρ, 1.],
[0., 0., 0., 0., 0., 0., 0., 0.]])
C_lss = np.array([[σ_e, 0., 0.],
[0., σ_e, 0.],
[0., 0., 0.],
[0., 0., 0.],
[σ_e, 0., 0.],
[0., σ_e, 0.],
[0., 0., 0.],
[0., 0., σ_v]])
G_lss = np.array([[0., 0., 0., 0., 1., 0., 0., 0.],
[0., 0, 0, 0., 0., 1., 0., 0.],
[1., 0., 0., 0., 0., 0., 1., 0.],
[0., 1., 0., 0., 0., 0., 1., 0.],
[1., 0., 0., 0., 0., 0., 0., 0.],
[0., 1., 0., 0., 0., 0., 0., 0.]])
mu_0 = np.array([0., 0., 0., 0., 0., 0., 0., 0.])
lss = qe.LinearStateSpace(A_lss, C_lss, G_lss, mu_0=mu_0)
ts_length = 100_000
x, y = lss.simulate(ts_length, random_state=1)
xcoef, ycoef = lss.impulse_response(j=20)
data = np.array([xcoef])[0, :, 2, :]
fig = go.Figure(data=go.Scatter(y=data[:1, 0], name=r'$e_{1,t+1}$'))
fig.add_trace(go.Scatter(y=data[:1, 1], name=r'$e_{2,t+1}$'))
fig.add_trace(go.Scatter(y=data[1:, 2], name=r'$v_{t+1}$'))
fig.update_layout(title=r'Impulse Response Function',
xaxis_title='Time',
yaxis_title=r'$k^{i}_{t}$')
fig2=fig
# Export to PNG file
Image(fig2.to_image(format="png"))
# fig2.show() will provide interactive plot when running
# notebook locally
_, _, Σ_x, Σ_y, Σ_yx = lss.stationary_distributions()
Σ_11 = Σ_x[1, 1]
Σ_12 = Σ_x[1, 2:5]
Σ_21 = Σ_x[2:5, 1]
Σ_22 = Σ_x[2:5, 2:5]
reg_coeffs = Σ_12 @ np.linalg.inv(Σ_22)
print('Regression coefficients (e_{2,t} on k_t, P^{1}_t, \\tilde{\\theta_t})')
print('')
print(r'k_t:', reg_coeffs[0])
print(r'\tilde{\theta_t}:', reg_coeffs[1])
print(r'P_t:', reg_coeffs[2])
Regression coefficients (e_{2,t} on k_t, P^{1}_t, \tilde{\theta_t})

k_t: 0.0
\tilde{\theta_t}: 0.0
P_t: 0.0
# Compute R squared
R_squared = reg_coeffs @ Σ_x[2:5, 2:5] @ reg_coeffs / Σ_x[1, 1]
R_squared
0.0
# Verify that the computed coefficients are close to least squares estimates
model = OLS(x[1], x[2:5].T)
reg_res = model.fit()
np.max(np.abs(reg_coeffs  reg_res.params)) < 1e2
True
# Verify that R_squared matches least squares estimate
np.abs(reg_res.rsquared  R_squared) < 1e2
True
_, _, Σ_x, Σ_y, Σ_yx = lss.stationary_distributions()
Σ_11 = Σ_x[1, 1]
Σ_12 = Σ_x[1, 2:6]
Σ_21 = Σ_x[2:6, 1]
Σ_22 = Σ_x[2:6, 2:6]
reg_coeffs = Σ_12 @ np.linalg.inv(Σ_22)
print('Regression coefficients (e_{2,t} on k_t, P^{1}_t, P^{2}_t, \\tilde{\\theta_t})')
print('')
print(r'k_t:', reg_coeffs[0])
print(r'\tilde{\theta_t}:', reg_coeffs[1])
print(r'P^{1}_t:', reg_coeffs[2])
print(r'P^{2}_t:', reg_coeffs[3])
Regression coefficients (e_{2,t} on k_t, P^{1}_t, P^{2}_t, \tilde{\theta_t})

k_t: 3.1373589171035627
\tilde{\theta_t}: 0.9242343967443672
P^{1}_t: 0.037882801627816154
P^{2}_t: 0.9621171983721835
# Compute R squared
R_squared = reg_coeffs @ Σ_x[2:6, 2:6] @ reg_coeffs / Σ_x[1, 1]
R_squared
0.9621171983721837
33.9. Key Step#
Now we come to the key step for verifying that equilibrium outcomes for prices and quantities are identical in the pooling equilibrium original model that led Townsend to deduce an infinitedimensional state space.
We accomplish this by computing a population linear least squares regression of the noisy signal that firms in the other industry receive in a pooling equilibrium on time \(t\) information that a firm would receive in Townsend’s original model.
Let’s compute the regression and stare at the \(R^2\):
# Verify that θ_t + e^{2}_t can be recovered
# θ_t + e^{2}_t on k^{i}_t, P^{1}_t, P^{2}_t, \\tilde{\\theta_t}
model = OLS(y[1], x[2:6].T)
reg_res = model.fit()
np.abs(reg_res.rsquared  1.) < 1e6
True
reg_res.rsquared
1.0
The \(R^2\) in this regression equals \(1\).
That verifies that a firm’s information set in Townsend’s original model equals its information set in a pooling equilibrium.
Therefore, equilibrium prices and quantities in Townsend’s original model equal those in a pooling equilibrium.
33.10. An observed common shock benchmark#
For purposes of comparison, it is useful to construct a model in which demand disturbance in both industries still both share have a common persistent component \(\theta_t\), but in which the persistent component \(\theta\) is observed each period.
In this case, firms share the same information immediately and have no need to deploy signalextraction techniques.
Thus, consider a version of our model in which histories of both \(\epsilon_t^i\) and \(\theta_t\) are observed by a representative firm.
In this case, the firm’s optimal decision rule is described by
where \(\hat \theta_{t+1} = E_t \theta_{t+1}\) is given by
Thus, the firm’s decision rule can be expressed
Consequently, when a history \(\theta_s, s \leq t\) is observed without noise, the following state space system prevails:
where \(z_{t,t+1} \) is a scalar iid standardized Gaussian process.
As usual, the system can be written as
In order once again to use the quantecon class quantecon.LinearStateSpace
, let’s form pertinent statespace matrices
Ao_lss = np.array([[ρ, 0.],
[ρ / (λ  ρ), λ_tilde]])
Co_lss = np.array([[σ_v], [0.]])
Go_lss = np.identity(2)
muo_0 = np.array([0., 0.])
lsso = qe.LinearStateSpace(Ao_lss, Co_lss, Go_lss, mu_0=muo_0)
Now let’s form and plot an impulse response function of \(k_t^i\) to shocks \(v_t\) to \(\theta_{t+1}\)
xcoef, ycoef = lsso.impulse_response(j=21)
data = np.array([ycoef])[0, :, 1, :]
fig = go.Figure(data=go.Scatter(y=data[:1, 0], name=r'$z_{t+1}$'))
fig.update_layout(title=r'Impulse Response Function',
xaxis_title= r'lag $j$',
yaxis_title=r'$k^{i}_{t}$')
fig3 = fig
# Export to PNG file
Image(fig3.to_image(format="png"))
# fig1.show() will provide interactive plot when running
# notebook locally
33.11. Comparison of All Signal Structures#
It is enlightening side by side to plot impulse response functions for capital for the two noisysignal information structures and the noiseless signal on \(\theta\) that we have just presented.
Please remember that the twosignal structure corresponds to the pooling equilibrium and also Townsend’s original model.
fig_comb = go.Figure(data=[
*fig1.data,
*fig2.update_traces(xaxis='x2', yaxis='y2').data,
*fig3.update_traces(xaxis='x3', yaxis='y3').data
]).set_subplots(1, 3,
subplot_titles=("One noisysignal",
"Two noisysignal",
"No Noise"),
horizontal_spacing=0.02,
shared_yaxes=True)
# Export to PNG file
Image(fig_comb.to_image(format="png"))
# fig_comb.show() # will provide interactive plot when running
# notebook locally
The three panels in the graph above show that
responses of \( k_t^i \) to shocks \( v_t \) to the hidden Markov demand state \( \theta_t \) process are largest in the nonoisysignal structure in which the firm observes \(\theta_t\) at time \(t\)
responses of \( k_t^i \) to shocks \( v_t \) to the hidden Markov demand state \( \theta_t \) process are smaller in the twonoisysignal structure
responses of \( k_t^i \) to shocks \( v_t \) to the hidden Markov demand state \( \theta_t \) process are smallest in the onenoisysignal structure
With respect to the iid demand shocks \(e_t\) the graphs show that
responses of \( k_t^i \) to shocks \( e_t \) to the hidden Markov demand state \( \theta_t \) process are smallest (i.e., nonexistent) in the nonoisysignal structure in which the firm observes \(\theta_t\) at time \(t\)
responses of \( k_t^i \) to shocks \( e_t \) to the hidden Markov demand state \( \theta_t \) process are larger in the twonoisysignal structure
responses of \( k_t^i \) to idiosyncratic ownmarket noiseshocks \( e_t \) are largest in the onenoisysignal structure
Among other things, these findings indicate that time series correlations and coherences between outputs in the two industries are higher in the twonoisysignals or pooling model than they are in the onenoisy signal model.
The enhanced influence of the shocks \( v_t \) to the hidden Markov demand state \( \theta_t \) process that emerges from the twonoisysignal model relative to the onenoisysignal model is a symptom of a lower equilibrium hiddenstate reconstruction error variance in the twosignal model:
display(Latex('$\\textbf{Reconstruction error variances}$'))
display(Latex(f'Onenoise structure: {round(p_one, 6)}'))
display(Latex(f'Twonoise structure: {round(p_two, 6)}'))
Kalman gains for the two structures are
display(Latex('$\\textbf{Kalman Gains}$'))
display(Latex(f'One noisysignal structure: {round(κ_one, 6)}'))
display(Latex(f'Two noisysignals structure: {round(κ_two, 6)}'))
Another lesson that comes from the preceding threepanel graph is that the presence of iid noise \(\epsilon_t^i\) in industry \(i\) generates a response in \(k_t^{i}\) in the twonoisysignal structure, but not in the onenoisysignal structure.
33.12. Notes on History of the Problem#
To truncate what he saw as an intractable, infinite dimensional state space, Townsend constructed an approximating model in which the common hidden Markov demand shock is revealed to all firms after a fixed number of periods.
Thus,
Townsend wanted to assume that at time \( t \) firms in industry \( i \) observe \( k_t^i, Y_t^i, P_t^i, (P^{i})^t \), where \( (P^{i})^t \) is the history of prices in the other market up to time \( t \).
Because that turned out to be too challenging, Townsend made a sensible alternative assumption that eased his calculations: that after a large number \( S \) of periods, firms in industry \( i \) observe the hidden Markov component of the demand shock \( \theta_{tS} \).
Townsend argued that the more manageable model could do a good job of approximating the intractable model in which the Markov component of the demand shock remains unobserved for ever.
By applying technical machinery of [Pearlman et al., 1986], [Pearlman and Sargent, 2005] showed that there is a recursive representation of the equilibrium of the perpetually and symmetrically uninformed model that Townsend wanted to solve [Townsend, 1983].
A reader of [Pearlman and Sargent, 2005] will notice that their representation of the equilibrium of Townsend’s model exactly matches that of the pooling equilibrium presented here.
We have structured our notation in this lecture to faciliate comparison of the pooling equilibrium constructed here with the equilibrium of Townsend’s model reported in [Pearlman and Sargent, 2005].
The computational method of [Pearlman and Sargent, 2005] is recursive: it enlists the Kalman filter and invariant subspace methods for solving systems of Euler equations 5 .
As [Singleton, 1987], [Kasa, 2000], and [Sargent, 1991] also found, the equilibrium is fully revealing: observed prices tell participants in industry \( i \) all of the information held by participants in market \( i \) (\( i \) means not \( i \)).
This means that higherorder beliefs play no role: observing equilibrium prices in effect lets decision makers pool their information sets 6 .
The disappearance of higher order beliefs means that decision makers in this model do not really face a problem of forecasting the forecasts of others.
Because those forecasts are the same as their own, they know them.
33.12.1. Further historical remarks#
Sargent [Sargent, 1991] proposed a way to compute an equilibrium without making Townsend’s approximation.
Extending the reasoning of [Muth, 1960], Sargent noticed that it is possible to summarize the relevant history with a low dimensional object, namely, a small number of current and lagged forecasting errors.
Positing an equilibrium in a space of perceived laws of motion for endogenous variables that takes the form of a vector autoregressive, moving average, Sargent described an equilibrium as a fixed point of a mapping from the perceived law of motion to the actual law of motion of that form.
Sargent worked in the time domain and proceeded to guess and verify the appropriate orders of the autoregressive and moving average pieces of the equilibrium representation.
By working in the frequency domain [Kasa, 2000] showed how to discover the appropriate orders of the autoregressive and moving average parts, and also how to compute an equilibrium.
The [Pearlman and Sargent, 2005] recursive computational method, which stays in the time domain, also discovered appropriate orders of the autoregressive and moving average pieces.
In addition, by displaying equilibrium representations in the form of [Pearlman et al., 1986], [Pearlman and Sargent, 2005] showed how the moving average piece is linked to the innovation process of the hidden persistent component of the demand shock.
That scalar innovation process is the additional state variable contributed by the problem of extracting a signal from equilibrium prices that decision makers face in Townsend’s model.
 1
[Pearlman and Sargent, 2005] verified this assertion using a different tactic, namely, by constructing analytic formulas for an equilibrium under the incomplete information structure and confirming that they match the pooling equilibrium formulas derived here.
 2
See [Sargent, 1987], especially chapters IX and XIV, for principles that guide solving some roots backwards and others forwards.
 3
As noted by [Sargent, 1987], this difference equation is the Euler equation for a planning problem that maximizes the discounted sum of consumer plus producer surplus.
 4
[Pearlman and Sargent, 2005] verify the same claim by applying machinery of [Pearlman et al., 1986].
 5
See [Anderson et al., 1996] for an account of invariant subspace methods.
 6
See [Allen et al., 2002] for a discussion of information assumptions needed to create a situation in which higher order beliefs appear in equilibrium decision rules. A way to read our findings in light of [Allen et al., 2002] is that, relative to the number of signals agents observe, Townsend’s section 8 model has too few random shocks to get higher order beliefs to play a role.