American Economic Association

Contaminated Control Variables in 2SLS Models

Asad Dossani, Colorado State University

Rob Schonlau, Colorado State University

Jeffrey Dotson, Brigham Young University

Abstract

Despite guidance in the theoretical literature that identification in 2SLS requires at least as many exogenous instruments as endogenous variables, many papers in empirical finance instrument only the key variable of interest but then include, as though exogenous, an assortment of control variables that may themselves be endogenous. We discuss the tradeoff between the omitted variable bias from excluding these variables and the bias created by including endogenous control variables. We propose a new diagnostic test for assessing this tradeoff in a 2SLS setting and a way to calculate the maximum possible bias in the coefficient of interest coming from the control variables. Using simulated data and an empirical example from the diversification discount literature, we show how the new test and bias calculations can help researchers better understand and troubleshoot their 2SLS models.
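The tradeoff described in this abstract can be illustrated with a small simulation. The sketch below is not the authors' diagnostic test or bias bound; it is a minimal example under an assumed data-generating process (all coefficients, variable names, and the `tsls` helper are hypothetical) in which a control `w` is correlated with both the instrument `z` and the structural error `u`. The true coefficient on `x` is 1.

```python
import numpy as np

def tsls(y, X, Z):
    """2SLS: project regressors X on instruments Z, then regress y on the fitted values."""
    first_stage, *_ = np.linalg.lstsq(Z, X, rcond=None)
    X_hat = Z @ first_stage
    beta, *_ = np.linalg.lstsq(X_hat, y, rcond=None)
    return beta

rng = np.random.default_rng(0)
n = 200_000
u, z, e_x, e_w = rng.standard_normal((4, n))

# Assumed DGP (illustrative only):
w = 0.5 * z + 0.8 * u + e_w   # control: related to the instrument AND the error
x = z + u + e_x               # endogenous regressor, instrumented by z
y = 1.0 * x + 1.0 * w + u     # true coefficient on x is 1

const = np.ones(n)

# (a) Include w as though it were exogenous (it instruments itself).
b_incl = tsls(y,
              np.column_stack([const, x, w]),
              np.column_stack([const, z, w]))[1]

# (b) Omit w entirely; it moves into the error term, which is now correlated with z.
b_omit = tsls(y,
              np.column_stack([const, x]),
              np.column_stack([const, z]))[1]

print(f"true beta = 1.0, include w: {b_incl:.3f}, omit w: {b_omit:.3f}")
```

Under these assumed parameters the population values of the two estimators work out to roughly 0.68 (control included) and 1.5 (control omitted), so neither choice recovers the true coefficient; which bias is larger depends entirely on the assumed correlations.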

Estimating Counterfactual Matrix Means with Short Panel Data

Lihua Lei, Stanford University

Brad Ross, Stanford University

Abstract

We develop a new, spectral approach for identifying and estimating average counterfactual outcomes under a low-rank factor model with short panel data and general outcome missingness patterns. Potential applications include event studies and studies of outcomes of “matches” between agents of two types, e.g. workers and firms or people and places. We show that our approach identifies all counterfactual outcome means, including those not estimable by existing methods, if a particular graph constructed based on overlaps in the sets of observed outcomes between subpopulations of units is connected. Our analogous, computationally efficient estimation procedure yields consistent, asymptotically normal estimates of counterfactual outcome means under fixed-T (number of outcomes), large-N (sample size) asymptotics. In a semi-synthetic simulation study based on matched employer-employee data, our method yields estimates of average wages with lower bias and only slightly higher variance than a Two-Way-Fixed-Effects-model-based estimator, suggesting complementarities between workers and firms do affect wages.

External Validity in an Instrumental Variable Setting

Alexander Kwon, CUNY-Graduate Center

Kyungtae Lee, CUNY-Graduate Center

Abstract

We study external validity in the context of instrumental variable estimation. The key assumption we impose for external validity is conditional external unconfoundedness among compliers, meaning that the treatment effect and target selection are independent among compliers conditional on covariates. Using a case study of the impact of solid fuel usage on women's average cooking time, we compare the local average treatment effect (LATE) for the country of interest with the predicted LATE estimated with data from other countries. While the sub-population is an important factor, it does not significantly undermine external validity in our case study. Among the six countries examined, four (Ethiopia, Honduras, Kenya, and Zambia) exhibit no statistically significant difference between predicted and actual LATE across various specifications. These results give evidence that external validity is not severely harmed in our case study. Conversely, in Cambodia and Nepal, the two LATEs are statistically different, indicating distinct sub-populations compared to those in other countries. These findings provide evidence that sub-population is a non-trivial factor for external validity.

Fuzzy Regression Discontinuity Design without Monotonicity

Yi Cui, University of North Carolina-Chapel Hill

Abstract

In this paper, we present a novel approach to deriving nonparametric sharp (i.e., the tightest possible) bounds for compliers and defiers separately under the fuzzy regression discontinuity (FRD) design, without relying on the monotonicity assumption. Our method builds on the seminal work of Imbens and Angrist (1994), Angrist et al. (1996), and Hahn et al. (2001). Unlike the existing literature, which tests the validity of the FRD design assuming monotonicity or weaker forms of it, we demonstrate the invalidity and bias of the Wald estimand without this crucial assumption.
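Why the Wald estimand can mislead without monotonicity is easiest to see in a binary-instrument analogue, where the instrument plays the role of the above-cutoff indicator. The sketch below is not the paper's bounds procedure; it is a simulation under assumed type shares and effect sizes (all values hypothetical) showing that, with defiers present, the Wald ratio equals (p_c * tau_c - p_d * tau_d) / (p_c - p_d) rather than the complier effect.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 400_000

# Assumed population (illustrative only): shares and effects by latent type.
p_c, p_d, p_a, p_n = 0.6, 0.2, 0.1, 0.1   # compliers, defiers, always-, never-takers
tau_c, tau_d = 1.0, 3.0                   # treatment effects; both are positive

types = rng.choice(4, size=n, p=[p_c, p_d, p_a, p_n])  # 0=c, 1=d, 2=a, 3=n
z = rng.integers(0, 2, size=n)                          # binary instrument

# Treatment take-up: compliers follow z, defiers do the opposite,
# always-takers are always treated, never-takers never are.
d = np.where(types == 0, z,
             np.where(types == 1, 1 - z, (types == 2).astype(int)))

# Outcome: unit-variance noise plus a type-specific effect of treatment
# (always-takers are given the complier effect; it cancels in the numerator).
tau = np.where(types == 1, tau_d, tau_c)
y = rng.standard_normal(n) + tau * d

# Wald ratio: reduced form divided by first stage.
wald = (y[z == 1].mean() - y[z == 0].mean()) / (d[z == 1].mean() - d[z == 0].mean())

# What the ratio converges to when defiers are present.
wald_theory = (p_c * tau_c - p_d * tau_d) / (p_c - p_d)

print(f"Wald estimate: {wald:.3f}, implied by formula: {wald_theory:.3f}")
```

With these assumed numbers the Wald ratio is close to zero even though every treated unit benefits, because the defiers' effect enters the numerator with a negative sign.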

Identification and Estimation of Discrete Choice Models with Spillovers Using Partial Network Data

Shuo Qi, Southern Methodist University

Abstract

Understanding peer influences is essential in an increasingly interconnected world. However, to what extent can we use data on social connections if they are incomplete? This paper investigates peer effects in discrete choice models with incomplete data on social links. Following Graham (2017), we set up an undirected dyadic link formation model where connections are based on homophily (similarities in characteristics) and individual fixed effects. We identify homophily effects through available configurations among tetrads (groups of four agents). We then identify the distribution of fixed effects through available configurations among triads (groups of three agents). After recovering the network-generating process, we propose a simulated network approach to study the influence of peers on individual decision-making. Simulations illustrate that the finite sample performance of the estimator is close to that obtained when the true network is observed. We apply our estimator to examine household microfinance participation decisions in rural India (Banerjee et al., 2013), detecting positive peer effects even in cases of missing links and missing networks.

Quantile Local Projections: Identification, Smooth Estimation, and Inference

Josef Ruzicka, Nazarbayev University

Abstract

Standard impulse response functions measure the average effect of a shock on a response variable. However, different parts of the distribution of the response variable may react to the shock differently. A popular method for capturing this heterogeneity is quantile regression local projections. We identify them by short-run restrictions or external instruments, and we establish their asymptotics. To overcome their excessive volatility, we introduce two novel smoothing estimators and propose information criteria for optimal smoothing. In the first empirical application, we show that financial conditions affect the entire distribution of GDP growth and not just its lower part. Thus, financial conditions matter not only for recessions, but also during normal times and even in recovery periods. The second application demonstrates that conventional monetary policy is more effective at curbing inflation than at generating it.

Reliable Wild Bootstrap Inference with Multiway Clustering

Jiahao Lin, SUNY-Albany

Ulrich Hounyo, SUNY-Albany

Abstract

This paper studies wild bootstrap-based inference for regression models with multiway clustering. Our proposed method is a multiway counterpart to the (one-way) wild cluster bootstrap approach introduced by Cameron et al. (2008). We establish the validity of our method for studentized statistics. Theoretical results are provided, accommodating arbitrary serial dependence in the common time effects – an aspect excluded by existing two-way bootstrap-based approaches. Simulation experiments document the potential for enhanced inference with our novel approach. We illustrate the effectiveness of the method by revisiting empirical studies involving multiway clustered and correlated data.
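For orientation, the one-way wild cluster bootstrap that this abstract takes as its starting point can be sketched as follows. This is not the multiway procedure proposed in the paper; it is a minimal one-way bootstrap-t with the null imposed and Rademacher weights, on assumed simulated data. The function names, the absence of a small-sample correction, and all data parameters are assumptions made for illustration.

```python
import numpy as np

def cluster_t(y, X, cluster_ids, k, null_value=0.0):
    """t-statistic for coefficient k using a one-way cluster-robust variance."""
    XtX_inv = np.linalg.inv(X.T @ X)
    beta = XtX_inv @ X.T @ y
    resid = y - X @ beta
    n_clusters = cluster_ids.max() + 1
    # Sum the score contributions X_i * u_i within each cluster.
    scores = np.zeros((n_clusters, X.shape[1]))
    np.add.at(scores, cluster_ids, X * resid[:, None])
    V = XtX_inv @ (scores.T @ scores) @ XtX_inv
    return (beta[k] - null_value) / np.sqrt(V[k, k])

def wild_cluster_boot_p(y, X, cluster_ids, k, n_boot=399, seed=0):
    """Bootstrap p-value for H0: beta_k = 0, resampling restricted residuals."""
    rng = np.random.default_rng(seed)
    t_obs = cluster_t(y, X, cluster_ids, k)
    # Restricted fit: drop column k, which imposes beta_k = 0.
    X_r = np.delete(X, k, axis=1)
    beta_r, *_ = np.linalg.lstsq(X_r, y, rcond=None)
    fitted_r = X_r @ beta_r
    resid_r = y - fitted_r
    n_clusters = cluster_ids.max() + 1
    t_boot = np.empty(n_boot)
    for b in range(n_boot):
        # One Rademacher sign per cluster, shared by all of its observations.
        signs = rng.choice([-1.0, 1.0], size=n_clusters)
        y_star = fitted_r + signs[cluster_ids] * resid_r
        t_boot[b] = cluster_t(y_star, X, cluster_ids, k)
    return float(np.mean(np.abs(t_boot) >= abs(t_obs)))

# Assumed example data: 30 clusters of 20 observations, with cluster-level
# components in both the regressor and the error; true slope is 1.
rng = np.random.default_rng(1)
G, m = 30, 20
cluster_ids = np.repeat(np.arange(G), m)
x = rng.standard_normal(G)[cluster_ids] + rng.standard_normal(G * m)
e = rng.standard_normal(G)[cluster_ids] + rng.standard_normal(G * m)
y = 1.0 * x + e
X = np.column_stack([np.ones(G * m), x])

p_value = wild_cluster_boot_p(y, X, cluster_ids, k=1)
print(f"wild cluster bootstrap p-value for the slope: {p_value:.3f}")
```

The sketch expects `cluster_ids` to be integers running from 0 to G-1. Clustering along a second dimension, or allowing serially dependent common time effects as the abstract describes, would require a different variance estimator and weighting scheme than the ones shown here.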

Synthetic IV Estimation in Panels

Jaume Vives, Massachusetts Institute of Technology

Ahmet Gulek, Massachusetts Institute of Technology

Abstract

We propose a Synthetic Instrumental Variables (SIV) estimator for panel data that combines the strengths of instrumental variables and synthetic controls to address unmeasured confounding. We derive conditions under which SIV is consistent and asymptotically normal, even when the standard IV estimator is not. Motivated by the finite sample properties of our estimator, we introduce an ensemble estimator that simultaneously addresses multiple sources of bias and provide a permutation-based inference procedure. We demonstrate the effectiveness of our methods through a calibrated simulation exercise, two shift-share empirical applications, and an application in digital economics that includes both observational data and data from a randomized controlled trial. In our primary empirical application, we examine the impact of the Syrian refugee crisis on Turkish labor markets. Here, the SIV estimator reveals significant effects that the standard IV does not capture. Similarly, in our digital economics application, the SIV estimator successfully recovers the experimental estimates, whereas the standard IV does not.

Testable Identification of Finite Mixture Models

Bruno de Albuquerque Furtado, Royal Holloway University of London and Oxford University

Abstract

Finite mixture models, in which observations are drawn from a combination of latent component distributions with unknown weights, have a wide array of applications in economics. It is well known that the latent parameters of such models are not always identifiable under common identification assumptions. This paper establishes sufficient conditions for non-parametric identifiability of finite mixture models, assuming that an observable covariate shifts mixture weights while leaving component distributions unchanged. Such an assumption is naturally satisfied across various domains, including latent topic models of text data, addressing misclassified categorical regressors, and analyzing Markovian regime switching models. The proposed identification conditions involve only observable mixture distributions, so they can be verified without direct knowledge of the latent parameters and, in principle, checked prior to estimating the model. Building on this, I introduce a statistical test to assess whether these conditions are met and derive its asymptotic distribution. The test employs an extremum estimator with a strictly concave objective function, which makes simulating its asymptotic distribution computationally tractable. Additionally, a straightforward transformation of the same extremum estimator yields a consistent estimator of the latent parameters.

Testing Spatial Correlation for Spatial Models with Heterogeneous Coefficients When Both n and T Are Large

Shi Ryoung Chang, Ohio State University

Robert de Jong, Ohio State University

Abstract

The widely used approach to testing spatial correlation is to formulate a hypothesis on a homogeneous spatial coefficient in spatial models. This paper proposes a novel test for spatial correlation in spatial panel data models with heterogeneous spatial autoregressive coefficients. In small reciprocal interactions, the proposed test asymptotically follows a standard normal distribution when both n and T tend to infinity jointly. The power under local alternatives is investigated. We show that the traditional test may lose power when spatial effects are heterogeneous in nature. Monte Carlo simulations demonstrate that our proposed test has higher power than the traditional one in these types of networks. We provide an empirical example to illustrate that the proposed and traditional tests can draw different conclusions on spatial correlation.

Econometric Methods

Friday, Jan. 3, 2025 10:15 AM - 12:15 PM (PST)

