Deep Causal Inequalities: Demand Estimation using Individual-Level Data
Bakhitov, Edvard,
Singh, Amandeep,
and Zhang, Jiding
working paper,
2021
Modern machine learning algorithms can easily deal with unstructured data, however, recent literature has demonstrated that they do not perform well in presence of endogeneity in the explanatory variables. On the other hand, extant methods catered towards addressing endogeneity issues make strong parametric assumptions and, hence, are incapable of directly incorporating high-dimensional unstructured data. In this paper, we borrow from the literature on partial identification and propose the Deep Causal Inequalities (DeepCI) estimator that overcomes both these issues. Instead of relying on observed labels, the DeepCI estimator uses inferred moment inequalities from the observed behavior of agents in the data. This allows us to take care of endogeneity by differencing out unobservable product characteristics. We provide theoretical guarantees for our estimator and prove its consistency under very mild conditions. We demonstrate through extensive Monte Carlo simulations that our estimator outperforms standard supervised machine learning algorithms and existing partial identification methods. Finally, we apply DeepCI to the differentiated products demand estimation framework. The flexibility of the method allows for highly unstructured data like images, which we exploit in the empirical application based on the consumer-level car rental data from Hertz. Using the DeepCI estimator, we show how to estimate the importance of various car design features affecting consumer rental decisions.