Machine learning the first stage in 2SLS: Practical guidance from bias decomposition and simulation

要約

機械学習（ML）は、主に「予測の問題」を解決するために進化しました。
2段階の最小二乗（2SL）の最初の段階は予測の問題であり、MLの第1段階の支援からの潜在的な利益を示唆しています。
ただし、MLが2SLS $ \ unicode {x2014} $を支援する場合、またはそれが痛い場合は、ほとんどガイダンスが存在しません。
MLを2SLに挿入することの意味を調査し、3つの有益なコンポーネントにバイアスを分解します。
機械的に、ML-in-2SLS手順は、予測および因果関係の設定に共通の問題に直面しています$ \ unicode {x2014} $およびそれらの相互作用。
シミュレーションを通じて、線形MLメソッド（例えば、ラッソ後）がうまく機能することを示しますが、非線形メソッド（ランダムフォレスト、ニューラルネットなど）は、第2段階の推定値で実質的なバイアスを生成します$ \ Unicode {x2014} $は、内生のOLのバイアスを潜在的に超えています。

要約(オリジナル)

Machine learning (ML) primarily evolved to solve ‘prediction problems.’ The first stage of two-stage least squares (2SLS) is a prediction problem, suggesting potential gains from ML first-stage assistance. However, little guidance exists on when ML helps 2SLS$\unicode{x2014}$or when it hurts. We investigate the implications of inserting ML into 2SLS, decomposing the bias into three informative components. Mechanically, ML-in-2SLS procedures face issues common to prediction and causal-inference settings$\unicode{x2014}$and their interaction. Through simulation, we show linear ML methods (e.g., post-Lasso) work well, while nonlinear methods (e.g., random forests, neural nets) generate substantial bias in second-stage estimates$\unicode{x2014}$potentially exceeding the bias of endogenous OLS.

arxiv情報

著者	Connor Lennon,Edward Rubin,Glen Waddell
発行日	2025-05-19 17:53:15+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Machine learning the first stage in 2SLS: Practical guidance from bias decomposition and simulation

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー