Addressing Dynamic and Sparse Qualitative Data: A Hilbert Space Embedding of Categorical Variables

08/22/2023
by   Anirban Mukherjee, et al.

We propose a novel framework for incorporating qualitative data into quantitative models for causal estimation. Previous methods use categorical variables derived from qualitative data to build quantitative models. However, this approach can lead to data-sparse categories and yield inconsistent (asymptotically biased) and imprecise (finite sample biased) estimates if the qualitative information is dynamic and intricate. We use functional analysis to create a more nuanced and flexible framework. We embed the observed categories into a latent Baire space and introduce a continuous linear map – a Hilbert space embedding – from the Baire space of categories to a Reproducing Kernel Hilbert Space (RKHS) of representation functions. Through the Riesz representation theorem, we establish that the canonical treatment of categorical variables in causal models can be transformed into an identified structure in the RKHS. Transfer learning acts as a catalyst to streamline estimation – embeddings from traditional models are paired with the kernel trick to form the Hilbert space embedding. We validate our model through comprehensive simulation evidence and demonstrate its relevance in a real-world study that contrasts theoretical predictions from economics and psychology in an e-commerce marketplace. The results confirm the superior performance of our model, particularly in scenarios where qualitative information is nuanced and complex.
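
As a rough illustration of the idea of pairing pre-trained embeddings with the kernel trick, the following minimal sketch fits a kernel ridge regression over category embeddings and predicts an effect for categories never seen in training. It is not the authors' estimator: the embeddings are random stand-ins for embeddings taken from a traditional model, and the RBF kernel, the `gamma` and `lam` values, the simulated outcome, and the function names `rbf_kernel` and `fit_kernel_ridge` are all assumptions made for this example.

```python
import numpy as np


def rbf_kernel(A, B, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq_dist = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))  # clip tiny negative round-off


def fit_kernel_ridge(K, y, lam=1e-2):
    """Solve (K + lam * I) alpha = y for the dual coefficients alpha."""
    return np.linalg.solve(K + lam * np.eye(K.shape[0]), y)


rng = np.random.default_rng(0)

# Hypothetical setup: 50 categories, each with an 8-dimensional embedding.
# In practice these would come from a pre-trained model (assumption).
n_categories, dim = 50, 8
embeddings = rng.normal(size=(n_categories, dim))

# Simulated "true" category effect, a smooth function of the embedding.
true_effect = np.tanh(embeddings @ rng.normal(size=dim))

# Sparse data: categories 0..39 are observed once each; 40..49 are never observed.
train_cat = np.arange(40)
y = true_effect[train_cat] + 0.1 * rng.normal(size=train_cat.size)

# Fit in the RKHS induced by the kernel on embeddings.
K_train = rbf_kernel(embeddings[train_cat], embeddings[train_cat], gamma=0.05)
alpha = fit_kernel_ridge(K_train, y)

# A dummy-variable model has no estimate for an unseen category; the kernel
# model borrows strength from categories that are nearby in embedding space.
K_new = rbf_kernel(embeddings[40:], embeddings[train_cat], gamma=0.05)
pred_unseen = K_new @ alpha
print(pred_unseen.shape)
```

The design point is that the effect of a category is represented as a function of its embedding rather than as a free coefficient per category, so estimates for rare or new categories are shrunk toward those of similar categories.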


research
01/27/2021

Reproducing kernel Hilbert C*-module and kernel mean embeddings

Kernel methods have been among the most popular techniques in machine le...
research
11/23/2020

Discovering Causal Structure with Reproducing-Kernel Hilbert Space ε-Machines

We merge computational mechanics' definition of causal states (predictiv...
research
09/26/2013

Hilbert Space Embeddings of Predictive State Representations

Predictive State Representations (PSRs) are an expressive class of model...
research
08/13/2012

Path Integral Control by Reproducing Kernel Hilbert Space Embedding

We present an embedding of stochastic optimal control problems, of the s...
research
03/08/2021

A reproducing kernel Hilbert space framework for functional data classification

We encounter a bottleneck when we try to borrow the strength of classica...
research
06/06/2020

Learning Inconsistent Preferences with Kernel Methods

We propose a probabilistic kernel approach for preferential learning fro...
research
12/08/2020

Robustness of Model Predictions under Extension

Often, mathematical models of the real world are simplified representati...
