Research

Hierarchical Latent Class Models for Mortality Surveillance

Overview

Monitoring causes of death is crucial for understanding disease burdens and shaping public health interventions. In many low-resource settings, verbal autopsy (VA) is the primary tool for cause-of-death surveillance, relying on structured interviews with caregivers of the deceased. However, existing VA models often require extensive domain knowledge or large labeled datasets, making them ill-suited for emerging diseases with rapidly evolving epidemiological patterns.

Symptoms Distribution by Time and Age

Our Contribution

In our research, we propose a Bayesian hierarchical latent class model that accounts for partially verified VA data, enabling robust cause-of-death estimation under real-world data constraints. Key innovations include:

Latent Class Profiles

Why This Matters

Read More

📄 Full Paper: Hierarchical Latent Class Models for Mortality Surveillance (https://arxiv.org/abs/2410.09274)


Treatment Effect Estimation in Dynamic Network Environments [CODE@MIT, 2024]

Overview

Online multiplayer games introduce complex interactions among players, making it difficult to assess the true impact of game features. Traditional A/B testing methods, which assume no interference among units (SUTVA), fail in these settings due to the dynamic and ephemeral network structures formed during gameplay.

In this research, we address the challenges of network interference in online gaming experiments and propose a novel treatment effect estimator that enables post-hoc interference adjustment in randomized experiments.

Key Challenges

Our Contribution

To address these challenges, we propose a causal inference framework specifically designed for dynamic online gaming environments.

Treatment Effect Estimation Comparisons in Simulation Study

Why This Matters

Read More

📄 Full Paper: Treatment Effect Estimation Amidst Dynamic Network Interference (https://arxiv.org/abs/2402.05336)


Bayesian Tensor Decomposition for Verbal Autopsy Analysis

Overview

Accurate cause-of-death estimation is critical for understanding disease burdens and shaping public health policy, especially in low-resource settings where medical certification of deaths is often unavailable. Verbal autopsy (VA) is a widely used alternative, relying on interviews with caregivers to infer the probable cause of death.

Traditional VA models often use latent class models, which assume that symptoms are conditionally independent given the cause of death. However, real-world symptom distributions exhibit complex dependencies, making them hard to interpret and potentially limiting their accuracy.

Our Contribution

We propose a Bayesian hierarchical tensor decomposition framework that balances predictive accuracy with interpretability in VA cause-of-death estimation.

Key innovations include:

Key Results

We evaluate our method on simulated and real-world VA data, including the PHMRC gold-standard dataset, showing that:

Latent Class_Profiles for Stroke

Case Study: PHMRC Gold-Standard VA Dataset

We applied our model to 7,841 adult deaths across 34 causes from the PHMRC dataset, demonstrating:

Learned Symptom Clustering

The model identifies distinct symptom groups that correlate with specific diseases, reducing the complexity of high-dimensional VA data.

Cause-of-Death Relationships

Hierarchical clustering of causes based on symptom structures reveals intuitive groupings, such as cardiovascular diseases forming a distinct cluster.

Cause of Death Clustering

Why This Matters

Read More

📄 Full Paper: Bayesian Tensor Decomposition for VA (https://arxiv.org/abs/2502.00171)