Resume
๐ผ Work Experience
Meta โ Research Scientist, Feed Recommendation
June 2025 โ Present
- Sequence-level RL policy optimization for personalized feed ranking
- Transfer learning for recurrent RL to improve convergence stability
- Reward modeling refinement (debiased VPV alignment)
- Hierarchical latent reasoning LLM-as-Judge framework with SFT + distillation
- Online A/B experimentation for topline impact
Microsoft โ Data Scientist Intern, Azure Compute
June 2024 โ Sept 2024
- Constrained RL under non-stationary reward distributions
- Agentic LLM reasoning system for root cause analysis
- Human-in-the-loop feedback pipeline (RLHF-ready design)
- Evaluation harness for systematic model benchmarking
Tencent America โ Research Data Scientist Intern, IEG Global
Oct 2023 โ May 2024
- Treatment effect estimation under network interference
- Bayesian modeling for online experimentation
๐ Education
Ph.D. in Statistics, UC Santa Cruz โ 2025
M.S. in Statistics, UC Davis โ 2019
๐ Skills
Languages: R, Python, SQL, Java, C++, JavaScript, Machine Learning Frameworks: Keras, PyTorch, Tensorflow
Statistics and Experimentationg: A/B Testing, Experimental Design, Causal Inference, Bayesian Optimization
AI and Learning System: Reinforcement Learning (Policy Optimization, Q-learning, PPO/GRPO-style Optimization, RLHF), Sequence-level RL for Ranking & Personalization, LLM Post-training (SFT, Distillation, Preference Modeling), LLM Inference & Serving (vLLM, Low-latency Deployment), Reward Modeling & Alignment under Distribution Shift, Agentic Workflow Design & Feedback Loops, Transfer Learning & Cross-Domain Generalization, Online Experimentation & Evaluation Harness Design
๐ Selected Honors
- UCSC Graduate Deanโs Fellowship
- SFASA Travel Award
- WNAR Student Paper Competition Fellowship
- National Scholarship for Academic Excellence
