I'm an undergrad at UC Berkeley advised by Sewon Min. I'm interested in understanding and improving language model capabilities, particularly under real-world constraints.
Publications
-
Which Models Are Our Models Built On?
Auditing Invisible Dependencies in Modern LLMsWe introduce ModSleuth, an agentic system that reconstructs the dependency graphs behind modern LLMs from papers, model cards, dataset cards, code, configs, and upstream artifacts. It traces how models are used to generate data, filter corpora, evaluate outputs, and guide training decisions, recovering 1,060 source-verified dependencies across four releases and surfacing license-relevant paths, train-evaluation coupling, and documentation mismatches.
* equal contribution
-
Train Separately, Merge Together: Modular Post-Training with Mixture-of-ExpertsWe present BAR (Branch-Adapt-Route), a recipe for modular post-training. Rather than training a single model on all data at once, BAR trains independent domain experts – each through its own complete training pipeline – and composes them into a unified model via a mixture-of-experts (MoE) architecture. Each expert can be developed, upgraded, or replaced without touching the others.
* equal contribution
Industry Experience
- Jane Street, Machine Learning Engineer Intern Summer 2026
- Google, Software Engineer Intern May – Aug 2025
- Apple, Machine Learning Engineer Intern Mar – May 2025