Built across LLMs, vision, world models, and clinical AI. Now I want a PhD with room for deeper work.
I've shipped research with Tencent, Gradient Networks, MeetaVista, TU Delft, McGill, Tsinghua, and IMDEA. Next I want long-horizon work on useful, durable AI, especially where access and reliability matter.
Day-to-day
An independent AI researcher in Amsterdam. I currently collaborate with industry (Tencent, Gradient Networks, MeetaVista) and academia (TU Delft, McGill, Tsinghua) on cost-efficient large language models, optimization-as-reasoning, and intent-aware world models. Earlier roles include Marie Skłodowska-Curie Fellow at IMDEA Networks (Madrid), AI Research Engineer at TU Delft Imaging Physics, and an MSc in Computer & Embedded Systems Engineering at TU Delft.
Two things drive this work: pushing AI beyond text into perception and action, and making those systems fairer and more reachable in practice.
World models
Systems that perceive a scene and imagine what happens next. The bridge from describing the world to acting in it.
Large language models
Making capable models cheap, dependable, and useful enough to actually deploy. Routing, optimization, and agent design.
Computer vision
Vision as a channel for human signal, not just object detection. Reading emotion, intent, and attention from what people see.
AI for medicine
Models that work in the real clinic, not only on the leaderboard. Smaller, faster, and fair across patient populations.
AI safety
Building things that behave reliably when they leave the lab. Robustness, evaluation, and the unglamorous work of trust.
Fairness × access
A motivation, not a sub-field. Anything I build should reach the people who need it most, not the people who already have everything.
beyond language×fairness & access
§01·Research
§01 / 06
What runs through every paper here. I want this work to be useful where it matters, accessible to people who don't already have it, and meaningful enough to keep at for years.
First large-scale public eye-tracking dataset for VR emotion recognition, with high-frame-rate periocular video plus 240 Hz gaze, across seven discrete emotions.
A physics-driven framework that estimates tissue properties from annotated cardiac MRI via diffusion models, enabling zero-shot segmentation across unseen MRI sequences.
Trained nnU-Net models contain substantial weight redundancy. Over 80% of weights can be removed by simple magnitude pruning while preserving segmentation quality.
Where the research meets real systems. Each one is a live collaboration with an actual partner (Gradient Networks, Tencent, MeetaVista). Most are still in motion.
A browser extension that surfaces venue-quality badges, author-role signals, and flexible ranking data directly on Google Scholar. The information you wish was there at a glance.
Built a benchmark for evaluating LLM routing strategies. Showed that routing can preserve same-quality performance while reducing cost by >90% versus calling a single SOTA model for every step.
MAR 2026 → PRESENT · IN PROGRESS · TARGET NEURIPS '25
Cost-Adaptive LLM Routing with Specialist Models
Building on the LLM Router benchmark, using stronger-model trajectories to fine-tune small specialists for repeated workflows. As usage accumulates, small models improve and the system's cost falls.
MEETAVISTA · MAR 2026 → PRESENT · IN PROGRESS · TARGET TOP-TIER AI VENUE
Human Intent World Model
A vision-language world model that infers customer intent from visual cues and responds with sales-relevant reasoning, fine-tuned on a synthetic dataset distilled from classical sales literature.
TENCENT · APR 2026 → PRESENT · IN PROGRESS · TARGET TOP-TIER AI VENUE
LLM for Optimization
A framework that uses an optimization harness (structured search, feedback, memory retrieval) to guide LLMs toward better solutions on optimization problems like TSP, rather than relying on direct generation alone.
§03·Writing
§03 / 06
Notes on research, taste, and the boring parts of building AI.
I read everything. Reach out about research, collaboration, supervision, or just to say hi. I answer faster than is professional, and I bring my whole self to the conversation.
Seeking PhD positions · ready any time
Tony Yang杨童耘
Independent AI Researcher
World models · LLMs · vision · medical AI · safety