Welcome to Defu Cao’s Homepage

Research Identity

I build foundation models and agentic LLM systems for structured, scientific, and decision-critical data.

Agentic Post-Training · Agent Harnesses & Skills · LLM Safety · Scientific Foundation Models · Numerical LLM Interfaces

Defu Cao /dəfu: tsaʊ/ is a Ph.D. candidate in the Thomas Lord Department of Computer Science at the University of Southern California, working with Prof. Yan Liu at the USC Melady Lab. He also works closely with Prof. Yue Zhao and Prof. Angela Zhou from USC Marshall Data Sciences and Operations. Previously, he earned his master’s degree at the School of EECS, Peking University.

He has been fortunate to work as a visiting scholar with Prof. Yisong Yue at the California Institute of Technology (2025), focusing on multimodal foundation models and LLM infrastructure, and with Prof. Kun Zhang at MBZUAI (2023), focusing on causal inference and causal discovery.

Cao’s research focuses on foundation models for structured and scientific data, agentic post-training systems, and LLM safety. Time series and scientific signals are a major application domain, but the broader goal is to build models and agent harnesses that can represent complex numerical structure, compose tools and reusable skills, reason with evidence, interact with humans, and remain reliable under distribution shift, physical constraints, privacy risks, and high-stakes deployment.

His current research agenda connects several layers:

Direction	Representative Work	Core Question
Post-training and agentic systems	TSOrchestra, HILA, TS-Reasoner	How can LLMs become judges, planners, tool users, skill composers, collaborators, and continually improving agents?
LLM safety and agent security	“Someone Hid It!”, Topology Matters	How do retrieval systems, RAG, agent memory, and multi-agent topologies fail under attack or privacy pressure?
Foundation models for structured/scientific data	TEMPO, ClimateLLM, TimeDiT, PINFDiT	How can foundation models capture numerical structure, uncertainty, multi-resolution dynamics, and physics constraints?
Numerical interfaces for LLMs	Speaking Numbers to LLMs, GPT4MTS	How should LLMs ingest continuous values, multimodal signals, and domain-specific numerical evidence?
Benchmarks and high-stakes evaluation	TSAIA, TemporalBench, Physiological Waveform Reasoning	How do we evaluate reasoning quality, traceability, and utility beyond benchmark accuracy?

He has published his research in top conference proceedings including NeurIPS, ICML, ICLR, CVPR, ICRA, and NAACL.

Research Highlights

Highlight	Signal
TSOrchestra	Rank #1 on the Salesforce GIFT-Eval leaderboard; LLM-as-judge orchestration for foundation-model ensembles.
Agentic systems	HILA, TS-Reasoner, agent harnesses, reusable skills, human-in-the-loop multi-agent learning, tool-augmented reasoning, and adaptive collaboration.
LLM safety	Query-agnostic attacks on LLM-based retrieval and memory leakage in multi-agent systems.
Foundation models	TEMPO, ClimateLLM, TimeDiT, and PINFDiT for structured, scientific, and physics-constrained data.
Journal impact	🏆 “When Physics Meets Machine Learning” ranked #1 in Machine Learning for Computational Science and Engineering’s “Top 10 Downloaded Articles in 2025”.
Impact	4,000+ citations across 30+ publications; invited talks at Caltech, Rice, UBC, UCSD, DataDog AI Research Lab, and Peking University; invited workshop presentation at Google; Best RA Award 2024 in the USC CS Department.

Internships & Research Experience

Summer 2025: Visiting Scholar at California Institute of Technology, working with Prof. Yisong Yue
2024-2025: Contributor to Humanity Unleashed’s open-source cooperative AI policymaking platform; helped lead the pretraining track for the foundation-model component.
Summer 2024: Quantitative Researcher at Cubist Systematic Strategies
Summer 2023: Research Assistant at MBZUAI, working with Prof. Kun Zhang and Prof. Biwei Huang on causality
Summer 2022: Research Intern at Adobe Research, mentored by Dr. Zhaowen Wang
Previous: Research Intern at Microsoft Research Lab Asia (MSRA) (twice), working with Dr. Yujing Wang; Research Intern at Alibaba Damo Academy’s Data Analytics and Intelligence Lab, advised by Dr. Jingren Zhou
First Internship: Baidu, 7 years ago

Awards

2025, 1st Place on GIFT-EVAL Leaderboard, Salesforce
2024, Best Poster Award, USC
2024, Best Research Assistant Award, USC (1 per department)
2024, Endowed Fellowship, USC
2021, Annenberg Fellowship, USC
2021, MSRA Stars of Tomorrow Internship Program Award, Microsoft
2020, National Scholarship for Graduate Students, China
2020, Excellent Student Exchange Scholarship, Peking University
2020, Huawei Fellowship
2019, Intel Fellowship
2019, HeHE Fellowship
Travel Fellowship, ICLR/NeurIPS/ICML/AAAI/SDM
