Xianghao Kong

I’m a Video GenAI Researcher at Cupertino to bridge the gap between Hollywood and Silicon Valley. I recently completed my PhD at UC Riverside in CS, worked with the incredible ✨ Prof. Greg Ver Steeg ✨. Fortunately, I also worked at SonyAI(Host: Vikash Sehwag) and Adobe Firefly (Host: Hareesh Ravi) as a Research Intern.

My research centers on Generative Models (Diffusion Models & Energy-Based Models), with a focus on their interpretability, alignment, and compositionality. Specifically, I explore diffusion models through a novel information-theoretic lens, termed Information-Theoretic Diffusion (ITD) ℹ️. Our work demonstrates how Pointwise Mutual Information (PMI) enhances compositional reasoning and modality alignment (e.g., text and image). We are actively expanding the ITD universe 🌌 and welcome collaboration opportunities!

Prior to UCR, I focused on EEG data analysis in Brain-Computer Interface (BCI) 🧠 technology, bridging neuroscience and computer science. Outside of research, I enjoy exploring food, sketching, and visiting museums.

news

Nov 17, 2025	I reunited with Michael to create a 2-minute-30-second horror sci-fi short film, Dreamcatcher Hotel. Watch it now and leave likes or comments!
Sep 22, 2025	I’m excited to share that I’ve started a new role as a Video GenAI Researcher at BayArea, where work with Emmy Winners to explore GenAI-powered filmmaking workflows that incorporate real actors. Looking forward to this journey and the creative possibilities ahead!
Sep 03, 2025	PhD defense complete!! Doctor status unlocked 🍻
Jun 11, 2025	Flying to Nashville for CVPR 2025 and celebrating my birthday for the first time over 30,000 feet in the air!
Apr 26, 2025	I delivered a 20-minute presentation at SOCAMS ☕, had the pleasure of attending many insightful talks, and came away convinced that the REAL AGI still has a way to go.

selected publications

CVPR

Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget

Vikash Sehwag, Xianghao Kong , and 3 more authors

2025

arXiv Code
ICLR

Interpretable Diffusion via Information Decomposition

Xianghao Kong^*, Ollie Liu^* , and 3 more authors

2024

arXiv Code
ICLR

Information-Theoretic Diffusion

Xianghao Kong, Rob Brekelmans , and 1 more author

2023

arXiv Code
ACL

Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks

Haz Sameen Shahgir, Xianghao Kong , and 2 more authors

2024

arXiv Code