Xianghao Kong

Hello/你好👋

prof_pic.jpeg

I’m currently a Video GenAI Researcher at Voia. I was a CS Ph.D. student, worked with the incredible ✨ Prof. Greg Ver Steeg ✨ at the University of California, Riverside. Fortunately, I also worked at SonyAI(Host: Vikash Sehwag) and Adobe Firefly (Host: Hareesh Ravi) as a Research Intern.

My research centers on Generative Models (Diffusion Models & Energy-Based Models), with a focus on their interpretability, alignment, and compositionality. Specifically, I explore diffusion models through a novel information-theoretic lens, termed Information-Theoretic Diffusion (ITD) ℹ️. Our work demonstrates how Pointwise Mutual Information (PMI) enhances compositional reasoning and modality alignment (e.g., text and image). We are actively expanding the ITD universe 🌌 and welcome collaboration opportunities!

Prior to UCR, I focused on EEG data analysis in Brain-Computer Interface (BCI) 🧠 technology, bridging neuroscience and computer science. Outside of research, I enjoy exploring food, sketching, and visiting museums.

news

Sep 22, 2025 I’m excited to share that I’ve started a new role as a Video GenAI Researcher at Voia, where I’ll be exploring GenAI-powered filmmaking workflows that incorporate real actors. Looking forward to this journey and the creative possibilities ahead!
Sep 03, 2025 PhD defense complete!! Doctor status unlocked 🍻
Jun 11, 2025 Flying to Nashville for CVPR 2025 and celebrating my birthday for the first time over 30,000 feet in the air!
Apr 26, 2025 I delivered a 20-minute presentation at SOCAMS ☕, had the pleasure of attending many insightful talks, and came away convinced that the REAL AGI still has a way to go.
Mar 01, 2025 I am honored to receive the Dissertation Completion Fellowship Award from UCR! 🎉

selected publications

  1. Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
    Vikash Sehwag, Xianghao Kong , and 3 more authors
    2025
  2. Interpretable Diffusion via Information Decomposition
    Xianghao Kong*, Ollie Liu* , and 3 more authors
    2024
  3. Information-Theoretic Diffusion
    Xianghao Kong, Rob Brekelmans , and 1 more author
    2023
  4. ACL
    Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks
    Haz Sameen Shahgir, Xianghao Kong , and 2 more authors
    2024