Benran Hu

I'm a Research Engineer on the AMD GenAI team, focusing on generative models.

I received my MS in Computer Science from CMU, where I worked with Prof. Ioannis Gkioulekas on 3D reconstruction, inverse rendering, and differentiable rendering. Previously, I interned at Snap Research with Ivan Skorokhodov, working on better video generation models. I received my B.Sc. in Computer Science from HKUST, where I was advised by Prof. Chi-Keung Tang, Prof. Yu-Wing Tai, and Prof. Pedro V. Sander.

Email  /  Resume  /  Full CV  /  Scholar  /  GitHub


Research

I'm interested in 3D vision, inverse rendering and 3D reconstruction, scene understanding, generative models, and other areas of computer vision and graphics. Currently, I am working on text-to-image generative models. My past research mainly focused on equipping radiance fields and other neural scene representations with scene-understanding capabilities. I also work on improving 3D reconstruction and inverse rendering, and on accelerating real-time rendering with temporal reprojection.

Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation
Ze Wang, Hao Chen, Benran Hu, Jiang Liu, Ximeng Sun, Jialian Wu, Yusheng Su, Xiaodong Yu, Emad Barsoum, Zicheng Liu
arXiv, 2025
Hugging Face / GitHub / arXiv

Text-to-image generation with a 1D binary autoencoder for high compression rates and efficient training/inference.

Improving the Diffusability of Autoencoders
Ivan Skorokhodov, Sharath Girish, Benran Hu, Willi Menapace, Yanyu Li, Rameen Abdal, Sergey Tulyakov, Aliaksandr Siarohin
ICML, 2025
arXiv

Aligning spectral properties of RGB and latent spaces helps to create better autoencoders for diffusion models.

SANeRF-HQ: Segment Anything for NeRF in High Quality
Yichen Liu, Benran Hu, Chi-Keung Tang, Yu-Wing Tai
CVPR, 2024
project page / arXiv

Fusing multi-view SAM segmentation masks as an object field improves the performance of zero-shot 3D segmentation in NeRF.

Instance Neural Radiance Field
Yichen Liu*, Benran Hu*, Junkai Huang*, Yu-Wing Tai, Chi-Keung Tang
ICCV, 2023
GitHub / arXiv

3D instance segmentation in NeRF by matching multi-view instance masks with sparse 3D masks produced by a 3D Mask R-CNN.

NeRF-RPN: A general framework for object detection in NeRFs
Benran Hu*, Junkai Huang*, Yichen Liu*, Yu-Wing Tai, Chi-Keung Tang
CVPR, 2023
GitHub / arXiv

We introduce 3D object detection to NeRF by sampling feature grids from radiance fields and applying a 3D detector on them.


This is another website using Jon Barron's template.