Benran Hu

I'm a Research Engineer at AMD GenAI team, focusing on generative models.

I received my MS in Computer Science at CMU, where I worked with Prof. Ioannis Gkioulekas on 3D reconstruction, inverse rendering, and differentiable rendering. Previously, I interned at Snap Research with Ivan Skorokhodov on building better video generation models. I received my B.Sc. in Computer Science from HKUST, where I was advised by Prof. Chi-Keung Tang, Prof. Yu-Wing Tai, and Prof. Pedro V. Sander.

Email / Resume / Full CV / Scholar / GitHub

Research

I'm interested in 3D vision, inverse rendering and 3D reconstruction, scene understanding, generative models, as well as other fields in computer vision and graphics. Currently, I have been working on text-to-image generative models. My past research is mainly on empowering radiance fields and other neural scene representations with the capability of scene understanding. I also work on improving 3D reconstruction and inverse rendering, and accelerating real-time rendering with temporal reprojection.

	Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation Ze Wang, Hao Chen, Benran Hu, Jiang Liu, Ximeng Sun, Jialian Wu, Yusheng Su, Xiaodong Yu, Emad Barsoum, Zicheng Liu arXiv, 2025 Hugging Face / GitHub / arXiv Text-to-image generation with a 1D binary autoencoder for high compression rates and efficient training/inference.
	Improving the Diffusability of Autoencoders Ivan Skorokhodov, Sharath Girish, Benran Hu, Willi Menapace, Yanyu Li, Rameen Abdal, Sergey Tulyakov, Aliaksandr Siarohin ICML, 2025 arXiv Aligning spectral properties of RGB and latent spaces helps to create better autoencoders for diffusion models.
	SANeRF-HQ: Segment Anything for NeRF in High Quality Yichen Liu, Benran Hu, Chi-Keung Tang, Yu-Wing Tai CVPR, 2024 project page / arXiv Fusing multi-view SAM segmentation masks as an object field improves the performance of zero-shot 3D segmentation in NeRF.
	Instance Neural Radiance Field Yichen Liu, Benran Hu*, Junkai Huang, Yu-Wing Tai, Chi-Keung Tang ICCV, 2023 GitHub / arXiv 3D instance segmentation in NeRF by matching multi-view instance masks with sparse 3D masks produced by a 3D Mask R-CNN.
	NeRF-RPN: A general framework for object detection in NeRFs Benran Hu, Junkai Huang, Yichen Liu, Yu-Wing Tai, Chi-Keung Tang CVPR*, 2023 GitHub / arXiv We introduce 3D object detection to NeRF by sampling feature grids from radiance fields and applying a 3D detector on them.

This is another website using Jon Barron's template.