Research
I'm interested in 3D vision, inverse rendering and 3D reconstruction, scene understanding, generative models, as well as other fields in computer vision and graphics.
Currently, I have been working on text-to-image generative models.
My past research is mainly on empowering radiance fields and other neural scene representations with the capability of scene understanding.
I also work on improving 3D reconstruction and inverse rendering, and accelerating real-time rendering with temporal reprojection.
|
|
Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation
Ze Wang, Hao Chen,
Benran Hu,
Jiang Liu, Ximeng Sun, Jialian Wu, Yusheng Su, Xiaodong Yu, Emad Barsoum, Zicheng Liu
arXiv, 2025
Hugging Face
/
GitHub
/
arXiv
Text-to-image generation with a 1D binary autoencoder for high compression rates and efficient training/inference.
|
|
Improving the Diffusability of Autoencoders
Ivan Skorokhodov, Sharath Girish,
Benran Hu,
Willi Menapace, Yanyu Li, Rameen Abdal, Sergey Tulyakov, Aliaksandr Siarohin
ICML, 2025
arXiv
Aligning spectral properties of RGB and latent spaces helps to create better autoencoders for diffusion models.
|
|
SANeRF-HQ: Segment Anything for NeRF in High Quality
Yichen Liu,
Benran Hu,
Chi-Keung Tang,
Yu-Wing Tai
CVPR, 2024
project page
/
arXiv
Fusing multi-view SAM segmentation masks as an object field improves the performance of zero-shot 3D segmentation in NeRF.
|
|
Instance Neural Radiance Field
Yichen Liu*,
Benran Hu*,
Junkai Huang*,
Yu-Wing Tai,
Chi-Keung Tang
ICCV, 2023
GitHub
/
arXiv
3D instance segmentation in NeRF by matching multi-view instance masks with sparse 3D masks produced by a 3D Mask R-CNN.
|
|
NeRF-RPN: A general framework for object detection in NeRFs
Benran Hu*,
Junkai Huang*,
Yichen Liu*,
Yu-Wing Tai,
Chi-Keung Tang
CVPR, 2023
GitHub
/
arXiv
We introduce 3D object detection to NeRF by sampling feature grids from radiance fields and applying a 3D detector on them.
|
This is another website using Jon Barron's template.
|
|