官善琰

Shanyan Guan

Researcher, AI Photography R&D, vivo Imaging Department

guanshanyan@vivo.com

Shanghai, China

📢 招生: 长期招收实习生 & 少量应届校招名额。研究方向:视频模型后训练 | 高效图像生成/编辑 | 世界模型。 联系我

Shanyan Guan

About

I am a researcher at vivo Imaging Department, working on AI Photography R&D. I received my Ph.D. from Shanghai Jiao Tong University in 2024 (advisors: Prof. Xiaokang Yang & Prof. Yunbo Wang), and my B.S. from Xidian University in 2017.

Previously interned at Tencent XR, SenseTime, Tencent YouTu Lab, and miHoYo.

Research: High-resolution image generation, video synthesis, world models.

Publications (* Co-first author, † Project Lead)

2026

VINS-120K: Ultra High-Resolution Image Editing with A Large-Scale Dataset

Zhizhou Chen, Shanyan Guan, Zhanxin Gao, En Ci, Yanhao Ge, Wei Li, Zhenyu Zhang, Jian Yang, Ying Tai

CVPR 2026

Guiding a Diffusion Model by Swapping Its Tokens

Weijia Zhang, Yuehao Liu, Shanyan Guan, Wu Ran, Yanhao Ge, Wei Li, Chao Ma

CVPR 2026

Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models

Yuehao Liu, Shanyan Guan, Weijia Zhang, Xuanming Shang, Yanhao Ge, Wei Li, Chao Ma

CVPR 2026

LearnIR: Learnable Posterior Sampling for Real-World Image Restoration

Yihang Bao, Zhen Huang, Shanyan Guan, Songlin Yang, Yanhao Ge, Wei Li, Bukun Huang, Zengmin Xu

ICLR 2026

2025

UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset

Shanyan Guan, Yanhao Ge, Yunbo Wang, Wei Li, Xiaokang Yang

NeurIPS 2025

NeoWorld: Neural Simulation of Explorable Virtual Worlds via Progressive 3D Unfolding

Yanpeng Zhao, Shanyan Guan, Yunbo Wang, Yanhao Ge, Wei Li, Xiaokang Yang

arXiv preprint, 2025

Describe, Don't Dictate: Semantic Image Editing with Natural Language Intent

En Ci*, Shanyan Guan*, Yanhao Ge, Yilin Zhang, Wei Li, Jian Yang, Ying Zhenyu Zhang, Tai

ICCV 2025

PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing

Feng Tian, Yixuan Li, Yichao Yan, Shanyan Guan, Yanhao Ge, Xiaokang Yang

ICLR 2025

2024

NeuMA: Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics

Junyi Cao, Shanyan Guan†, Yanhao Ge, Wei Li, Xiaokang Yang, Chao Ma

NeurIPS 2024

HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation

Shanyan Guan*, Yanhao Ge*, Ying Tai, Jian Yang, Wei Li, Mingyu You

ECCV 2024

2022

CageNeRF: Cage-based Neural Radiance Field for Generalized 3D Deformation and Animation

Yicong Peng, Yichao Yan, Shengqi Liu, Yuhao Cheng, Shanyan Guan, Bowen Pan, Guangtao Zhai, Xiaokang Yang

NeurIPS 2022

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation

Shanyan Guan, Jingwei Xu, Michelle Z He, Yunbo Wang, Bingbing Ni, Xiaokang Yang

IEEE T-PAMI 2022

PTSEFormer: Progressive Temporal-Spatial Enhanced Transformer Towards Video Object Detection

Han Wang, Jun Tang, Xiaodong Liu, Shanyan Guan, Rong Xie, Li Song

ECCV 2022

NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields

Shanyan Guan, Huayu Deng, Yunbo Wang, Xiaokang Yang

ICML 2022 Spotlight

2021 & Earlier

Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction

Shanyan Guan*, Jingwei Xu*, Yunbo Wang, Bingbing Ni, Xiaokang Yang

CVPR 2021

Collaborative Learning for Faster StyleGAN Embedding

Shanyan Guan, Ying Tai, Bingbing Ni, Feida Zhu, Feiyue Huang, Xiaokang Yang

arXiv 2019

Human Action Transfer Based on 3D Model Reconstruction

Shanyan Guan*, Shuo Wen*, Dexin Yang*, Bingbing Ni, Wendong Zhang, Jun Tang, Xiaokang Yang

AAAI 2019 Oral

Talks

Next-Gen Camera Systems: Instilling Intuitive Imagination in Smartphones

IJCAI 2024 Industry Day

Next-Gen Camera System: Instilling Intuitive Imagination in Smartphones

GAMES Webinar 337

基于视觉的领域泛化和物理运动推断

智东西公开课

基于视觉的物理规律反演

智东西公开课