I am currently a Senior Researcher at SenseTime.
I obtained my Ph.D. in 2023 from the MMLab at The Chinese University of Hong Kong (CUHK),
where I was fortunate to be advised by Prof. Hongsheng Li and
Prof. Xiaogang Wang.
Prior to that, I earned my bachelor's degree from Nanjing University in 2019.
My current research interests focus on generative models and computational photography.
I am passionate about conducting research with real-world impact, addressing pressing technological challenges.
Most of my publications have been successfully integrated into commercial products.
A novel AR mirror system that allows a seamless, perspective-aligned user experience, which is enabled by
placing the camera behind a transparent display.
We introduce Motion-I2V, a novel framework for consistent and controllable image-to-video generation (I2V).
In contrast to previous methods that directly learn the complicated image-to-video mapping, Motion-I2V
factorizes I2V into two stages with explicit motion modeling.
A general-purpose backbone for image restoration tasks (e.g. denoising, deraining, and
deblurring).
A novel kernel basis attention (KBA) module has been proposed to effectively aggregate the
spatial information via a
series of learnable kernel bases.
We propose a grouped spatial-temporal shift module for effective video restoration.
It surpasses previous SOTA methods with only 43% computational cost on video deblurring and
denoising.
Learning degradation representations for blurry images. The learned degradation
representation
shows pleasing improvements on both image deblurring and reblurring.
A self-supervised image denoising method using noisy images and the noise model.
We achieve SOTA results on both real-world noise and synthetic noises (both point-wise and
spatial correlated noise types).
A method to synthesize "almost the most realistic" raw image noise by sampling directly from
the real noise distribution.
We also found existing comparisons of noise synthesis methods are based on inaccurate noise
parameters.
We achieve over 93% winning rate of Protoss against the most difficult non-cheating
built-in AI (level-7) of Terran,
training within two days using a single machine with only 48 CPU cores and 8 K40 GPUs.