Real-time 3D-aware Portrait Video Relighting

Official PyTorch implementation of the CVPR 2024 Highlight Paper

Real-time 3D-aware Portrait Video Relighting
Ziqi Cai, Kaiwen Jiang, Shu-Yu Chen, Yu-Kun Lai, Hongbo Fu, Boxin Shi, Lin Gao*

Abstract: Synthesizing realistic videos of talking faces under custom lighting conditions and viewing angles benefits various downstream applications like video conferencing. However, most existing relighting methods are either time-consuming or unable to adjust the viewpoints. In this paper, we present the first real-time 3D-aware method for relighting in-the-wild videos of talking faces based on Neural Radiance Fields (NeRF). Given an input portrait video, our method can synthesize talking faces under both novel views and novel lighting conditions with a photo-realistic and disentangled 3D representation. Specifically, we infer an albedo tri-plane, as well as a shading tri-plane based on a desired lighting condition for each video frame with fast dual-encoders. We also leverage a temporal consistency network to ensure smooth transitions and reduce flickering artifacts. Our method runs at 32.98 fps on consumer-level hardware and achieves state-of-the-art results in terms of reconstruction quality, lighting error, lighting instability, temporal consistency and inference speed. We demonstrate the effectiveness and interactivity of our method on various portrait videos with diverse lighting and viewing conditions.

Installation

Prerequisites

The code has been tested in the following environment:

CUDA: 12.4
PyTorch: 2.2.2
Python: 3.10.14
GCC: 11.4.0
Ubuntu: 22.04

Steps

Clone the repository:

git clone https://github.com/GhostCai/PortraitRelighting.git
cd PortraitRelighting

Set up a virtual environment:

conda create -n relighting python=3.10
conda activate relighting

Install dependencies:
```
pip install -r requirements.txt
```
Download pre-trained models and save them in the checkpoints/ directory.
Download NeRFFaceLighting-ffhq-64.pkl from here and rename it to NeRFFaceLighting.pkl. Save it in the checkpoints directory.
Go to the link to download its BFM directory, follow the Prepare prerequisite models section (we set 'model_name' as 'pretrained' ), and put its supplemented BFM and checkpoints directories under third_party/CropPose directory.

Quick Start

To quickly relight an image or video, simply run the example script:

python example.py

The relighted outputs will be saved in the examples/ directory.

Usage

Image Relighting

Initialize the Model

Set up the environment and load the necessary models:

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
cropposer, relighting, dpr = initialize_models(device)
example_lightings = np.load("examples/example_lightings.npy")

Warm-up the Models

Prepare the models for inference:

warm_up_models(cropposer, relighting, dpr, device)

Reconstruct the Input Image (optional)

Optionally, reconstruct the input image before relighting:

cropped_image, cam, planes = reconstruct_image(cropposer, relighting, dpr, device)

If you prefer to relight an image with a custom camera pose, you can directly modify the cam and sh parameters in the reconstruct_image function, instead of estimating them from the input image.

Perform Relighting

Apply relighting to the image using the pre-loaded lighting settings:

perform_relighting(cropped_image, planes, relighting, device, example_lightings)

Video Relighting

Initialize the Models

Follow the same initialization steps as for Image Relighting.

Ensure you have the following:

Cropped video frames in examples/video/cropped/
Corresponding camera parameters in examples/video/camera/

Perform Relighting on Video Frames

For a detailed implementation, refer to the example.py script.

Citation

If you use this project in your research, please cite it as follows:

@InProceedings{Cai_2024_CVPR,
  author    = {Cai, Ziqi and Jiang, Kaiwen and Chen, Shu-Yu and Lai, Yu-Kun and Fu, Hongbo and Shi, Boxin and Gao, Lin},
  title     = {Real-time 3D-aware Portrait Video Relighting},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  month     = {June},
  year      = {2024},
  pages     = {6221-6231}
}

License

This project is licensed under the Apache-2.0 license. See the LICENSE file for more details.

Acknowledgements

This repository relies on the NeRFFaceLighting, EG3D, DECA, Deep3DRecon, DPR, E4E, face.evoLVe, face-parsing.PyTorch, and facemesh.pytorch.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
docs		docs
examples		examples
networks		networks
third_party		third_party
LICENSE		LICENSE
example.py		example.py
readme.md		readme.md
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real-time 3D-aware Portrait Video Relighting

Table of Contents

Installation

Prerequisites

Steps

Quick Start

Usage

Image Relighting

Video Relighting

Citation

License

Acknowledgements

About

Releases

Packages

Languages

License

GhostCai/PortraitRelighting

Folders and files

Latest commit

History

Repository files navigation

Real-time 3D-aware Portrait Video Relighting

Table of Contents

Installation

Prerequisites

Steps

Quick Start

Usage

Image Relighting

Video Relighting

Citation

License

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages