Some questions about the paper #58

zhoutianyang2002 · 2024-08-11T12:49:15Z

Hi!

Thank you for your excellent work!

I am a newbie of 3D Vision. May I ask some questions about the paper?

I notice that we do not use the perceptual loss term(e.g. vgg perceptual loss) in the loss function unlike other 3DGS avatar papers. We only use L1 term and SSIM loss term. Is that because it is empirically effective or for other reasons?
In this paper, we unproject all pixels of two views into 3D space to form 3D Gaussians. Will it result in the existence of many Gaussian positions in 3D space are very close(because they are corresponding points in 2D images), leading to duplication and reduced efficiency?
In formula(6), it maybe does not like a matrix multiplication form. Maybe the indices are wrong? In other words, maybe the correct form is $$C_{i j k}=\sum_{h}\left(\mathbf{f}{l}^{S}\right){i j h} \cdot\left(\mathbf{f}{r}^{S}\right){i h k}$$, or $$C_{i j k}=\sum_{h}\left(\mathbf{f}{l}^{S}\right){i h k} \cdot\left(\mathbf{f}{r}^{S}\right){h j k}$$ , not $$C_{i j k}=\sum_{h}\left(\mathbf{f}{l}^{S}\right){i j h} \cdot\left(\mathbf{f}{r}^{S}\right){i k h}$$ in paper?

Sorry to bother you. Thank you very much!

ShunyuanZheng · 2024-08-12T14:22:03Z

Hi, thanks for your interest!

We have tried to use LPIPS loss in the training of GPS-Gaussian but witnessed no significant improvement. Considering the additional memory usage, we do not use it in our pipeline. The loss term of L1+SSIM including the weights follows the setup in 3DGS.
Yes, the Gaussians are very close and small in size compared to the original 3DGS. However, the number of Gaussian points does not significantly degrade the efficiency. As reported in our supplementary material, the rendering of around 300 thousand Gaussians takes around 0.8ms. The compression of GPS-Gaussian as discussed in about per-pixel gaussian allocation #54 (comment) worth an in-deep research.
Eq6 borrows from RAFT-Stereo.

zhoutianyang2002 · 2024-08-13T03:00:17Z

2. Yes, the Gaussians are very close and small in size compared to the original 3DGS. However, the number of Gaussian points does not significantly degrade the efficiency. As reported in our supplementary material, the rendering of around 300 thousand Gaussians takes around 0.8ms. The compression of GPS-Gaussian as discussed in about per-pixel gaussian allocation #54 (comment) worth an in-deep research.

Thank you for your reply! Best wishes!

zhoutianyang2002 closed this as completed Aug 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some questions about the paper #58

Some questions about the paper #58

zhoutianyang2002 commented Aug 11, 2024

ShunyuanZheng commented Aug 12, 2024

zhoutianyang2002 commented Aug 13, 2024

Some questions about the paper #58

Some questions about the paper #58

Comments

zhoutianyang2002 commented Aug 11, 2024

ShunyuanZheng commented Aug 12, 2024

zhoutianyang2002 commented Aug 13, 2024