Light of Normals: Unified Feature Representation for Universal Photometric Stereo

1BAAI, 2BUAA, 3NJU, 4FNii, CUHKSZ, 5NTU, 6BNU, 7NII, 8HKUST, 9PKU, 10AIR, THU
*Equal Contribution  Equal Contribution  Corresponding Author

Abstract

Universal photometric stereo (PS) aims to recover high-quality surface normals from objects under arbitrary lighting conditions without relying on specific illumination models. Despite recent advances such as SDM-UniPS and Uni MS-PS, two fundamental challenges persist: 1) the deep coupling between varying illumination and surface normal features, where ambiguity in observed intensity makes it difficult to determine whether brightness variations stem from lighting changes or surface orientation; and 2) the preservation of high-frequency geometric details in complex surfaces, where intricate geometries create self-shadowing, inter-reflections, and subtle normal variations that conventional feature processing operations struggle to capture accurately.

Method

Overview of the LINO-UniPS architecture, featuring a Light-Normal Contextual Encoder, Decoder, and loss computation.

Left Top Image

LINO-UniPS significantly performs better when processing data characterized by high-frequency information.

Left Bottom Image

Attention maps of lighting registers tokens on the encoder's final-layer. Different tokens exhibit specialized attention on diverse lighting information from multiple directions.

Right Top Image

The features extracted by our LiNO-UniPS encoder effectively disentangle lighting from surface normal information and concurrently exhibit enhanced consistency.

Right Bottom Image

Ablation Study

Ablation study first reveals that training on PS-Verse is substantially better than on PS-Mix. It further demonstrates that by incrementally adding key components like light registers tokens, global attention, light alignment, wavelet transform and gradient perception loss, the Performance (MAE) and the feature consistency (CSIM, SSIM) can be steadily improved compared to the baseline.

feature_consistency

Feature consistency

LINO-UniPS enhances the decoupling of lighting and normals, resulting in features with greater consistency.

feature_consistency

Some Visual Results

Hover to view an example from the multi-light input images and the corresponding surface normals reconstructed by LINO-UniPS.

Image 1 Image 2 Image 3 Image 4 Image 5 Image 6 Image 7 Image 8 Image 9 Image 10 Image 11 Image 12 Image 13 Image 14 Image 15 Image 16 Image 17 Image 18 Image 19 Image 20 Image 21 Image 22 Image 23 Image 24 Image 25 Image 26 Image 27 Image 28 Image 29 Image 30

PS-Verse Dataset

Level 1

Showcase Image 1
Showcase Image 2
Showcase Image 10
Showcase Image 11
Showcase Image 12
Showcase Image 20
Showcase Image 21
Showcase Image 30
Showcase Image 31
Showcase Image 40

Level 2

Showcase Image 1
Showcase Image 2
Showcase Image 10
Showcase Image 11
Showcase Image 12
Showcase Image 20
Showcase Image 21
Showcase Image 30
Showcase Image 31
Showcase Image 40

Level 3

Showcase Image 1
Showcase Image 2
Showcase Image 10
Showcase Image 11
Showcase Image 12
Showcase Image 20
Showcase Image 21
Showcase Image 30
Showcase Image 31
Showcase Image 40

Level 4

Showcase Image 1
Showcase Image 2
Showcase Image 10
Showcase Image 11
Showcase Image 12
Showcase Image 20
Showcase Image 21
Showcase Image 30
Showcase Image 31
Showcase Image 40

Level 5

Showcase Image 1
Showcase Image 2
Showcase Image 10
Showcase Image 11
Showcase Image 12
Showcase Image 20
Showcase Image 21
Showcase Image 30
Showcase Image 31
Showcase Image 40

Downstream Application

The detailed surface normals reconstructed by LINO-UniPS can be directly applied to downstream tasks such as 3D generation. Hi👋3DGen:


LINO-UniPS excels as a normal estimation module, capable of replacing the second stage of the Neural LightRig pipeline. The figure below compares the multi-light inputs from Stage 1 of Neural LightRig pipeline (left) with the final normal maps. This comparison highlights that LINO-UniPS (bottom right) achieves significantly more detailed and precise results than the original Neural LightRig pipeline Stage 2 (top right). Additionally, a future release will incorporate support for PBR material prediction.

Main Image
Normal Map 1 Nerual LightRig
Normal Map 2 LINO-UniPS
Thumbnail 0 Thumbnail 1 Thumbnail 2 Thumbnail 3 Thumbnail 4 Thumbnail 5 Thumbnail 6

BibTeX

@article{li2025lightnormalsunifiedfeature,
      title={Light of Normals: Unified Feature Representation for Universal Photometric Stereo}, 
      author={Hong Li and Houyuan Chen and Chongjie Ye and Zhaoxi Chen and Bohan Li and Shaocong Xu and Xianda Guo and Xuhui Liu and Yikai Wang and Baochang Zhang and Satoshi Ikehata and Boxin Shi and Anyi Rao and Hao Zhao},
      journal={arXiv preprint arXiv:2506.18882},
      year={2025}
}