Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting

1AIR, Tsinghua University 2Beihang University 3Nanyang Technological University 4Shanghai Jiao Tong University 5Eastern Institute of Technology, Ningbo 6Tongji University 7The Chinese University of Hong Kong 8The Chinese University of Hong Kong, Shenzhen 9University of Trento 10Zhejiang University 11Lightwheel AI 12LeddarTech

TL;DR: we introduce Multi-Scale Bilateral Grids, which unify appearance codes and bilateral grids and significantly improve geometric accuracy in dynamic, decoupled autonomous driving scene reconstruction.

Abstract

Neural rendering techniques, including NeRF and Gaussian Splatting (GS), rely on photometric consistency to produce high-quality reconstructions. However, in real-world scenarios, it is challenging to guarantee perfect photometric consistency in acquired images. Appearance codes have been widely used to address this issue, but their modeling capability is limited, as a single code is applied to the entire image. Recently, the bilateral grid was introduced to perform pixel-wise color mapping, but it is difficult to optimize and constrain effectively. In this paper, we propose a novel multi-scale bilateral grid that unifies appearance codes and bilateral grids. We demonstrate that this approach significantly improves geometric accuracy in dynamic, decoupled autonomous driving scene reconstruction, outperforming both appearance codes and bilateral grids. Accurate geometry is especially important for autonomous driving, where it underpins obstacle avoidance and control. Our method shows strong results across four datasets: Waymo, NuScenes, Argoverse, and PandaSet. We further demonstrate that the improvement in geometry is driven by the multi-scale bilateral grid, which effectively reduces floaters caused by photometric inconsistency.
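
To make the bilateral-grid idea concrete, here is a minimal PyTorch sketch of the slicing operation: a learnable 3D grid stores per-cell 3x4 affine color transforms, and each pixel of a rendered image is corrected by the transform sampled at its (x, y, luminance) coordinate. The function name, grid layout, and all shapes are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn.functional as F

def slice_bilateral_grid(grid, image, guide):
    """Sample per-pixel 3x4 affine color transforms from a bilateral grid.

    grid:  (B, 12, D, Gh, Gw) learnable grid of affine transforms (assumed layout)
    image: (B, 3, H, W) rendered image to be color-corrected
    guide: (B, 1, H, W) luminance map in [0, 1], used as the grid's depth axis
    """
    B, _, H, W = image.shape
    # Normalized (x, y) sampling coordinates in [-1, 1] for grid_sample.
    ys, xs = torch.meshgrid(
        torch.linspace(-1, 1, H, device=image.device),
        torch.linspace(-1, 1, W, device=image.device),
        indexing="ij",
    )
    xy = torch.stack([xs, ys], dim=-1).expand(B, H, W, 2)
    z = guide.squeeze(1).unsqueeze(-1) * 2 - 1            # map [0, 1] -> [-1, 1]
    coords = torch.cat([xy, z], dim=-1).unsqueeze(1)      # (B, 1, H, W, 3)

    # Trilinear slice: each pixel fetches a 12-vector (one 3x4 affine transform).
    affine = F.grid_sample(grid, coords, align_corners=True)  # (B, 12, 1, H, W)
    affine = affine.squeeze(2).view(B, 3, 4, H, W)

    # Apply the per-pixel affine transform: out = A @ rgb + b.
    rgb = image.unsqueeze(1)                              # (B, 1, 3, H, W)
    return (affine[:, :, :3] * rgb).sum(2) + affine[:, :, 3]
```

Note the unification in the limit: a grid with a single spatial cell applies one global transform to every pixel, which is exactly the behavior of an appearance code.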



Framework

We unify appearance codes with multi-scale bilateral grids. First, a coarse rendering is obtained from a Gaussian scene graph. This rendering is then processed by our multi-scale bilateral grids, which perform detailed per-pixel color modeling guided by a luminance-based map through slice and fusion operations, as sketched below.
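
The following sketch (reusing the slice_bilateral_grid helper from above) illustrates one plausible reading of this pipeline: grids at several resolutions are each sliced with a luminance guide and the results are fused. The coarsest 1x1x1 grid degenerates to a global appearance code, while finer grids add per-pixel detail. The scale choices, the Rec. 601 luminance weights, and the mean fusion rule are our assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn

class MultiScaleBilateralGrid(nn.Module):
    """Hypothetical pyramid of bilateral grids: the 1x1x1 level acts as a
    global appearance code; finer levels refine colors per pixel."""

    def __init__(self, scales=((1, 1, 1), (4, 8, 8), (8, 16, 16))):
        super().__init__()
        # Initialize every cell to the identity affine transform [I | 0].
        identity = torch.cat([torch.eye(3), torch.zeros(3, 1)], 1).reshape(12, 1, 1, 1)
        self.grids = nn.ParameterList(
            nn.Parameter(identity.repeat(1, d, gh, gw).unsqueeze(0))
            for d, gh, gw in scales
        )

    def forward(self, image):
        # Luminance of the coarse rendering guides the slicing.
        w = image.new_tensor([0.299, 0.587, 0.114]).view(1, 3, 1, 1)
        guide = (image * w).sum(1, keepdim=True).clamp(0, 1)
        # Slice each scale and fuse the corrected images (here: a simple mean).
        outs = [
            slice_bilateral_grid(
                g.expand(image.shape[0], -1, -1, -1, -1), image, guide
            )
            for g in self.grids
        ]
        return torch.stack(outs).mean(0)

# Usage: correct a batch of coarse renderings from the Gaussian scene graph.
model = MultiScaleBilateralGrid()
rendered = torch.rand(2, 3, 64, 96)   # stand-in for rendered images
corrected = model(rendered)           # (2, 3, 64, 96)
```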