MLP Splatting: Object-Centric Neural Fields
- Imperial College London
- * Denotes Equal Contribution
arXiv
Decomposed objects
MLP-Splatting efficiently reconstructs even the most complex objects with just a few hundred primitives. It matches Feature-3DGS in scene quality (+0.6 dB PSNR) while using 1/63 of the primitives and 1/6 of the memory, and it renders 2× faster.
Method figures
MLP-Splatting offers the compactness of a neural scene representation while keeping each primitive’s influence independent, which makes downstream tasks such as scene editing straightforward. For example, adding semantic guidance on top of the proposed method enables language-guided editing (see examples).
Abstract
3D representations are fundamental to scene rendering, understanding, and interaction. Recent approaches, such as 3D Gaussian Splatting and Neural Radiance Fields, achieve impressive photorealistic novel-view synthesis, but lack the ability to easily decompose scene elements into a few primitives, requiring additional segmentation or grouping for object-level manipulation. We present MLP-Splatting, a method that enables scene decomposition via a few expressive light-field primitives while providing photorealistic novel-view synthesis.
MLP-Splatting models each primitive as an independent compact MLP with localized spatial support that predicts radiance and opacity. In contrast to low-level Gaussian primitives or a single global radiance field, our neural primitives provide greater expressive capacity while remaining spatially localized. Rendering is performed through efficient sparse volumetric compositing over ray–primitive interactions.
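To make the idea of localized MLP primitives and sparse compositing concrete, here is a minimal NumPy sketch. It is not the paper's implementation. The spherical support, the two-layer MLP, the density-weighted blending of overlapping primitives, and the names `Primitive` and `render_ray` are all assumptions chosen for illustration.

```python
import numpy as np


class Primitive:
    """Hypothetical localized primitive: a tiny MLP active only inside a sphere."""

    def __init__(self, center, radius, rng, hidden=16):
        self.center = np.asarray(center, dtype=float)
        self.radius = float(radius)
        # Input: local position (3) + view direction (3). Output: RGB (3) + density (1).
        self.w1 = rng.normal(0.0, 0.5, (6, hidden))
        self.b1 = np.zeros(hidden)
        self.w2 = rng.normal(0.0, 0.5, (hidden, 4))
        self.b2 = np.zeros(4)

    def query(self, points, view_dir):
        """Return (rgb, sigma) for points of shape (N, 3); sigma is 0 outside the support."""
        local = (points - self.center) / self.radius
        inside = np.linalg.norm(local, axis=1) <= 1.0
        x = np.concatenate([local, np.broadcast_to(view_dir, local.shape)], axis=1)
        h = np.maximum(x @ self.w1 + self.b1, 0.0)      # ReLU hidden layer
        out = h @ self.w2 + self.b2
        rgb = 1.0 / (1.0 + np.exp(-out[:, :3]))         # sigmoid keeps colour in [0, 1]
        sigma = np.log1p(np.exp(out[:, 3])) * inside    # softplus density, masked by support
        return rgb, sigma


def render_ray(origin, direction, primitives, near=0.0, far=4.0, n_samples=64):
    """Composite colour and accumulated opacity along one ray."""
    direction = direction / np.linalg.norm(direction)
    t = np.linspace(near, far, n_samples)
    delta = (far - near) / (n_samples - 1)
    points = origin + t[:, None] * direction

    sigma_total = np.zeros(n_samples)
    rgb_weighted = np.zeros((n_samples, 3))
    for p in primitives:
        # Sparse step: skip primitives whose sphere the ray's supporting line misses.
        to_center = p.center - origin
        t_closest = to_center @ direction
        if to_center @ to_center - t_closest**2 > p.radius**2:
            continue
        rgb, sigma = p.query(points, direction)
        sigma_total += sigma
        rgb_weighted += sigma[:, None] * rgb

    # Density-weighted colour where primitives overlap (an assumed blending rule).
    rgb_mix = rgb_weighted / np.maximum(sigma_total, 1e-8)[:, None]
    alpha = 1.0 - np.exp(-sigma_total * delta)
    transmittance = np.cumprod(np.concatenate([[1.0], 1.0 - alpha[:-1]]))
    weights = transmittance * alpha
    return weights @ rgb_mix, weights.sum()


rng = np.random.default_rng(0)
scene = [Primitive([0.0, 0.0, 2.0], 0.5, rng)]
hit_color, hit_opacity = render_ray(np.zeros(3), np.array([0.0, 0.0, 1.0]), scene)
miss_color, miss_opacity = render_ray(np.zeros(3), np.array([1.0, 0.0, 0.0]), scene)
```

Because each primitive contributes only inside its own support, dropping one from the `scene` list removes its content from the render without touching the others, which is the property that makes primitive-level editing simple in this sketch.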
Our primitives are trained with RGB supervision alone, which yields primitives that represent local scene regions, often corresponding to objects or object parts. This enables interactive object-level editing without segmentation masks: selecting a handful of primitives is enough. Augmented with optional semantic feature distillation, our method also enables open-vocabulary scene interaction and open-set instant segmentation. Compared to state-of-the-art semantic 3DGS methods, our experiments show substantially lower memory usage ($1/15\times$) and faster rendering ($3\times$).
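One way open-vocabulary selection over a small set of primitives could work is sketched below. It assumes each primitive carries a single distilled feature vector and that a text encoder (not specified here) maps the query into the same space; the function `select_primitives`, the threshold value, and the toy vectors are hypothetical and stand in for real features.

```python
import numpy as np


def select_primitives(primitive_features, text_embedding, threshold=0.5):
    """Return indices of primitives whose feature is cosine-similar to the query.

    primitive_features: (P, D) array, one assumed semantic feature per primitive.
    text_embedding:     (D,) array from a hypothetical text encoder.
    """
    feats = primitive_features / np.linalg.norm(primitive_features, axis=1, keepdims=True)
    query = text_embedding / np.linalg.norm(text_embedding)
    similarity = feats @ query
    return np.flatnonzero(similarity >= threshold), similarity


# Toy 2-D features standing in for distilled semantic features.
features = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
query = np.array([1.0, 0.0])
selected, scores = select_primitives(features, query)
```

With only a few hundred primitives per scene, a dense similarity test like this is cheap, and the returned indices can be passed directly to an edit such as deleting or moving the selected primitives.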
Interactive Demo
Below is an interactive demo of the easy object selection, editing, and language-guided editing enabled by our object-level primitives.
Results
BibTeX
@article{kim2026mlpsplatting,
title = {MLP Splatting: Object-Centric Neural Fields},
author = {Kim, Shinjeong and Cheng, Yuzhou and Kong, Xin and Kelly, Paul H. J. and Davison, Andrew J.},
journal = {arXiv preprint arXiv:2606.03877},
year = {2026}
}