Mark Boss

Co-Head of 3D & Image

I’m a research lead in 3D at Stability AI with research interests in the intersection of machine learning and computer graphics.

Mark Boss

Mark Boss is the Co-Head of 3D & Image at Stability AI. He worked at Unity Technologies before and completed his PhD at the University of Tübingen in the computer graphics group of …

Mark Boss

• Jun 22, 2026 • 1 min read

3D Generation

Arbor: Explicit Geometric Conditioning for Controllable 3D Asset Generation

A trainable adapter for 3D generators that introduces explicit geometric control via typed constraint meshes (hull, avoidance, touch).

jan-niklas-dihlmann

• Jun 22, 2026 • 1 min read

Image Layer Decomposition

Stable-Layers: Fine-Tuning Image Layer Decomposition Models with VLM-Scored Reinforcement Learning

An RL framework that fine-tunes image layer decomposition models using VLM-as-judge rewards, eliminating paired supervision.

ciara-rowles

• May 28, 2026 • 1 min read

KV Cache

OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under Optimal Squared Error Quantization

A rotation-preconditioned KV cache codec that jointly quantizes coordinate triplets via an octahedral map, achieving state-of-the-art compression across text, video, and audio …

Mark Boss

• May 21, 2026 • 1 min read

3D Reconstruction

ReLi3D: Relightable Multi-view 3D Reconstruction with Disentangled Illumination

A unified feed-forward pipeline for relightable 3D reconstruction from sparse views in under one second.

jan-niklas-dihlmann

• Jan 23, 2026 • 1 min read

Diffusion

ReSWD: ReSTIR‘d, not shaken. Combining Reservoir Sampling and Sliced Wasserstein Distance for Variance Reduction.

Distribution matching is central to many vision and graphics tasks, where the widely used Wasserstein distance is too costly to compute for high dimensional distributions. The …

Mark Boss

• Oct 2, 2025 • 1 min read

Video Diffusion

SViM3D: Stable Video Material Diffusion for Single Image 3D Generation

We present Stable Video Materials 3D (SViM3D), a framework to predict multi-view consistent physically based rendering (PBR) materials, given a single image. Recently, video …

andreas-engelhardt

• Sep 30, 2025 • 1 min read

Diffusion

MARBLE: Material Recomposition and Blending in CLIP-Space

Editing materials of objects in images based on exemplar images is an active area of research in computer vision and graphics. We propose MARBLE, a method for performing material …

ta-ying-cheng

• Jun 5, 2025 • 1 min read

Novel View Synthesis

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

We present Stable Virtual Camera (Seva), a generalist diffusion model that creates novel views of a scene, given any number of input views and target cameras. Existing works …

jensen-zhou

• Mar 19, 2025 • 1 min read

Large Reconstruction Model

SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images

We study the problem of single-image 3D object reconstruction. Recent works have diverged into two directions: regression-based modeling and generative modeling. Regression methods …

zixuan-huang

• Jan 8, 2025 • 1 min read

No results found

Mark Boss