Stable-Layers: Fine-Tuning Image Layer Decomposition Models with VLM-Scored Reinforcement Learning
An RL framework that fine-tunes image layer decomposition models using VLM-as-judge rewards, eliminating paired supervision.
ciara-rowles
An RL framework that fine-tunes image layer decomposition models using VLM-as-judge rewards, eliminating paired supervision.
A rotation-preconditioned KV cache codec that jointly quantizes coordinate triplets via an octahedral map, achieving state-of-the-art compression across text, video, and audio …
Distribution matching is central to many vision and graphics tasks, where the widely used Wasserstein distance is too costly to compute for high dimensional distributions. The …
Creating plausible surfaces is an essential component in achieving a high degree of realism in rendering. To relieve artists, who create these surfaces in a time-consuming, manual …