Papers
arxiv:2602.06507

FloorplanVLM: A Vision-Language Model for Floorplan Vectorization

Published on Feb 6
Authors:
,
,
,

Abstract

FloorplanVLM addresses complex floorplan vectorization by reformulating it as an image-conditioned sequence modeling task, achieving high structural validity through specialized training and evaluation frameworks.

Converting raster floorplans into engineering-grade vector graphics is challenging due to complex topology and strict geometric constraints. To address this, we present FloorplanVLM, a unified framework that reformulates floorplan vectorization as an image-conditioned sequence modeling task. Unlike pixel-based methods that rely on fragile heuristics or query-based transformers that generate fragmented rooms, our model directly outputs structured JSON sequences representing the global topology. This 'pixels-to-sequence' paradigm enables the precise and holistic constraint satisfaction of complex geometries, such as slanted walls and curved arcs. To support this data-hungry approach, we introduce a scalable data engine: we construct a large-scale dataset (Floorplan-2M) and a high-fidelity subset (Floorplan-HQ-300K) to balance geometric diversity and pixel-level precision. We then employ a progressive training strategy, using Supervised Fine-Tuning (SFT) for structural grounding and quality annealing, followed by Group Relative Policy Optimization (GRPO) for strict geometric alignment. To standardize evaluation on complex layouts, we establish and open-source FPBench-2K. Evaluated on this rigorous benchmark, FloorplanVLM demonstrates exceptional structural validity, achieving 92.52% external-wall IoU and robust generalization across non-Manhattan architectures.

Community

hello,

thank you for this piece of research.

Any plans to open-source/open-weights it?

Any date of release for the FPBENCH-2K layouts for evaluation?

on a separate note: Has anyone been able to replicate this on an open source / custom dataset ? I see cubicasa has been tried but the checkpoints give terrible hallucinations and do not come close to what this paper reports.

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2602.06507
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 6

Browse 6 models citing this paper

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2602.06507 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2602.06507 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.