Deep Waltz: Neuro-Symbolic Line Drawing Interpretation via Learned Perception and Constraint Satisfaction

Main Article Content

Karthikan Gurumoorthy

Abstract

Interpreting three-dimensional structure from two- dimensional line drawings remains a fundamental challenge in computer vision, cognitive science, and artificial intelligence. Classical symbolic approaches based on constraint-driven junction labelling provide strong geometric interpretability but are highly sensitive to noise, fragmented lines, and missing segments. In contrast, modern deep learning methods are effective at detecting edges, junctions, and local geometric patterns under real-world conditions, yet often lack global consistency, interpretability, and enforcement of physically plausible structural relationships. These limitations motivate hybrid neuro-symbolic approaches that combine learned perception with symbolic reasoning. In this work, we present Deep Waltz, a hybrid vision framework that integrates a compact CNN-based neural refinement module with a Waltz- style constraint satisfaction solver. The proposed pipeline performs end to-end processing from raw images to globally consistent symbolic interpretations, including edge detection, line segment extraction, junction detection, CNN-based patch classification, and constraint-based global inference using legal junction-label tables. An EM-like iterative training scheme is introduced, in which CSP-inferred labels serve as pseudo-labels to refine the neural components and progressively improve global coherence. Experiments on synthetic polyhedral scenes, hand-drawn sketches, and real-image edge maps demonstrate that Deep Waltz substantially improves junction classification accuracy, legal labelling rates, and structural reconstruction quality compared to symbolic-only and neural-only baselines. These results indicate that the proposed framework provides a robust, interpretable, and reproducible solution for structural scene understanding from line drawings.

Downloads

Download data is not yet available.

Article Details

Section

Articles

How to Cite

[1]
Karthikan Gurumoorthy, “Deep Waltz: Neuro-Symbolic Line Drawing Interpretation via Learned Perception and Constraint Satisfaction”, IJSCE, vol. 15, no. 6, pp. 17–20, Jan. 2026, doi: 10.35940/ijsce.B1038.15060126.

References

Y. Liu and S. Navathe, “A survey on structural scene understanding and line drawing interpretation,” ACM Computing Surveys, vol. 54, no. 8, pp. 1–32, 2021. DOI: https://doi.org/10.1145/3469845

B. P. Bhuyan, “Neuro-symbolic artificial intelligence: a survey,” Neural Computing and Applications, 2024.

DOI: https://doi.org/10.1007/s00521-024-09960-z

S. Ramalingam and P. Sturm, “A unifying model for line and junction interpretation in images,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 6, pp. 1105–1121, 2017. DOI: https://doi.org/10.1109/TPAMI.2016.2599178

D. DeTone, T. Malisiewicz, and A. Rabinovich, “SuperPoint: Self- Supervised Interest Point Detection and Description,” in Proc. CVPR Workshops, 2018. URL: https://arxiv.org/abs/1712.07629

N. Xue, B. Xiong, and X. Qian, “Learning attraction field representation for robust line segment detection,” in Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. DOI: https://doi.org/10.1109/CVPR.2019.01234

Y. Xu, W. Xu, D. Cheung, and Z. Tu, “Line Segment Detection Using Transformers Without Edges,” in Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021. DOI: https://doi.org/10.1109/CVPR46437.2021.01235

J. Mao, C. Gan, P. Kohli, J. B. Tenenbaum, and J. Wu, “The Neuro- Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences,” in Proc. International Conference on Learning Representations (ICLR), 2019. URL: https://arxiv.org/abs/1904.12584

Most read articles by the same author(s)

1 2 3 4 5 6 7 > >>