Jinman Park
June 6th, 2025 – 12:00-12:30pm, EC4-2101A
This talk explores a lightweight, superpixel-based approach to salient object detection, the task of segmenting the most visually prominent regions in an image. Whereas traditional methods rely on dense pixel-level computation, we introduce SuperFormer, a vision transformer tailored for superpixel inputs. Our work addresses the challenges of superpixel heterogeneity, positional encoding, and pre-training, achieving state-of-the-art results on multiple benchmarks at significantly reduced computational cost.