Abstract

We propose a layered statistical model for imagesegmentation and labeling obtained by combining independentlyextracted, possibly overlapping sets of figure-ground(FG) segmentations. The process of constructing consistentimage segmentations, called tilings, is cast as optimizationover sets of maximal cliques sampled from a graph connectingall non-overlapping figure-ground segment hypotheses.Potential functions over cliques combine unary, Gestaltbasedfigure qualities, and pairwise compatibilities amongspatially neighboring segments, constrained by T-junctionsand the boundary interface statistics of real scenes. Buildingon the segmentation layer, we further derive a joint imagesegmentation and labeling model (JSL) which, given a bagof FGs, constructs a joint probability distribution over boththe compatible image interpretations (tilings) composed fromthose segments, and over their labeling into categories. Theprocess of drawing samples from the joint distribution canbe interpreted as first sampling tilings, followed by samplinglabelings conditioned on the choice of a particular tiling.

Reference

Ion, A., Carreira, J., & Sminchisescu, C. (2013). Probabilistic Joint Image Segmentation and Labeling by Figure-Ground Composition. In International Journal of Computer Vision (pp. 40–57). Springer Science+Business Media New York 2013. https://doi.org/10.1007/s11263-013-0663-7