ITMO
ru/ ru

ISSN: 1023-5086

ru/

ISSN: 1023-5086

Scientific and technical

Opticheskii Zhurnal

A full-text English translation of the journal is published by Optica Publishing Group under the title “Journal of Optical Technology”

Article submission Подать статью
Больше информации Back

DOI: 10.17586/1023-5086-2021-88-12-17-27

УДК: 612.821

Segmentation of visual images: experimental data and modeling

For Russian citation (Opticheskii Zhurnal):

Бондарко В.М., Данилова М.В., Чихман В.Н. Сегментация зрительных изображений: экспериментальные данные и моделирование // Оптический журнал. 2021. Т. 88. № 12. С. 17–27. http://doi.org/10.17586/1023-5086-2021-88-12-17-27

 

Bondarko V.M., Danilova M.V., Chikhman V.N. Segmentation of visual images: experimental data and modeling [in Russian] // Opticheskii Zhurnal. 2021. V. 88. № 12. P. 17–27. http://doi.org/10.17586/1023-5086-2021-88-12-17-27

For citation (Journal of Optical Technology):

V. M. Bondarko, M. V. Danilova, and V. N. Chikhman, "Segmentation of visual images: experimental data and modeling," Journal of Optical Technology. 88(12), 692-699 (2021). https://doi.org/10.1364/JOT.88.000692

Abstract:

Experimental data on the segmentation of simple geometrical figures surrounded by a frame are compared for the first time to our knowledge with the results of a model of modules that filters the images in local areas of the visual field. It is shown in the experiments that images are perceived as two separate images when they reach a definite spacing between them, depending on their size and shape. When they are small, the distances are comparable with the optical point-spread function and the size of the highest-frequency receptive fields of the neurons in the primary visual cortex, and, when they are large, with the size of modules that optimally describe the images (with the maximum energy conservation in the images when there is a limited number of filters). The images are segmented when the second image (the frame) goes beyond the limits of the module. The results were confirmed by our earlier data on the study of the Oppel–Kundt illusion and by estimating the width of the spatial intervals.

Keywords:

segmentation, modeling, spatial frequency analysis, model of modules, painting frame, K. Malevich’s "Black Square"

Acknowledgements:

This study was supported by the State Program 47 GP “Scientific and Technological Development of the Russian Federation” (2019-2030), theme  0134-2019-0005.

OCIS codes: 330.7326 330.4060 330.5510

References:

1. V. R. Lutsiv, “Convolutional deep-learning artificial neural networks,” J. Opt. Technol. 82(8), 499–508 (2015) [Opt. Zh. 82(8), 11–23 (2015)].
2. A. K. Tsytsulin, A. I. Bobrovski, A. V. Morozov, V. A. Pavlov, and M. A. Galeeva, “Using convolutional neural networks to automatically select small artificial space objects on optical images of a starry sky,” J. Opt. Technol. 86(10), 627–633 (2019) [Opt. Zh. 86(10), 30–38 (2019).
3. Y. Xie, F. Zhu, and Y. Fu, “Main-secondary network for defect segmentation of textured surface images,” in IEEE Winter Conference on Applications of Computer Vision (WACV) (2020), pp. 3531–3540.
4. J. Beck, “Textural segmentation, second-order statistics, and textural elements,” Biol. Cybern. 48, 125–130 (1983).
5. B. Julesz, “A theory of preattentive texture discrimination based on first-order statistics of textons,” Biol. Cybern. 41, 131–138 (1981).
6. B. Julesz, “Textons, the elements of texture perception, and their interactions,” Nature 290, 91–97 (1981).
7. B. Julesz, Towards an Axiomatic Theory of Preattentive Vision: Dynamic Aspects of Neocortical Function (Wiley, Toronto, 1984), pp. 585–612.
8. B. Julesz, “Texton gradients: the texton theory revisited,” Biol. Cybern. 54, 245–251 (1986).
9. I. Fogel and D. Sagi, “Gabor filters as texture discriminator,” Biol. Cybern. 61, 103–113 (1989).
10. M. S. Landy and J. R. Bergen, “Texture segregation and orientation gradient,” Vision Res. 31, 679–691 (1991).
11. J. Beck, A. Sutter, and R. Ivry, “Spatial frequency channels and perceptual grouping in texture perception,” Comput. Vision Graphics Image Process. 37, 299–325 (1987).
12. A. C. Bovik, M. Clark, and W. S. Geisler, “Multichannel texture analysis using localized spatial filters,” IEEE Trans. Pattern Anal. Mach. Intell. 12, 55–73 (1990).
13. J. Malik and P. Perona, “Preattentive texture discrimination with early vision mechanisms,” J. Opt. Soc. Am. A 7, 923–932 (1990).
14. M. R. Turner, “Texture discrimination by Gabor functions,” Biol. Cybern. 55, 71–82 (1986).
15. E. Arsenault, A. Yoonessi, and C. Baker, “Higher-order texture statistics impair contrast boundary segmentation,” J. Vision 11(10):14, 1–15 (2011).
16. M. J. Prakash, S. Kezia, I. S. Prabha, and V. V. Kumar, “A new approach for texture segmentation using gray-level textons,” J. Int. Signal Process. Image Process. Pattern Recognit. 6(3), 81–90 (2013).
17. G. Blasdel and G. Salama, “Voltage-sensitive dyes reveal a modular organization in monkey striate cortex,” Nature 321, 579–585 (1986).
18. V. M. Bondarko and M. V. Danilova, “Estimating the integrity of visible objects as a function of their size,” Fiziol. Chel. 22(3), 65–70 (1996).
19. Yu. E. Shelepin, V. M. Bondarko, and M. V. Danilova, “Structure of the foveola and a model of the pyramidal organization of the visual system,” Sens. Sist. 9(1), 86–96 (1995).
20. V. M. Bondarko and M. V. Danilova, “The estimation of the size of visible objects of various shapes depends on the distance between the objects and the conditions under which they are presented,” Sens. Sist. 11(3), 300–311 (1997).
21. J. Atkinson, “Review of human visual development: crowding and dyslexia,” in Vision and Visual Dysfunction: Dyslexia, J. F. Stein, ed. (1991), vol. 13, pp. 44–57.
22. M. V. Danilova and V. M. Bondarko, “Foveal contour interactions and crowding effects at the resolution limit of the visual system,” J. Vision 7(2):25, 1–18 (2007).
23. Y. E. Shelepin and V. M. Bondarko, “Resolving ability and image discretization in the visual system,” Neurosci. Behav. Physiol. 34, 147–157 (2004).
24. V. D. Glezer, Vision and Mind (Nauka, Leningrad, 1985).
25. I. A. Vol, “Spatial–frequency model of hyperacuity of the visual system,” Sens. Sist. 2(2), 133–138 (1988).
26. N. A. Kaliteevsky, V. E. Semenov, V. D. Glezer, and V. E. Gauselman, “Algorithm of invariant image description by the use of a modified Gabor transform,” Appl. Opt. 33(23), 5256–5261 (1994).
27. S. V. Alyukov, “Approximation of step functions in problems of mathematical modeling,” Math Models Comput. Simul. 3, 661 (2011) [Mat. Model. 23(3), 75–88 (2011)].
28. W. Rudin, Principles of Mathematical Analysis (R. R. Donnelley & Sons, Chicago, 1976; Mir, Moscow, 1976).
29. G. E. Shilov, Mathematical Analysis (GIFML, Moscow, 1961).
30. V. D. Glezer, V. V. Yakovlev, and V. E. Gauselman, “Harmonic basis function for spatial coding in the cat striate cortex,” Visual Neurosci. 3, 351–383 (1989).
31. O. V. Zhukova, E. Yu, Malakhova, and Yu. E. Shelepin, “La Gioconda and the indeterminacy of smile recognition by a person and by an artificial neural network,” J. Opt. Technol. 86(11), 706–715 (2019) [Opt. Zh. 86(11), 40–50 (2019)].
32. E. Yu and Malakhova, “Information representation space in artificial and biological neural networks,” J. Opt. Technol. 87(10), 598–603 (2020) [Opt. Zh. 87(10), 50–58 (2020)].
33. M. A. Titarenko and R. O. Malashin, “Image enhancement by deep neural networks using high-level information,” J. Opt. Technol. 87(10), 604–610 (2020) [Opt. Zh. 87(10), 59–68 (2020)].
34. O. V. Zhukova and P. P. Vasil’ev, “Reconstruction of a neural network and alteration of the operators’ strategies during the recognition of facial images,” J. Opt. Technol. 87(10), 581–589 (2020) [Opt. Zh. 87(10), 25–37 (2020)].
35. A. K. Kharauzov, Yu. E. Shelepin, O. V. Tsvetkov, O. V. Zhukova, and S. V. Pronin, “Methods of masking threatening images and detecting electrophysiological indicators of their unconscious perception,” J. Opt. Technol. 87(10), 611–618 (2020) [Opt. Zh. 87(10), 69–80 (2020)].
36. Yu. E. Shelepin, A. K. Kharauzov, O. V. Zhukova, S. V. Pronin, M. S. Kupriyanov, and O. V. Tsvetkov, “Masking and detection of hidden signals in dynamic images,” J. Opt. Technol. 87(10), 624–632 (2020) [Opt. Zh. 87(10), 89–102 (2020)].
37. O. Ronneberger, P. Fischer, and T. Brox, “U-net: convolutional networks for biomedical image segmentation,” Lect. Notes Comput. Sci. 9351, 234–241 (2015).
38. R. O. Malashin, “Principle of least action in dynamically configured image-analysis systems,” J. Opt. Technol. 86(11), 678–685 (2019) [Opt. Zh. 86(11), 5–13 (2019)].
39. V. M. Bondarko, “Comparison of the metrics specified on image-feature space with errors in recognizing these images,” Sens. Sist. 3(1), 56–68 (1989).
40. F. V. Campbell and Yu. E. Shelepin, “Possibilities of the foveola in various objects,” Sens. Sist. 4(2), 181–185 (1990).
41. O. A. Vakhrameeva, M. V. Sukhinin, G. A. Moiseenko, S. V. Murav’eva, S. V. Pronin, V. V. Volkov, and Yu. E. Shelepin, “Study of recognition thresholds as a function of the geometry of the fovea,” Sens. Sist. 27(2), 122–129 (2013).
42. V. Chikhman, Y. Shelepin, and S. Pronin, “Experimental study of invariant perception of wavelet images,” J. Opt. Technol. 78(12), 803–807 (2011) [Opt. Zh. 78(12), 50–56 (2011)].
43. P. J. Burt, “Fast filter transforms for image processing,” Comput. Graphics Image Process. 16, 20–51 (1981).
44. A. B. Watson and A. J. Ahumada, “A hexagonal orthogonal-oriented pyramid as a model of image representation in visual cortex,” IEEE Trans. Biomed. Eng. 36(1), 97–106 (1989).
45. V. M. Bondarko, V. E. Gauzel’man, and V. D. Glezer, “Mechanisms of the visual detection and recognition of differences,” Fiziol. Chel. 9(8), 496–498 (1983).
46. H. R. Wilson, D. K. McFarlane, and G. C. Phillips, “Spatial-frequency tuning of orientation-selective units estimated by oblique masking,” Vision Res. 23(9), 873–882 (1983).
47. M. C. Flom, “Contour interaction and the crowding effect,” Probl. Optom. 3(2), 237–257 (1991).
48. M. C. Flom, F. W. Weymouth, and D. Kahneman, “Visual resolution and contour interaction,” J. Opt. Soc. Am. 53, 1026–1032 (1963).

49. A. Bulatov and A. Bertulis, “Distortions of length perception,” Biol. Cybern. 80, 185–193 (1999).
50. V. M. Bondarko, D. V. Bondarko, S. D. Solnushkin, and V. N. Chikhman, “Modeling of optical illusions,” J. Opt. Technol. 85(8), 448–454 (2018) [Opt. Zh. 85(8), 5–12 (2018)].
51. V. M. Bondarko and M. V. Danilova, “Spatial interval discrimination in the presence of flanking lines,” Spat. Vision 12(2), 239–253 (1999).
52. R. F. Hess and D. R. Badcock, “Metric for separation discrimination by the human visual system,” J. Opt. Soc. Am. A 12, 3–16 (1995).