AUTOMATED OBJECT SEGMENTATION AND BACKGROUND REMOVAL IN MULTIMEDIA CONTENT PROCESSING SYSTEMS

Authors

DOI:

https://doi.org/10.20998/2079-0023.2026.01.07

Keywords:

multimedia content processing, automatic background removal, image segmentation, deep learning, alpha matting, BEN2 architecture

Abstract

In today's rapidly developing digital technologies, image processing occupies a key place in the fields of computer vision and graphics. One of the most popular tasks is the automatic separation of foreground objects from the background, which is of critical importance for web design, e-commerce, the advertising industry, and augmented reality systems. Traditional manual editing methods are laborious and dependent on user experience, especially when working with complex textures or fuzzy object boundaries. The development of deep learning methods make it possible to automate these processes, providing high accuracy and speed of processing in real time. The aim of the work is to analyze existing algorithms for automatic background removal and develop an effective methodology for object segmentation using modern computer vision models to improve the quality of multimedia content. The work uses a comprehensive approach based on the use of the hybrid neural network architecture BEN2. The key feature of the method is the implementation of an innovative Confidence Guided Matting pipeline, which implements a two-stage Bayesian approach: first, coarse segmentation (BEN Base) is performed to create a "draft" mask, and then targeted refinement of boundaries in areas of low confidence of the model (BEN Refiner). As part of the research, a software application based on the Streamlit framework was developed, which provides automated inference of the BEN2 model. The system supports loading images in popular graphic formats, batch data processing, and visualization of results using an interactive slider. The high efficiency of the algorithm was experimentally confirmed: for complex objects, the architecture provides accurate matting of hair and small details, minimizing artifacts such as "ragged edges". The processing workflow was optimized by using local caching of models and supporting acceleration on GPU/CPU. The proposed approach based on the BEN2 architecture and boundary refinement mechanisms demonstrated higher accuracy compared to classical one-stage segmentation methods. The developed system is scalable and suitable for integration into real multimedia processing information systems, which allows significantly reducing the time for preparing graphic content and increasing its professional level.

References

Weingerl P., Hladnik A., Dejana Javoršek D. Development of a machine learning model for extracting image prominent colors. Color Research & Application. 2020. No. 45(3), pp. 409–426. DOI: https://doi.org/10.1002/col.22485.

Gonzalez R. C., Woods R. E. Digital Image Processing. 4th ed. New York: Pearson, 2018. 1019 p.

Szeliski R. Computer Vision: Algorithms and Applications (2nd Edition). Springer, 2022. 1232 p.

Kovalenko S., Kovalenko S., Mikhnova O., Kovalenko A., Pelikh D., Severin V., "An Approach to Blood Cell Classification Based on Object Segmentation and Machine Learning," 2023 IEEE 4th KhPI Week on Advanced Technology (KhPIWeek), Kharkiv, 2023, pp. 1–6. DOI: 10.1109/KhPIWeek61412.2023.10312903.

Chen J., Lu Y., Yu Q., Luo X., Adeli E., Wang Y., Lu L., Yuille A. L., Zhou Y. TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation. arXiv, preprint arXiv:2102.04306, 2021. 13 p. DOI: 10.48550/arXiv.2102.04306.

Kovalenko S. M., Kutsenko O. S., Kovalenko S. V., Kovalenko A. S. Approach to the automatic creation of an annotated dataset for the detection, localization and classification of blood cells in an image. Radio Electronics, Computer Science, Control. 2024, no. 1, pp. 128– 139. DOI: 10.15588/1607-3274-2024-1-12.

Kovalenko A., Severyn V. Optimization of the architecture and hyperparameters of the U-Net model to improve the quality of biological objects segmentation. Journal of the Ternopil National Technical University. 2025, vol 120, no. 4, pp. 21–31. DOI: https://doi.org/10.33108/visnyk_tntu2025.04.021.

Dosovitskiy, A. An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York City, 2021, pp. 45–67.

Khanorkar, A., Pawar, B., Singh, D., Dhanbhar, K., Mangrulkar, N. Background removal of video in realtime. 3C TIC. Cuadernos de desarrollo aplicados a las TIC. 2022, vol. 11, no. 2, pp. 195–206. DOI: 10.17993/3ctic.2022.112.195-206.

Meyer M., Spruyt J. Ben: Using confidence-guided matting for dichotomous image segmentation. arXiv preprint arXiv:2501.06230, 2025. DOI: 10.48550/arXiv.2501.06230.

Meet BEN2, the AI Background Eraser That Actually Works And It Is Open Source. DigiAlps LTD. Available at: https://digialps.com/meetben2-the-ai-background-eraser-that-actually-works-and-it-is-opensource/ (accessed 24.02.2026).

Qin X., Dai H., Hu X., Fan D. P., Shao L., Van Gool L. Highly accurate dichotomous image segmentation. European Conference on Computer Vision. Cham, Springer Nature Switzerland. 2022, pp. 38–56. DOI: 10.1007/978-3-031-19797-0_3.

BEN2: Background Erase Network. Available at: https://huggingface.co/PramaLLC/BEN2 (accessed 24.02.2026).

Khorasani M., Abdou M., Fernández J. H. Streamlit for Web Development: Build and Scale Secure Python-Powered Apps with Streamlit. Apress Berkeley, CA, 2025. XXV, 413 p. DOI: https://doi.org/10.1007/979-8-8688-1826-4.

Downloads

Published

2026-05-20

How to Cite

Kovalenko, S., Kovalenko, S., Aleksandrova, T., & Malko, M. (2026). AUTOMATED OBJECT SEGMENTATION AND BACKGROUND REMOVAL IN MULTIMEDIA CONTENT PROCESSING SYSTEMS. Bulletin of National Technical University "KhPI". Series: System Analysis, Control and Information Technologies, (1 (15), 45–51. https://doi.org/10.20998/2079-0023.2026.01.07

Issue

Section

INFORMATION TECHNOLOGY