SOFTWARE IMPLEMENTATION USING TRANSFORMER WITH OPTICAL FLOW AND GEONET FOR IDENTIFYING PARAMETERS OF DYNAMIC OBJECTS
DOI:
https://doi.org/10.20998/2079-0023.2024.02.13Keywords:
Remote identification of dynamic objects, object detection, optical flow, velocity identification, deep learning, convolutional neural networksAbstract
Today, interdisciplinary research in computer science and engineering has become increasingly relevant due to the growing demand for real-time data processing in object detection and tracking applications. The identification of dynamic object parameters plays a crucial role in various domains such as autonomous transportation systems, robotics, and surveillance. Effective automated acquisition and processing of video data represent a promising field for scientists and practitioners working in these interconnected disciplines. This research aims to enhance object detection and tracking processes by developing and implementing an information technology solution based on modern machine learning methods, including DETR (Detection Transformer), Optical Flow, and GeoNet. The research methodology involves designing software using Python programming language and modern libraries and frameworks for image and video processing. The DETR method was employed for precise object detection within video frames, Optical Flow was used to determine the direction and velocity of object movement, and GeoNet provided depth and geometric scene analysis. The proposed technology was tested on diverse video recordings depicting complex scenarios with dynamic conditions, such as varying lighting, object occlusions, and rapid motion changes. The results demonstrate the high accuracy and reliability of the proposed approach for identifying dynamic object parameters under various conditions. The integration of these methods significantly improved the precision and robustness of the detection and tracking system, particularly in challenging environments or low-quality video scenarios. The study concludes that the proposed information technology is effective and can be applied in practical fields such as autonomous systems, robotics, and video surveillance.
References
Wang Z., Turko R., Shaikh O., Park H., Das N., Hohman F., Kahng M., Chau D. CNN Explainer: Learning Convolutional Neural Networks with Interactive Visualization. URL: https://arxiv.org/abs/2004.15004 (accessed: 06.11.2024).
Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A., Kaiser L., Polosukhin I. Attention Is All You Need. URL: https://arxiv.org/abs/1706.03762 (accessed: 06.11.2024).
Carion N., Massa F., Synnaeve G., Usunier N., Kirillov A., Zagoruyko S. End-to-End Object Detection with Transformers. URL: https://arxiv.org/abs/2005.12872v3 (accessed: 06.11.2024).
Zou Z., Chen K., Shi Z., Shi Z., Guo Y., Ye J. Object
Detection in 20 Years: A Survey. URL: https://arxiv.org/pdf/1905.05055.pdf?fbclid=IwAR0ILGAWTwU-9-iH6lZyPFXYXA5JRWarM_XoSJ78QEhmnn-txvr_iGEzCio (accessed: 06.11.2024).
Ammar A., Chebbah A., Fredj H., Souani C. Comparative Study of latest CNN based Optical Flow Estimation. URL: https://ieeexplore.ieee.org/document/9806070/references#references. (accessed: 06.11.2024).
Yin Z., Shi J. GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose. URL: https://arxiv.org/abs/1803.02276v2 (accessed: 06.11.2024).
Girshick R., Donahue J., Darrell T., and Malik J. Region-based convolutional networks for accurate object detection and segmentation. IEEE transactions on pattern analysis and machine intelligence. 2016, vol. 38, no. 1, pp. 142–158.
Nikulina O. M., Severin V. P., Kondratov O. М, Olhovoy O. M. Models of remote identification of parameters of dynamic objects using detection transformers and optical flow. Vestnik Nats. tekhn. un-ta "KhPI": sb. nauch. tr. Temat. vyp.: Sistemnyy analiz, upravlenie i informatsionnye tekhnologii [Bulletin of the National Technical University "KhPI": a collection of scientific papers. Thematic issue: System analysis, management and information technology]. Kharkiv, NTU "KhPI" Publ., no. 1 (11), pp. 52–57.
Nikulina O. M., Kondratov O. М. Dynamic object parameter identification model using DEtection TRansformer and Optical Flow. Information technologies: science, technology, technology, education, health: Abstracts of reports of XXII international scientific and practical conference MicroCAD-2024, May 22–24, 2024, Kharkiv, NTU "KhPI", 2024, p. 1047.
Nikulina O. M., Kondratov O. М. Methods of remote identification of dynamic object parameters. Information technologies: science, technology, technology, education, health: Abstracts of reports of the XXI international scientific and practical conference MicroCAD-2023, May 17-20, 2023, Kharkiv, NTU "KhPI", 2023, p. 1047.
Gracyk A., Chen X. GeONet: a neural operator for learning the Wasserstein geodesic. URL: https://arxiv.org/abs/2209.14440 (accessed: 06.11.2024).
Inomata T., Kimura K., Hagiwara M. Object Tracking and Classification System Using Agent Search. URL: https://www.jstage.jst.go.jp/article/ieejeiss/129/11/129_11_2065/_pdf/-char/ja (accessed: 06.11.2024).
Gavrylenko S., Chelak V., Hornostal O. Construction Method Of Fuzzy Decision Trees For Identification The Computer System State. 2022 XXXII International Scientific Symposium Metrology and Metrology Assurance (MMA). 2022, pp. 1–5.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).