Automatic Annotation of Training Datasets in Computer Vision using Machine Learning Methods

Main Article Content

Abstract

This paper addresses the issue of automatic annotation of training datasets in the field of computer vision using machine learning methods. Data annotation is a key stage in the development and training of deep learning models, yet the process of creating labeled data often requires significant time and labor. This paper proposes a mechanism for automatic annotation based on the use of convolutional neural networks (CNN) and active learning methods.


The proposed methodology includes the analysis and evaluation of existing approaches to automatic annotation. The effectiveness of the proposed solutions is assessed on publicly available datasets. The results demonstrate that the proposed method significantly reduces the time required for data annotation, although operator intervention is still necessary.


The literature review includes an analysis of modern annotation methods and existing automatic systems, providing a better understanding of the context and advantages of the proposed approach. The conclusion discusses achievements, limitations, and possible directions for future research in this field.

Article Details

References

1. Council J. Data challenges are halting AI projects, IBM executive says // The Wall Street Journal. 2019. Vol. 28.
2. LabelImg for Image Annotation. URL: https://viso.ai/computer-vision/labelimg-for-image-annotation/.
3. VGG Image Annotator. URL: https://www.robots.ox.ac.uk/~vgg/software/via/via_demo.html.
4. Everingham M. et al. The pascal visual object classes challenge: A retrospective // International Journal of Computer Vision. 2015. Vol. 111. P. 98–136.
5. Berg A. et al. Semi-automatic annotation of objects in visual-thermal video // Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2019. https://doi.org/10.1109/ICCVW.2019.00277
6. Sager C., Janiesch C., Zschech P. A survey of image labelling for computer vision applications // Journal of Business Analytics. 2021. Vol. 4, No. 2. P. 91–110.
7. Cao J., Zhao A., Zhang Z. Automatic image annotation method based on a convolutional neural network with threshold optimization // Plos one. 2020. V. 15, No. 9. e0238956. https://doi.org/10.1371/journal.pone.0238956
8. Vatani A., Ahvanooey M.T., Rahimi M. An effective automatic image annotation model via attention model and data equilibrium // arXiv preprint arXiv:2001.10590. 2020.
9. Gu Y. et al. Automatic lung nodule detection using a 3D deep convolutional neural network combined with a multi-scale prediction strategy in chest CTs // Computers in Biology and Medicine. 2018. Vol. 103. P. 220–231.
10. Levine S. et al. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection // The International Journal of Robotics Research. 2018. Vpl. 37, No. 4-5. P. 421–436.
11. Kirillov A. et al. Segment anything // Proceedings of the IEEE/CVF International Conference on Computer Vision. 2023. P. 4015–4026.
12. Zou X. et al. Segment everything everywhere all at once // Advances in Neural Information Processing Systems. NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems. Article No. 868. P.19769–19782. https://dl.acm.org/doi/10.5555/3666122.3666990
13. Ultralytics YOLOv8 Docs. URL: https://docs.ultralytics.com/ru.
14. COCO Dataset. URL: https://cocodataset.org/#home.
15. Cityscapes Dataset. URL: https://www.cityscapes-dataset.com/.
16. Auto-Label. URL: https://roboflow.com/auto-label.