A Comparative Analysis of Image Segmentation Using Classical and Deep Learning Approach
Artykuł w czasopiśmie
MNiSW
100
Lista 2023
Status: | |
Autorzy: | Plaksyvyi Arsen, Skublewska-Paszkowska Maria, Powroźnik Paweł |
Dyscypliny: | |
Aby zobaczyć szczegóły należy się zalogować. | |
Rok wydania: | 2023 |
Wersja dokumentu: | Elektroniczna |
Język: | angielski |
Numer czasopisma: | 6 |
Wolumen/Tom: | 17 |
Strony: | 127 - 139 |
Impact Factor: | 1,0 |
Web of Science® Times Cited: | 2 |
Scopus® Cytowania: | 4 |
Bazy: | Web of Science | Scopus | BazTech |
Efekt badań statutowych | NIE |
Materiał konferencyjny: | NIE |
Publikacja OA: | TAK |
Licencja: | |
Sposób udostępnienia: | Witryna wydawcy |
Wersja tekstu: | Ostateczna wersja opublikowana |
Czas opublikowania: | W momencie opublikowania |
Data opublikowania w OA: | 21 listopada 2023 |
Abstrakty: | angielski |
Segmentation is one of the image processing techniques, widely used in computer vision, to extract various types of information represented as objects or areas of interest. The development of neural networks has influenced image processing techniques, including creation of new ways of image segmentation. The aim of this study is to compare classical algorithms and deep learning methods in RGB image segmentation tasks. Two hypotheses were put forward: (1) The quality of segmentation applying deep learning methods is higher than using classical methods for RGB images, and (2) The increase of the RGB image resolution has positive impact on the segmentation quality. Two traditional segmentation algorithms (Thresholding and K-means) were compared with deep learning approach (U-Net, SegNet and FCN 8) to verify RGB segmentation quality. Two resolutions of images were taken into consideration: 160x240 and 320x480 pixels. Segmentation quality for each algorithm was estimated based on four parameters: Accuracy, Precision, Recall and Sorensen-Dice ratio (Dice score). In the study the Carvana dataset, containing 5,088 high-resolution images of cars, was applied. The initial set was divided into training, validation and test subsets as 60%, 20%, 20%, respectively. As a result, the best Accuracy, Dice score and Recall for images with resolution 160x240 were obtained for U-Net, achieving 99.37%, 98.56%, and 98.93%, respectively. For the same resolution the highest Precision 98.19% was obtained for FCN-8 architecture. For higher resolution, 320x480, the best mean Accuracy, Dice score, and Precision were obtained for FCN-8 network, reaching 99.55%, 99.95% and 98.85%, respectively. The highest results for classical methods were obtained for Threshold algorithm reaching 80.41% Accuracy, 58.49% Dice score, 67.32% Recall and 52.62% Precision. The results confirm both hypotheses. |