Informacja o cookies

Zgadzam się Nasza strona zapisuje niewielkie pliki tekstowe, nazywane ciasteczkami (ang. cookies) na Twoim urządzeniu w celu lepszego dostosowania treści oraz dla celów statystycznych. Możesz wyłączyć możliwość ich zapisu, zmieniając ustawienia Twojej przeglądarki. Korzystanie z naszej strony bez zmiany ustawień oznacza zgodę na przechowywanie cookies w Twoim urządzeniu.

Publikacje Pracowników Politechniki Lubelskiej

MNiSW
100
Lista 2023
Status:
Autorzy: Plaksyvyi Arsen, Skublewska-Paszkowska Maria, Powroźnik Paweł
Dyscypliny:
Aby zobaczyć szczegóły należy się zalogować.
Rok wydania: 2023
Wersja dokumentu: Elektroniczna
Język: angielski
Numer czasopisma: 6
Wolumen/Tom: 17
Strony: 127 - 139
Web of Science® Times Cited: 0
Scopus® Cytowania: 0
Bazy: Web of Science | Scopus | BazTech
Efekt badań statutowych NIE
Materiał konferencyjny: NIE
Publikacja OA: TAK
Licencja:
Sposób udostępnienia: Witryna wydawcy
Wersja tekstu: Ostateczna wersja opublikowana
Czas opublikowania: W momencie opublikowania
Data opublikowania w OA: 21 listopada 2023
Abstrakty: angielski
Segmentation is one of the image processing techniques, widely used in computer vision, to extract various types of information represented as objects or areas of interest. The development of neural networks has influenced image processing techniques, including creation of new ways of image segmentation. The aim of this study is to compare classical algorithms and deep learning methods in RGB image segmentation tasks. Two hypotheses were put forward: (1) The quality of segmentation applying deep learning methods is higher than using classical methods for RGB images, and (2) The increase of the RGB image resolution has positive impact on the segmentation quality. Two traditional segmentation algorithms (Thresholding and K-means) were compared with deep learning approach (U-Net, SegNet and FCN 8) to verify RGB segmentation quality. Two resolutions of images were taken into consideration: 160x240 and 320x480 pixels. Segmentation quality for each algorithm was estimated based on four parameters: Accuracy, Precision, Recall and Sorensen-Dice ratio (Dice score). In the study the Carvana dataset, containing 5,088 high-resolution images of cars, was applied. The initial set was divided into training, validation and test subsets as 60%, 20%, 20%, respectively. As a result, the best Accuracy, Dice score and Recall for images with resolution 160x240 were obtained for U-Net, achieving 99.37%, 98.56%, and 98.93%, respectively. For the same resolution the highest Precision 98.19% was obtained for FCN-8 architecture. For higher resolution, 320x480, the best mean Accuracy, Dice score, and Precision were obtained for FCN-8 network, reaching 99.55%, 99.95% and 98.85%, respectively. The highest results for classical methods were obtained for Threshold algorithm reaching 80.41% Accuracy, 58.49% Dice score, 67.32% Recall and 52.62% Precision. The results confirm both hypotheses.