Факультет електроніки та інформаційних технологій (ЕлІТ)

Permanent URI for this communityhttps://devessuir.sumdu.edu.ua/handle/123456789/20

Browse

Search Results

Now showing 1 - 3 of 3

Model and Training Method of the Resilient Image Classifier Considering Faults, Concept Drift, and Adversarial Attacks
(MDPI, 2022) Москаленко, В`ячеслав Васильович; Москаленко, Вячеслав Васильевич; Moskalenko, Viacheslav Vasylovych; Kharchenko, V.; Москаленко, Альона Сергіївна; Москаленко, Алена Сергеевна; Moskalenko, Alona Serhiivna; Петров, Сергій Олександрович; Петров, Сергей Александрович; Petrov, Serhii Oleksandrovych
Modern trainable image recognition models are vulnerable to different types of perturbations; hence, the development of resilient intelligent algorithms for safety-critical applications remains a relevant concern to reduce the impact of perturbation on model performance. This paper proposes a model and training method for a resilient image classifier capable of efficiently functioning despite various faults, adversarial attacks, and concept drifts. The proposed model has a multi-section structure with a hierarchy of optimized class prototypes and hyperspherical class boundaries, which provides adaptive computation, perturbation absorption, and graceful degradation. The proposed training method entails the application of a complex loss function assembled from its constituent parts in a particular way depending on the result of perturbation detection and the presence of new labeled and unlabeled data. The training method implements principles of self-knowledge distillation, the compactness maximization of class distribution and the interclass gap, the compression of feature representations, and consistency regularization. Consistency regularization makes it possible to utilize both labeled and unlabeled data to obtain a robust model and implement continuous adaptation. Experiments are performed on the publicly available CIFAR-10 and CIFAR-100 datasets using model backbones based on modules ResBlocks from the ResNet50 architecture and Swin transformer blocks. It is experimentally proven that the proposed prototype-based classifier head is characterized by a higher level of robustness and adaptability in comparison with the dense layer-based classifier head. It is also shown that multi-section structure and self-knowledge distillation feature conserve resources when processing simple samples under normal conditions and increase computational costs to improve the reliability of decisions when exposed to perturbations.
Багатоетапний метод глибинного навчання з попереднім самонавчанням для класифікаційного аналізу дефектів стічних труб
(National Aerospace University "Kharkiv Aviation Institute", 2021) Москаленко, Владислав Вікторович; Москаленко, Владислав Викторович; Moskalenko, Vladyslav Viktorovych; Зарецький, Микола Олександрович; Зарецкий, Николай Александрович; Zaretskyi, Mykola Oleksandrovych; Москаленко, Альона Сергіївна; Москаленко, Алена Сергеевна; Moskalenko, Alona Serhiivna; Коробов, Артем Геннадійович; Коробов, Артем Геннадьевич; Korobov, Artem Hennadiiovych; Ковальський, Ярослав Юрійович; Ковальский, Ярослав Юрьевич; Kovalskyi, Yaroslav Yuriiovych
Розроблено багатоетапний метод машинного навчання з попереднім самонавчанням для класифікаційного аналізу дефектів на стінках стічних труб за зображеннями відеоінспекції. Об’єктом дослідження є процес розпізнавання дефектів на стінках стічних труб. Предметом дослідження є метод машинного навчання для класифікаційного аналізу дефетів стічних труб на зображеннях відеоінспекції за умов обмеженого та незбалансованого набору розмічених навчальних даних. Запропоновано п’ятиетапний алгоритм навчання класифікатора. На першому етапі відбувається контрастне навчання з використанням екземпляр-прототипної контрасної функції втрат, де для вимірювання схожості закодованих зразків використовується нормалізована відстань Евкліда. На другому етапі розглядаються два варіанти регуляризованої функції втрат – триплетна функція NCA та контрастноцентрована функція. Регуляризуюча складова на другому етапі навчання використовується для штрафування за помилку округлення вихідного вектора ознак до дискретного виду і забезпечує реалізацію інформаційного пляшкового горла. На наступному етапі здійснюється обчислення двійкового коду кожного класу для реалізації кодів, що виправляють помилки, але з урахуванням структури класів і відношень між їх ознаками. Отриманий еталонний вектор кожного класу є цільовою розміткою зображення для навчання з використанням крос-ентропійної функції втрат. Останній етап навчання здійснює оптимізацією параметрів вирішувальних правил за інформаційним критерієм для врахування дисперсії розподілу класів в двійковому просторі Хеммінга. Для порівняння результатів навчання на різних етапах та в рамках різного підходу використовується мікро-усереднена метрика F1, що обчислюється на тестових даних. Результати, отримані на відкритому наборі даних Sewer-ML, підтверджують придатність запропонованого методу навчання до практичного використання, отримане значення F1- метрики дорівнює 0,977. Було показано, що запропонований метод забезпечує збільшення значення мікро-усередненої метрики F1 на 9 % порівняно з результатами, отриманими традиційним методом навчання.
A model and training method for context classification in cctv sewer inspection video frames
(National University "Zaporizhzhia Polytechnic", 2021) Москаленко, В`ячеслав Васильович; Москаленко, Вячеслав Васильевич; Moskalenko, Viacheslav Vasylovych; Зарецький, Микола Олександрович; Зарецкий, Николай Александрович; Zaretskyi, Mykola Oleksandrovych; Москаленко, Альона Сергіївна; Москаленко, Алена Сергеевна; Moskalenko, Alona Serhiivna; Панич, Андрій Олександрович; Паныч, Андрей Александрович; Panych, Andrii Oleksandrovych; Lysyuk, V.V.
A model and training method for observational context classification in CCTV sewer inspection vide frames was developed and researched. The object of research is the process of detection of temporal-spatial context during CCTV sewer inspections. The subjects of the research are machine learning model and training method for classification analysis of CCTV video sequences under the limited and imbalanced training dataset constraint. Objective. Stated research goal is to develop an efficient context classifier model and training algorithm for CCTV sewer inspection video frames under the constraint of the limited and imbalanced labeled training set. Methods. The four-stage training algorithm of the classifier is proposed. The first stage involves training with soft triplet loss and regularisation component which penalises the network’s binary output code rounding error. The next stage is needed to determine the binary code for each class according to the principles of error-correcting output codes with accounting for intra- and interclass relationship. The resulting reference vector for each class is then used as a sample label for the future training with Joint Binary Cross Entropy Loss. The last machine learning stage is related to decision rule parameter optimization according to the information criteria to determine the boundaries of deviation of binary representation of observations for each class from the corresponding reference vector. A 2D convolutional frame feature extractor combined with the temporal network for inter-frame dependency analysis is considered. Variants with 1D Dilated Regular Convolutional Network, 1D Dilated Causal Convolutional Network, LSTM Network, GRU Network are considered. Model efficiency comparison is made on the basis of micro averaged F1 score calculated on the test dataset. Results. Results obtained on the dataset provided by Ace Pipe Cleaning, Inc confirm the suitability of the model and method for practical use, the resulting accuracy equals 92%. Comparison of the training outcome with the proposed method against the conventional methods indicated a 4% advantage in micro averaged F1 score. Further analysis of the confusion matrix had shown that the most significant increase in accuracy in comparison with the conventional methods is achieved for complex classes which combine both camera orientation and the sewer pipe construction features. Conclusions. The scientific novelty of the work lies in the new models and methods of classification analysis of the temporalspatial context when automating CCTV sewer inspections under imbalanced and limited training dataset conditions. Training results obtained with the proposed method were compared with the results obtained with the conventional method. The proposed method showed 4% advantage in micro averaged F1 score. It had been empirically proven that the use of the regular convolutional temporal network architecture is the most efficient in utilizing inter-frame dependencies. Resulting accuracy is suitable for practical use, as the additional error correction can be made by using the odometer data.

Факультет електроніки та інформаційних технологій (ЕлІТ)

Browse

Filters

Settings

Sort By

Results per page

Search Results