Перегляд за Автор "Yavorskyi, Oleksandr"
Зараз показуємо 1 - 2 з 2
Результатів на сторінці
Налаштування сортування
Документ Відкритий доступ Comparative analysis of classification techniques for topic-based biomedical literature categorisation(Frontiers Media S.A., 2023) Stepanov, Ihor; Ivasiuk, Arsentii; Yavorskyi, Oleksandr; Frolova, AlinaIntroduction: Scientific articles serve as vital sources of biomedical information, but with the yearly growth in publication volume, processing such vast amounts of information has become increasingly challenging. This difficulty is particularly pronounced when it requires the expertise of highly qualified professionals. Our research focused on the domain-specific articles classification to determine whether they contain information about drug-induced liver injury (DILI). DILI is a clinically significant condition and one of the reasons for drug registration failures. The rapid and accurate identification of drugs that may cause such conditions can prevent side effects in millions of patients. Methods: Developing a text classification method can help regulators, such as the FDA, much faster at a massive scale identify facts of potential DILI of concrete drugs. In our study, we compared several text classification methodologies, including transformers, LSTMs, information theory, and statistics-based methods. We devised a simple and interpretable text classification method that is as fast as Naïve Bayes while delivering superior performance for topic-oriented text categorisation. Moreover, we revisited techniques and methodologies to handle the imbalance of the data. Results: Transformers achieve the best results in cases if the distribution of classes and semantics of test data matches the training set. But in cases of imbalanced data, simple statistical-information theory-based models can surpass complex transformers, bringing more interpretable results that are so important for the biomedical domain. As our results show, neural networks can achieve better results if they are pre-trained on domain-specific data, and the loss function was designed to reflect the class distribution. Discussion: Overall, transformers are powerful architecture, however, in certain cases, such as topic classification, its usage can be redundant and simple statistical approaches can achieve compatible results while being much faster and explainable. However, we see potential in combining results from both worlds. Development of new neural network architectures, loss functions and training procedures that bring stability to unbalanced data is a promising topic of development.Документ Відкритий доступ Persistent Homology in Machine Learning: Applied Sciences Review(2023) Yavorskyi, Oleksandr; Asseko-Nkili, Andrii; Kussul, NataliiaTopological Data Analysis (‘TDA’) has become a vibrant and quickly developing field in recent years, providing topology-enhanced data processing and Machine Learning (‘ML’) applications. Due to the novelty of the field, as well as the dissimilarity between the mathematics behind the classical ML and TDA, it might be complicated for a field newcomer to assess the feasibility of the approaches proposed by TDA and the relevancy of the possible applications. The current paper aims to provide an overview of the recent developments that relate to persistent homology, a part of the mathematical machinery behind the TDA, with a particular focus on applied sciences. We consider multiple areas, such as physics, healthcare, material sciences, and others, examining the recent developments in the field. The resulting summary of this paper could be used by field experts to expand their knowledge on recent persistent homology applications, while field newcomers could assess the applicability of this TDA approach for their research. We also point out some of the current restrictions on the use of persistent homology, as well as potential development trajectories that might be useful to the whole field.