A multifactor model for detecting propaganda in textual data

Gavrilenko, Olena; Feshchenko, Kyryl

A multifactor model for detecting propaganda in textual data

dc.contributor.author	Gavrilenko, Olena
dc.contributor.author	Feshchenko, Kyryl
dc.date.accessioned	2026-02-09T10:13:49Z
dc.date.available	2026-02-09T10:13:49Z
dc.date.issued	2025
dc.description.abstract	Detecting elements of propaganda in large volumes of textual data is currently one of the key tools in combating the information warfare taking place worldwide. This paper presents a multifactor model for determining the level of propaganda in a publication. The analyzed publications included text-based news articles and social media posts, which were processed using both quantitative and semantic text analysis methods. The model was constructed using the method of linear convolution, which enables the integration of multiple heterogeneous indicators into a unified value reflecting the degree of propaganda. The proposed model considers thirteen indicators, each of which, when exhibiting a high value, signals the potential presence of propaganda within a text. The indicators encompass lexical, syntactic, and semantic characteristics such as emotional tone, subjective evaluation, presence of manipulative triggers, and calls to action. The value of each indicator was calculated using methods of statistical analysis, intelligent data analysis, and machine learning. An algorithm for determining the influence level of each factor was proposed, as well as a scale for assessing the overall level of propaganda. For every analyzed publication, a utility function value was computed to quantify its propaganda intensity. The threshold value of this utility function – beyond which a publication is considered propagandistic – was defined as the sample mean across the dataset. This approach allows for an objective classification of textual materials without the need for expert labeling. The advantage of the developed method lies in the fact that each indicator is derived exclusively from empirical statistical data and validated computational procedures, ensuring the elimination of human subjectivity. The study demonstrates that the modified multifactor model can serve as a universal analytical tool for detecting propaganda in various types of textual data, thereby enhancing the transparency and reliability of media content analysis.
dc.description.abstractother	Виявлення елементів пропаганди в масивах текстових даних наразі є одним із основних засобів боротьби в інформаційній війні, яка відбувається в світі. В даній роботі представлено багатофакторну модель для визначення рівня пропаганди в публікації. В якості публікацій використовувалися текстові новини та дописи в соціальних мережах. Модель була створена на основі методу лінійної згортки. В даній моделі було розглянуто тринадцять показників, високий рівень кожного з яких вказує на наявність пропаганди в публікації. Значення кожного показника обчислювалося за допомогою методів статистичного та інтелектуального аналізу. Запропоновано алгоритм обчислення рівня впливу кожного фактору. Також запропонована шкала, для визначення рівня пропаганди. Для кожної з відібраних публікацій було обчислено відповідне значення функції цінності. Допустимим рівнем для функції цінності, після якого публікацію можна вважати пропагандистською, вважалося вибіркове середнє її значень для всього набору публікацій. В результаті було сформовано рекомендації щодо того, чи є кожна публікація з даного набору пропагандистською чи ні. Перевагою даного підходу є те, що кожен показник в моделі обчислюється ґрунтуючись виключно на статистичних даних та коректних математичних методах, методах інтелектуального аналізу даних та машинного навчання. Це унеможливлює вплив людини, який може бути суб’єктивним, у даний процес.
dc.format.pagerange	P. 160-179
dc.identifier.citation	Gavrilenko, O. A multifactor model for detecting propaganda in textual data / Olena Gavrilenko, Kyryl Feshchenko // Information, Computing and Intelligent systems. – 2025. – No. 7. – P. 160-179. – Bibliogr.: 21 ref.
dc.identifier.doi	https://doi.org/10.20535/2786-8729.7.2025.342630
dc.identifier.orcid	0000-0003-0413-6274
dc.identifier.orcid	0009-0002-8142-179X
dc.identifier.uri	https://ela.kpi.ua/handle/123456789/78693
dc.language.iso	en
dc.publisher	National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"
dc.publisher.place	Kyiv
dc.relation.ispartof	Information, Computing and Intelligent systems, No. 7, 2025
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	information technology
dc.subject	propaganda
dc.subject	publication
dc.subject	multifactor model
dc.subject	statistical analysis
dc.subject	data mining
dc.subject	machine learning
dc.subject	text mining
dc.subject	recommendations
dc.subject	iнформацiйнi технологiї
dc.subject	пропаганда
dc.subject	публiкацiя
dc.subject	багатофакторна модель
dc.subject	статистичний аналiз
dc.subject	аналiз даних
dc.subject	машинне навчання
dc.subject	аналiз текстiв
dc.subject	рекомендацiї
dc.subject.udc	519.688; 004.89; 004.9
dc.title	A multifactor model for detecting propaganda in textual data
dc.title.alternative	Багатофакторна модель виявлення пропаганди в текстових даних
dc.type	Article

Файли

Контейнер файлів

Зараз показуємо 1 - 1 з 1

Назва:: 160-179.pdf
Розмір:: 491.01 KB
Формат:: Adobe Portable Document Format

Завантажити

Ліцензійна угода

Зараз показуємо 1 - 1 з 1

Назва:: license.txt
Розмір:: 8.98 KB
Формат:: Item-specific license agreed upon to submission
Опис:

Завантажити

Зібрання

Information, Computing and Intelligent systems, No. 7