Quality Assessment of Distorted Speech and Music Signals

dc.contributor.author: Котвицький, Ігор Валерійович
dc.date.accessioned: 2020-08-27T11:48:27Z
dc.date.available: 2020-08-27T11:48:27Z
dc.date.issued: 2020
dc.description.abstracten: The thesis is devoted to making the quality evaluation of speech and music signals distorted in communication channels objective and automated. Particular attention is paid to analysing objective quality measures for speech and music signals: segmental signal-to-noise ratio (SSNR), log-spectral distortion (LSD), Bark spectral distortion (BSD), perceptual evaluation of speech quality (PESQ), and perceptual evaluation of audio quality (PEAQ). These measures are promising for use in real-time signal quality assessment and correction systems. Matching maps between the subjective DMOS quality measure and the objective SSNR, LSD, BSD, and PEAQ measures have been proposed for linear and nonlinear signal distortion. The linear distortions considered were band limiting and nonlinearity of the phase-frequency response of the communication channel. The thesis points out that log-spectral distortion, a computationally simple objective quality measure, is highly sensitive to the spectral composition of the acoustic signals being analysed, and that this sensitivity must be taken into account. The resulting recommendations make it possible to increase the reliability of conclusions when evaluating the quality of distorted speech and music signals. The thesis further develops the proposition that the reliability of objective quality estimation of distorted acoustic signals can be increased using measures such as the segmental signal-to-noise ratio. It is shown that the sensitivity of the segmental signal-to-noise ratio to signal alignment error can be counteracted by interpolation that raises the sampling rate at least 5 times for music signals, whereas a 2-fold increase is sufficient for speech signals.
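As a rough illustration of two of the measures named above, the following Python sketch computes a segmental SNR and a log-spectral distortion between a reference signal and a degraded one. The frame lengths, the clamping limits of -10 dB and 35 dB, the small `eps` constant, and all function names are assumptions chosen for this example; the abstract does not state the exact definitions used in the thesis.

```python
import cmath
import math


def frames(signal, size):
    """Split a sequence into consecutive non-overlapping frames, dropping the tail."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, size)]


def segmental_snr(ref, deg, size=256, lo=-10.0, hi=35.0, eps=1e-12):
    """Mean per-frame SNR in dB; each frame value is clamped to [lo, hi] (assumed limits)."""
    values = []
    for r, d in zip(frames(ref, size), frames(deg, size)):
        signal_energy = sum(x * x for x in r)
        noise_energy = sum((x - y) ** 2 for x, y in zip(r, d))
        snr = 10.0 * math.log10((signal_energy + eps) / (noise_energy + eps))
        values.append(min(max(snr, lo), hi))
    return sum(values) / len(values)  # requires at least one full frame


def power_spectrum(frame):
    """Naive DFT power spectrum for bins 0..N/2 (O(N^2), adequate for a sketch)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def log_spectral_distortion(ref, deg, size=64, eps=1e-12):
    """Mean over frames of the RMS difference (in dB) between the two power spectra."""
    values = []
    for r, d in zip(frames(ref, size), frames(deg, size)):
        p_ref, p_deg = power_spectrum(r), power_spectrum(d)
        sq = [(10.0 * math.log10((a + eps) / (b + eps))) ** 2 for a, b in zip(p_ref, p_deg)]
        values.append(math.sqrt(sum(sq) / len(sq)))
    return sum(values) / len(values)  # requires at least one full frame
```

Both functions assume the two signals are already time-aligned sample by sample. Segmental SNR compares waveforms directly, which is why a small alignment error can change it noticeably; this is the sensitivity the abstract proposes to reduce by interpolating to a higher sampling rate before alignment.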
To investigate the effect of phase distortion on the quality of speech and music signals, a computer model of a system with a nonlinear phase-frequency response was used; it contains a seven-band non-recursive octave filter, delay lines, and a summation block. The group delay was used as the measure of the nonlinearity of the phase-frequency response. Two types of group delay were considered: in the first, the group delay decreases with frequency; in the second, it increases with frequency. It was shown that the human auditory system does not perceive phase distortion provided that the difference between the group delays at high and low frequencies does not exceed 40 ms for speech signals and 80 ms for music signals. In addition, the human auditory system was found to be sensitive to the sign of this difference. The measured dependence of subjective quality scores for speech and music signals on the degree of clipping provides the basis for calibrating objective measures of the clipping degree. Objective measures of the degree of clipping of speech and music signals have also been proposed: kurtosis, inverse kurtosis, and the square root of inverse kurtosis. The results make it possible to calibrate software and hardware systems for objective quality evaluation of distorted speech and music signals. The limits within which objective speech quality measures can be used in place of subjective intelligibility assessment of noisy speech have been clarified, which can greatly simplify the procedure for assessing speech intelligibility objectively.
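The three clipping measures listed above can be sketched in a few lines of Python. This is a minimal illustration that assumes the ordinary (non-excess) kurtosis, i.e. the fourth central moment divided by the squared variance; the abstract does not say which kurtosis convention the thesis uses, and the function names and the symmetric hard clipper are hypothetical.

```python
import math


def kurtosis(x):
    """Non-excess kurtosis m4 / m2^2 from population moments (assumed convention)."""
    n = len(x)
    mean = sum(x) / n
    m2 = sum((v - mean) ** 2 for v in x) / n
    m4 = sum((v - mean) ** 4 for v in x) / n
    return m4 / (m2 * m2)


def clip(x, limit):
    """Symmetric hard clipping of every sample to the range [-limit, limit]."""
    return [max(-limit, min(limit, v)) for v in x]


def clipping_measures(x):
    """Return the three candidate measures named in the abstract."""
    k = kurtosis(x)
    return {"kurtosis": k, "inverse": 1.0 / k, "sqrt_inverse": math.sqrt(1.0 / k)}
```

Hard clipping pushes a waveform towards a two-level signal, whose kurtosis approaches 1, so kurtosis falls and its inverse rises as clipping gets heavier. A sine wave sampled over whole periods, for instance, has kurtosis 1.5 before clipping and a lower value after `clip(sine, 0.1)`.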
The technology for automating subjective speech intelligibility assessment by the articulation method has been improved, and its efficiency was tested with a prototype automated articulation-testing system developed for this purpose. The results of comparing signal processing algorithms for multi-microphone arrays by objective and subjective sound quality assessment have been refined. A comparison of two space-time processing algorithms, based on the quality of a music signal distorted by additive interference, makes it possible to recommend the computationally simpler delay-and-sum algorithm over the more complex competing algorithm. The causes of poor signal quality in algorithms that provide frequency-independent, high-directivity beam patterns are analysed; the deterioration is attributed to significant computational error in implementing these algorithms, and this error grows as the signal segments become shorter and the microphone's frequency band becomes narrower. Thus, although microphone arrays can be used to improve the quality of acoustic signals distorted by noise and reverberation, the increased sensitivity of the corresponding signal processing algorithms to computational errors should be taken into account. The results obtained by the author can be used in the acoustic examination of rooms and communication lines, as well as in college teaching. [uk]
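For readers unfamiliar with the delay-and-sum algorithm recommended above, here is a minimal time-domain sketch in Python. It assumes the per-microphone arrival delays are already known and are whole numbers of samples; the function name and argument layout are hypothetical, and the thesis implementation may differ (for example by using fractional delays or frequency-domain processing).

```python
def delay_and_sum(channels, delays):
    """Time-align the microphone channels and average them.

    channels: list of equal-rate sample lists, one per microphone.
    delays:   non-negative integer arrival delay of each channel, in samples
              (assumed known in advance, e.g. from the array geometry).
    """
    # Only the part where every shifted channel still has samples is kept.
    length = min(len(ch) - d for ch, d in zip(channels, delays))
    count = len(channels)
    return [
        sum(ch[d + n] for ch, d in zip(channels, delays)) / count
        for n in range(length)
    ]
```

After alignment the wanted signal adds coherently across channels, while interference that is uncorrelated between microphones partially averages out. The whole method is one shift and one average per output sample, which is the computational simplicity the abstract refers to.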
dc.description.abstractru: The dissertation is devoted to making the quality assessment of speech and music signals distorted in communication channels objective and automated. Matching maps between the subjective quality measure DMOS and objective quality measures of acoustic signals have been constructed. Objective measures of the degree of clipping of acoustic signals are proposed for the first time. The technology for automating the subjective assessment of speech intelligibility has been improved. The comparison, by objective and subjective sound quality assessment, of algorithms for processing music signals in multi-microphone arrays has been developed further. [uk]
dc.description.abstractuk: The dissertation is devoted to making the quality assessment of speech and music signals distorted in communication channels objective and automated. Matching maps between the subjective quality measure DMOS and objective quality measures of acoustic signals have been constructed. Objective measures of the degree of clipping of speech and music signals are proposed for the first time. The technology for automating the subjective assessment of speech intelligibility has been improved. The comparison, by objective and subjective sound quality assessment, of algorithms for processing music signals in multi-microphone arrays has been developed further. [uk]
dc.format.pagerange: 24 pp. [uk]
dc.identifier.citation: Котвицький, І. В. Quality assessment of distorted speech and music signals : extended abstract of a Candidate of Technical Sciences dissertation : 05.09.08 – Applied Acoustics and Sound Engineering / Котвицький Ігор Валерійович. – Київ, 2020. – 24 pp. [uk]
dc.identifier.uri: https://ela.kpi.ua/handle/123456789/35852
dc.language.iso: uk [uk]
dc.publisher: КПІ ім. Ігоря Сікорського [uk]
dc.publisher.place: Київ [uk]
dc.subject: карта відповідності [uk]
dc.subject: кліпований сигнал [uk]
dc.subject: масив мікрофонів [uk]
dc.subject: міра якості [uk]
dc.subject: реверберація [uk]
dc.subject: розбірливість мови [uk]
dc.subject: суб'єктивна оцінка [uk]
dc.subject: фазові спотворення [uk]
dc.subject: шум [uk]
dc.subject: якість сигналу [uk]
dc.subject: clipped signal [uk]
dc.subject: matching map [uk]
dc.subject: microphone array [uk]
dc.subject: noise [uk]
dc.subject: phase distortion [uk]
dc.subject: quality measure [uk]
dc.subject: reverberation [uk]
dc.subject: signal quality [uk]
dc.subject: speech intelligibility [uk]
dc.subject: subjective evaluation [uk]
dc.subject: карта соответствия [uk]
dc.subject: качество сигнала [uk]
dc.subject: клипированный сигнал [uk]
dc.subject: массив микрофонов [uk]
dc.subject: мера качества [uk]
dc.subject: разборчивость речи [uk]
dc.subject: реверберация [uk]
dc.subject: субъективная оценка [uk]
dc.subject: фазовые искажения [uk]
dc.subject.udc: 621.391.83 [uk]
dc.title: Quality Assessment of Distorted Speech and Music Signals [uk]
dc.type: Thesis [uk]

Files

Name: Kotvytskyi_aref.pdf
Size: 662.98 KB
Format: Adobe Portable Document Format