Огляд методів реалізації нейронних обчислень на вбудованій системі

Скірко, П. О.; Редько, І. В.

Огляд методів реалізації нейронних обчислень на вбудованій системі

dc.contributor.author	Скірко, П. О.
dc.contributor.author	Редько, І. В.
dc.date.accessioned	2023-03-28T13:24:45Z
dc.date.available	2023-03-28T13:24:45Z
dc.date.issued	2020
dc.description.abstract	У статті виконаний огляд актуальних обмежень у впровадженні нейронних обчислень на вбудованих системах та шляхи їх подолання. Виконано порівняння класичної архітектури інтернету речей з обчисленнями у хмарі та більш сучасної – з частково перенесеною логікою на крайовий пристрій. Розглянуто які технології можуть бути застосовані для запровадження подібної системи та описано методику, що дозволяє досягти поставленої мети – тобто виконати корисні обчислення на крайовому пристрої у реальному часі.	uk
dc.description.abstractother	Convolutional neural networks have reached a stage of development where they can be applied to tasks such as posture, face, voice and situation recognition with high accuracy, but inference of classic high precision models require a powerful server that creates high requirements for the reliability and bandwidth of the network that connects the server to the Edge device. Traditional architecture of Internet of Things can be represented as follows: information from the outside world is accumulated through sensors, cameras or input by the user; the data obtained is aggregated and subjected to initial processing by the edge device (the edge unit is typically characterized by small dimensions, low cost, and low power consumption); the processed and compressed data is transmitted to the remote servers via the Internet, where it is further processed using more resource-intensive and complex algorithms; the cloud computes results and updates information in the database; the new, more up-to-date information goes down to the edge device, where it is used to change the status of certain indicators, engines, turn on or off the light, etc. At the same time, there are tasks in which delays of the network are unacceptable, its reliability cannot be guaranteed, raw data from the microphone, camera, or other sensors may contain sensitive information that is not desirable to send to the server, the expected number of devices will create such load of network and datacenter that will be too expensive to process. For these cases, technological solutions have been developed that allow performing neural computations on the edge device in real time, that is, recognizing visual images of 25 frames per second. These techniques includes: development of new neural network models capable of solving similar problems using fewer coefficients and simpler activation functions, but with not too high a drop in accuracy; applying specialized hardware, capable of performing calculations with higher efficiency (efficiency refers to the ratio of the number of operations per unit of time to the power consumed); optimization of existing neural network models by reducing the accuracy of coefficients representation, ignoring the coefficients close to zero, etc. The first class includes Google MobileNet V1 and V2 development. They are based on previous models but have fewer layers and neurons in them, which reduced the total number of parameters to 0.5-4.2 million. The second class includes ASIC (Application-specific integrated circuit), FPGA (Field-Programmable Gate Array) and GPU (Graphic Processing Unit). ASICs are specifically designed for a specific neural network model and have the highest performance and energy efficiency available today. The third class includes quantization algorithms and a thorough analysis of the structure of the neural network to identify which parts of it contribute most to the correct result. Such conversion, although loss of information, allows for accuracy close to the original. The combination of all the techniques allows you to achieve the task.	uk
dc.format.pagerange	С. 34-37	uk
dc.identifier.citation	Скірко, П. О. Огляд методів реалізації нейронних обчислень на вбудованій системі / Скірко П. О., Редько І. В. // Електронна та Акустична Інженерія : науково-технічний журнал. – 2020. – Т. 3, № 1. – С. 34-37. – Бібліогр.: 15 назв.	uk
dc.identifier.doi	https://doi.org/10.20535/2617-0965.2020.3.1.198586
dc.identifier.orcid	0000-0002-6709-1053	uk
dc.identifier.orcid	0000-0002-3121-1412	uk
dc.identifier.uri	https://ela.kpi.ua/handle/123456789/54064
dc.language.iso	uk	uk
dc.publisher	КПІ ім. Ігоря Сікорського	uk
dc.publisher.place	Київ	uk
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.source	Електронна та Акустична Інженерія : науково-технічний журнал, 2020, Т. 3, № 1
dc.subject	вбудовані системи	uk
dc.subject	крайовий пристрій	uk
dc.subject	нейромережа	uk
dc.subject	квантизація нейромережі	uk
dc.subject	згорткова нейромережа	uk
dc.subject	інтернет речей	uk
dc.subject	edge device	uk
dc.subject	convolutional network	uk
dc.subject	inference	uk
dc.subject	network quantization	uk
dc.subject	internet of things	uk
dc.subject.udc	004.8	uk
dc.title	Огляд методів реалізації нейронних обчислень на вбудованій системі	uk
dc.type	Article	uk

Файли

Контейнер файлів

Зараз показуємо 1 - 1 з 1

Назва:: EAI2020_3-1_34-37.pdf
Розмір:: 394.59 KB
Формат:: Adobe Portable Document Format
Опис:

Завантажити

Ліцензійна угода

Зараз показуємо 1 - 1 з 1

Назва:: license.txt
Розмір:: 9.1 KB
Формат:: Item-specific license agreed upon to submission
Опис:

Завантажити

Зібрання

Електронна та Акустична Інженерія: науково-технічний журнал, Т. 3, № 1