Speedup of deep neural network learning on the MIC-architecture

Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная

Ссылки

http://ieeexplore.ieee.org/document/7568443/

DOI

https://doi.org/10.1109/HPCSim.2016.7568443
Другие версии

Deep neural networks are more accurate, but require more computational power in the learning process. Moreover, it is an iterative process. The goal of the research is to investigate efficiency of solving this problem on MIC architecture without changing baseline algorithm. Well-known code vectorization and parallelization methods are used to increase the effectiveness of the program on MIC architecture. In the course of the experiments we test two coprocessor data transfer models: explicit and implicit one. We show that implicit memory copying is more efficient than explicit one, because only modified memory blocks are copied. MIC architecture shows competitive performance compared to multi-core ×86 processor.

Язык оригинала	английский
Название основной публикации	International Conference on High Performance Computing Simulation (HPCS'16)
Издатель	Institute of Electrical and Electronics Engineers Inc.
Страницы	989-992
ISBN (печатное издание)	978-1-5090-2088-1
DOI	https://doi.org/10.1109/HPCSim.2016.7568443
Состояние	Опубликовано - 2016

ID: 7632736

Pure – это продукт компании Elsevier
На данном информационном ресурсе могут быть опубликованы архивные материалы
с упоминанием физических и юридических лиц, включенных Министерством юстиции
Российской Федерации в реестр иностранных агентов

Вход в Pure