An Efficient Neural Network Model by Weight Roll Algorithm
Siddhartha Dhar Choudhury¹, Kunal Mehrotra², Christhu Raj³, Rajeev Sukumaran⁴
¹Siddhartha dhar Choudry, Computer Science and Engineering, SRM Institute of Science and Technology, Chennai, India.
²Kunal Mehrota, Christhu Raj, Computer Science and Engineering, SRM Institute of Science and Technology, Chennai, India.
³Rajeev Sukumaran, Teaching Learning Centre, Indian Institute of Technology Madras, Chennai, India.
Manuscript received on November 15, 2019. | Revised Manuscript received on November 23, 2019. | Manuscript published on November 30, 2019. | PP: 729-732 | Volume-8 Issue-4, November 2019. | Retrieval Number: D7016118419/2019©BEIESP | DOI: 10.35940/ijrte.D7016.118419
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Deploying deep learning models require extraction of the model weights from the training environment and saving them to files that can be shipped to production. Often complex models have large model file size and it is difficult to transport those models, this paper aims to reduce the size of the model file while transferring the trained weights to production environ-ment. Weight rolls is an algorithm that rolls down (reduces) the trained model weights to a smaller size, in some cases even re-duced by a proportion of one thousand (1,000). On the produc-tion environment this is again unrolled to regain the original weights that were learned by the neural network during its train-ing phase. Weight rolls uses a compressed pictorial representa-tion of the weights array along with a pix-to-weight neural net-work to transport the learned weights which can be used on the other end for the unrolling process. The pix-to-weight network maps the pixels of the compressed weight image to the original floating point values which in the unrolling phase is used to transform the pixels into corresponding floating point values of trained weights.
Keywords: Loss Function, Regression, Neural Network, Training Time.
Scope of the Article: Algorithm Engineering.

Download PDF

JOURNAL

REQUIREMENTS

PRODUCT

CONTACT US