A generalized Lorenz system-based initialization method for deep neural networks
Jia, Bowen, Guo, Zhaoxia, Huang, Tao, Guo, Feng, and Wu, Huyu (2024) A generalized Lorenz system-based initialization method for deep neural networks. Applied Soft Computing, 167 (Part A). 112316.
![]() |
PDF (Published Version)
- Published Version
Restricted to Repository staff only |
Abstract
Deep neural networks (DNNs) are a powerful tool for solving complex problems. The effectiveness of DNNs largely depends on the initialization technique used. This research develops a new initialization method for DNNs that uses chaotic sequences from the generalized Lorenz system to improve their performance. The proposed method, termed Generalized Lorenz Initialization (GLI), has been compared with two established initialization methods (Kaiming and Xavier) across four different DNN architectures: Informer, Neural Basis Expansion Analysis for Interpretable Time Series, Long Short-Term Memory, and NeuRewriter. The performance of these methods has been evaluated on seven time series forecasting datasets and one combinatorial optimization dataset. Results show that the GLI method improved forecasting accuracy by up to 86.47% compared to the Kaiming method and 88.86% compared to the Xavier method across all time series datasets. For the combinatorial optimization task, the GLI method reduced computational time by up to 9.24% with the better solution quality. These indicate the superiority of the GLI method over the two representative initialization methods for different DNN architectures across different problem domains.
Item ID: | 85280 |
---|---|
Item Type: | Article (Research - C1) |
ISSN: | 1872-9681 |
Copyright Information: | © 2024 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies. |
Date Deposited: | 29 Apr 2025 23:19 |
FoR Codes: | 46 INFORMATION AND COMPUTING SCIENCES > 4611 Machine learning > 461104 Neural networks @ 100% |
SEO Codes: | 22 INFORMATION AND COMMUNICATION SERVICES > 2204 Information systems, technologies and services > 220402 Applied computing @ 100% |
More Statistics |