Spatiotemporal Pretrained Large Language Model for Forecasting With Missing Values
Fang, Le, Xiang, Wei, Pan, Shirui, Salim, Flora D., and Chen, Yi Ping Phoebe (2025) Spatiotemporal Pretrained Large Language Model for Forecasting With Missing Values. IEEE Internet of Things Journal, 12 (10). pp. 13838-13850.
|
PDF (Published Version)
- Published Version
Restricted to Repository staff only |
Abstract
Spatiotemporal data collected by sensors within an urban Internet of Things (IoT) system inevitably contains some missing values, which significantly affects the accuracy of spatiotemporal data forecasting. However, existing techniques, including those based on large language models (LLMs), show limited effectiveness in forecasting with missing values, especially in scenarios involving high-dimensional sensor data. In this article, we propose a novel spatiotemporal pretrained LLM dubbed SPLLM for forecasting with missing values. In this network, we seamlessly integrate a specialized spatiotemporal fusion graph convolutional network (GCN) module that extracts intricate spatiotemporal and graph-based information, for generating suitable inputs to the SPLLM. Furthermore, we propose a feed-forward network (FFN) fine-tuning strategy within the LLM and a final fusion layer to enable the model to leverage the pretrained foundational knowledge of the LLM and adapt to new incomplete data simultaneously. The experimental results indicate that SPLLM outperforms state-of-the-art models on real-world public datasets. Notably, SPLLM exhibits a superior performance in tackling incomplete sensory data with a variety of missing rates. A comprehensive ablation study of key components is conducted to demonstrate their efficiency.
| Item ID: | 88637 |
|---|---|
| Item Type: | Article (Research - C1) |
| ISSN: | 2327-4662 |
| Keywords: | Forecasting, graph convolutional network (GCN), large language model (LLM), missing values, spatiotemporal data |
| Copyright Information: | Copyright © 2025, IEEE. |
| Date Deposited: | 05 Jun 2026 01:08 |
| FoR Codes: | 46 INFORMATION AND COMPUTING SCIENCES > 4611 Machine learning > 461103 Deep learning @ 100% |
| SEO Codes: | 22 INFORMATION AND COMMUNICATION SERVICES > 2204 Information systems, technologies and services > 220403 Artificial intelligence @ 100% |
| More Statistics |
