[1]杨思栋,王 珂,刘 兵,等.基于IFS-LCT-ViT的时间序列分类方法[J].南京师大学报(自然科学版),2025,48(02):91-101.[doi:10.3969/j.issn.1001-4616.2025.02.010]
 Yang Sidong,Wang Ke,Liu Bing,et al.Time Series Classification Method Based on IFS-LCT-ViT[J].Journal of Nanjing Normal University(Natural Science Edition),2025,48(02):91-101.[doi:10.3969/j.issn.1001-4616.2025.02.010]
点击复制

基于IFS-LCT-ViT的时间序列分类方法()

《南京师大学报(自然科学版)》[ISSN:1001-4616/CN:32-1239/N]

卷:
48
期数:
2025年02期
页码:
91-101
栏目:
计算机科学与技术
出版日期:
2025-04-15

文章信息/Info

Title:
Time Series Classification Method Based on IFS-LCT-ViT
文章编号:
1001-4616(2025)02-0091-11
作者:
杨思栋1王 珂1刘 兵1苏 冰2
(1.中国矿业大学计算机科学与技术学院,江苏 徐州 221000)
(2.河北广电网络集团秦皇岛有限公司,河北 秦皇岛 066000)
Author(s):
Yang Sidong1Wang Ke1Liu Bing1Su Bing2
(1.School of Computer Science and Technology,China University of Mining and Technology,Xuzhou 221000,China)
(2.Hebei Television Broadcasts and Information Network Group Qinhuangdao Co,Ltd,Qinhuangdao 066000,China)
关键词:
时间序列分类图像视角不平衡因子视觉自注意力网络轻量卷积令牌
Keywords:
time series classificationimage perspectiveimbalance factor subtractionvision transformerlightweight convolution token
分类号:
TP391
DOI:
10.3969/j.issn.1001-4616.2025.02.010
文献标志码:
A
摘要:
目前针对时间序列分类问题,大多采用一维视角进行分析. 二维视角下的时间序列具有更高量级的数据,但相关的研究较少且基本为格拉姆角场法(Gramian Angular Field,GAF)和卷积神经网络模型的组合. 文中将对图像视角下的时间序列分类进行深入研究,对目前方法存在的相关问题进行优化. 首先解决GAF算法的计算冗余问题,提出不平衡因子法(imbalance factor subtraction,IFS),以基础运算替换GAF的三角运算,在不损失分类精度的情况下,减少了图像生成过程的运算. 其次针对卷积类模型存在局部偏好的问题,文中将图像识别的任务交给视觉全自注意力网络(Vision Transformer,ViT),通过对时序转换图分割,再对分割后的子块以全局并行计算的方式分配注意力权重,得到图像的整体特征. 最后,提出适配ViT模型的轻量卷积令牌(lightweight convolutional token,LCT),通过一维卷积提取原始序列的局部特征,来弥补ViT模型对图像简单硬分割所带来的信息损失. 结合以上所有提出了IFS-LCT-ViT模型,为了验证模型的有效性,在UCR官网中的11个数据集上进行了实验. 结果表明,该模型与GRU-FCN、TST、GAF-CNN、XCM、OSCNN、MultiRocket相比,在6个数据集上获得了85.9%、80.2%、68.2%、63.0、85.3%和84.0%的最高准确率,证明了该模型在时间序列分类任务上的有效性.
Abstract:
Currently,most of time series classification problems are analyzed from a one-dimensional perspective. Time series in a two-dimensional perspective have higher levels of data,which brings more space for research exploration,but there are fewer related studies. And the studies are basically a combination of Gramian Angular Field(GAF)and convolutional neural network models. In this paper,time series classification from the image perspective will be studied in depth. The related problems existing in current method will be optimized. Firstly,the Imbalance Factor Subtraction(IFS)method is proposed to solve the computational redundancy problem of the GAF algorithm. It replaces the trigonometric operation of GAF with the basic operation,which reduces the operation of the image generation process without losing the classification accuracy. Secondly,aiming at the problem of local preference in convolutional class models,the task of image recognition is given to the Vision Transformer(ViT). The overall features of the image are obtained by segmenting the temporal transition graph and then assigning the attention weight in the same way to all segmented subblocks. Finally,a lightweight convolutional token(LCT)adapted to ViT is proposed to extract the local features of the original sequence through one-dimensional convolution to compensate for the information loss caused by the simple hard segmentation of the image by ViT. Combining all of the above,IFS-LCT-ViT is proposed,and to verify the validity of the model,experiments were carried out on 11 datasets on the UCR website. Experimental results show that compared with GRU-FCN,TST,GADF-CNN,XCM,OSCNN and MultiRocket,the model obtains the highest accuracy of 85.9%,80.2%,68.2%,63.0,85.3% and 84.0% on six datasets,which proves the effectiveness of the model in time series classification tasks.

参考文献/References:

[1]PETERSON L E. K-nearest neighbor[J]. Scholarpedia,2009,4(2):1883.
[2]ABANDA A,MORI U,LOZANO J A. A review on distance based time series classification[J]. Data mining and knowledge discovery,2019,33(2):378-412.
[3]MÜLLER M. Dynamic time warping[M]. Information Retrieval for Music and Motion. Berlin,Heidelberg:Springer,2007:69-84.
[4]ELSAYED N,MAIDA A S,BAYOUMI M. Deep gated recurrent and convolutional network hybrid model for univariate time series classification[J]. [2018-12-18]. https://arxiv.org/pdf/1812.07683.pdf.
[5]ZERVEAS G,JAYARAMAN S,PATEL D,et al. A transformer-based framework for multivariate time series representation learning[C]//Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. Online. 2021:2114-2124.
[6]FAUVEL K,LIN T,MASSON V,et al. Xcm:An explainable convolutional neural network for multivariate time series classification[J]. Mathematics,2021,9(23):3137.
[7]TANG W,LONG G,LIU L,et al. Omni-Scale CNNs:a simple and effective kernel size configuration for time series classification[C]//ICLR 2022-10th International Conference on Learning Representations. Online. 2022.
[8]TAN C W,DEMPSTER A,BERGMEIR C,et al. Multirocket:multiple poding operators and transformations for fast and effective time series classification[J]. Data mining and knowledge discovery,2022,36(5):1623-1646.
[9]DEMPSTER A,PETITJEAN F,WEBB G I. ROCKET:exceptionally fast and accurate time series classification using random convolutional kernels[J]. Data mining and knowledge discovery,2020,34(5):1454-1495.
[10]WANG Z,OATES T. Imaging time-series to improve classification and imputation[C]//Proceedings of the 24th International Conference on Artificial Intelligence. Bunenos Aires:Argentina. 2015:3939-3945.
[11]庞新宇,仝钰,魏子涵. 一种GAF-CNN行星齿轮箱故障诊断方法[J]. 北京理工大学学报,2020,40(11):1161-1167.
[12]姜家国,郭曼利,杨思国. 基于GAF和DenseNet的滚动轴承故障诊断方法[J]. 工矿自动化,2021,47(08):84-89.
[13]DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al. An image is worth 16x16 words:Transformers for image recognition at scale[J]. [2020-10-22]. https://arxiv.org/pdf/2010.11929.pdf.
[14]YUAN L,CHEN Y,WANG T,et al. Tokens-to-token vit:Training vision transformers from scratch on imagenet[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Online. 2021:558-567.

备注/Memo

备注/Memo:
收稿日期:2023-05-03.
基金项目:国家自然科学基金项目(62276266).
通讯作者:王珂,博士,副教授,研究方向:虚拟现实矿山技术、无线传感器网络. E-mail:wangke@cumt.edu.cn
更新日期/Last Update: 2025-04-15