journal article Mar 01, 2025

Cross-Modal Learning for Anomaly Detection in Complex Industrial Process: Methodology and Benchmark

View at Publisher Save 10.1109/tcsvt.2024.3491865
Topics

No keywords indexed for this article. Browse by subject →

References
65
[6]
Deep learning

Yann LeCun, Yoshua Bengio, Geoffrey Hinton

Nature 10.1038/nature14539
[7]
Mastering the game of Go without human knowledge

David Silver, Julian Schrittwieser, Karen Simonyan et al.

Nature 10.1038/nature24270
[11]
Wu "Abnormal condition diagnosis through deep learning of image sequences for fused magnesium furnaces" Acta Autom. Sinica (2019)
[13]
Long Short-Term Memory

Sepp Hochreiter, Jürgen Schmidhuber

Neural Computation 10.1162/neco.1997.9.8.1735
[15]
Achiam "GPT-4 technical report" (2023)
[16]
Dosovitskiy "An image is worth 16 × 16 words: Transformers for image recognition at scale"
[18]
Driess "PaLM-E: An embodied multimodal language model"
[23]
Time Series Anomaly Detection With Adversarial Reconstruction Networks

Shenghua Liu, Bin Zhou, Quan Ding et al.

IEEE Transactions on Knowledge and Data Engineerin... 10.1109/tkde.2021.3140058
[25]
Xu "Anomaly transformer: Time series anomaly detection with association discrepancy"
[30]
Yu "Multi-scale context aggregation by dilated convolutions"
[42]
Yang "Learning vision-guided quadrupedal locomotion end-to-end with cross-modal transformers" arXiv:2107.03996 (2021)
[43]
Brown "Language models are few-shot learners"
[45]
Multimodal Learning With Transformers: A Survey

Peng Xu, Xiatian Zhu, David A. Clifton

IEEE Transactions on Pattern Analysis and Machine... 10.1109/tpami.2023.3275156
[47]
Yang "Zero-shot video question answering via frozen bidirectional language models"
[48]
Radford "Learning transferable visual models from natural language supervision"

Showing 50 of 65 references

Metrics
22
Citations
65
References
Details
Published
Mar 01, 2025
Vol/Issue
35(3)
Pages
2632-2645
License
View
Funding
National Natural Science Foundation of China Award: 61991404
Fundamental Research Funds for the Central Universities Award: N2424004
Research Program of the Liaoning Liaohe Laboratory Award: LLL23ZZ-05-01
Cite This Article
Guo‐Dong Wu, Yapeng Zhang, Lan Deng, et al. (2025). Cross-Modal Learning for Anomaly Detection in Complex Industrial Process: Methodology and Benchmark. IEEE Transactions on Circuits and Systems for Video Technology, 35(3), 2632-2645. https://doi.org/10.1109/tcsvt.2024.3491865
Related

You May Also Like

Overview of the H.264/AVC video coding standard

T. Wiegand, G.J. Sullivan · 2003

4,504 citations

An Introduction to Biometric Recognition

A.K. Jain, A. Ross · 2004

3,241 citations

Reversible data hiding

Zhicheng Ni, Yun-Qing Shi · 2006

2,131 citations