https://doi.org/10.1007/s41781-025-00147-2
Research
Anomaly Detection for Automated Data Quality Monitoring in the CMS Detector
1
Baylor University,
Waco, USA
2
Boston University, Boston
, USA
3
INFN Sezione di Torino, Turin
, Italy
4
Massachusetts Institute of Technology,
Cambridge, USA
5
Northeastern University,
Boston, USA
6
Panjab University,
Chandigarh, India
7
Rice University,
Houston, USA
8
RWTH Aachen University III. Physikalisches Institut A,
Aachen, Germany
9
University of Bristol,
Bristol, England
10
University of California Santa Barbara,
Santa Barbara, USA
11
University of Florida,
Gainesville, USA
a
This email address is being protected from spambots. You need JavaScript enabled to view it.
Received:
11
April
2025
Accepted:
30
September
2025
Published online:
9
February
2026
Abstract
Successful operation of large particle detectors like the Compact Muon Solenoid (CMS) at the CERN Large Hadron Collider requires rapid, in-depth assessment of data quality. We introduce the “AutoDQM” system for Automated Data Quality Monitoring using advanced statistical techniques and unsupervised machine learning. Anomaly detection algorithms based on the beta-binomial probability function and principal component analysis are tested on the full set of proton-proton collision data collected by CMS in 2022. AutoDQM identifies anomalous “bad” data affected by significant detector malfunction at a rate 4 – 6 times higher than “good” data, demonstrating its effectiveness as a general data quality monitoring tool.
Key words: Particle Physics / Anomoly Detection / PCA / Data Quality Monitoring
Supplementary Information The online version contains supplementary material available at https://doi.org/10.1007/s41781-025-00147-2.
© The Author(s) 2026
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

