Authors |
Andre, J-M ; Behrens, U ; Branson, J ; Chaze, O ; Cittolin, S ; Contescu, C ; Darlea, G-L ; Deldicque, C ; Demiragli, Z ; Dobson, M ; Doualot, N ; Erhan, S ; Fulcher, J. R ; Gigi, D ; Gladki, M ; Glege, F ; Gomez-Ceballos, G ; Hegeman, J ; Holzner, A ; Janulis, Mindaugas ; Lettrich, M ; Meijers, F ; Meschi, E ; Mommsen, R. K ; Morovic, S ; O'Dell, V ; Orn, S. J ; Orsini, L ; Papakrivopoulos, I ; Paus, C ; Petrova, P ; Petrucci, A ; Pieri, M ; Rabady, D ; Racz, A ; Reis, T ; Sakulin, H ; Schwick, C ; Šimelevičius, Dainius ; Vougioukas, M ; Zejdl, P |
Abstract [eng] |
The efficiency of the Data Acquisition (DAQ) of the Compact Muon Solenoid (CMS) experiment for LHC Run 2 is constantly being improved. A significant factor affecting the data taking efficiency is the experience of the DAQ operator. One of the main responsibilities of the DAQ operator is to carry out the proper recovery procedure in case of failure of data-taking. At the start of Run 2, understanding the problem and finding the right remedy could take a considerable amount of time (up to many minutes). Operators heavily relied on the support of on-call experts, also outside working hours. Wrong decisions due to time pressure sometimes lead to an additional overhead in recovery time. To increase the efficiency of CMS data-taking we developed a new expert system, the DAQExpert, which provides shifters with optimal recovery suggestions instantly when a failure occurs. DAQExpert is a web application analyzing frequently updating monitoring data from all DAQ components and identifying problems based on expert knowledge expressed in small, independent logic-modules written in Java. Its results are presented in real-time in the control room via a web-based GUI and a sound-system in a form of short description of the current failure, and steps to recover. |