TY - JOUR
T1 - Cleaning volcano-seismic event catalogues
T2 - a machine learning application for robust systems and potential crises in volcano observatories
AU - Anzieta, Juan
AU - Pacheco, Daniel
AU - Williams-Jones, Glyn
AU - Ruiz, Mario C.
N1 - Publisher Copyright:
© 2023, International Association of Volcanology & Chemistry of the Earth's Interior.
PY - 2023/10
Y1 - 2023/10
N2 - Complete and precise volcano-seismic event catalogues are important not only for the statistical value that they possess for describing past volcanic activity, but also because they constitute the input for automated systems that help monitor volcanic activity in real time. Computer systems are valuable assets in the task of volcano-seismic event classification because in theory they can have improved performance compared to humans due to speed, consistency, and unbiasedness. However, such systems are trained with data from previously created catalogues of events, and as such, if catalogues have noise, the systems will learn incorrectly. In this work, we propose the implementation of a methodology that is relatively easy and fast to apply for the identification of potentially mislabeled events in a seismic event catalogue. We compare the results of applying the procedure to two open catalogues from Cotopaxi and Llaima volcanoes. The first catalogue is believed to have an unknown but potentially significant level of noise, while the other is assumed to be clean. We further validate our results for one of the datasets with volcano observatory scientists in a blind-review fashion to demonstrate some of the hypotheses that can arise in a catalogue with a presumably important level of noise. We conclude that the methodology is valid for identifying potentially mislabeled seismic events and can help in assessing the quality of a given catalogue.
AB - Complete and precise volcano-seismic event catalogues are important not only for the statistical value that they possess for describing past volcanic activity, but also because they constitute the input for automated systems that help monitor volcanic activity in real time. Computer systems are valuable assets in the task of volcano-seismic event classification because in theory they can have improved performance compared to humans due to speed, consistency, and unbiasedness. However, such systems are trained with data from previously created catalogues of events, and as such, if catalogues have noise, the systems will learn incorrectly. In this work, we propose the implementation of a methodology that is relatively easy and fast to apply for the identification of potentially mislabeled events in a seismic event catalogue. We compare the results of applying the procedure to two open catalogues from Cotopaxi and Llaima volcanoes. The first catalogue is believed to have an unknown but potentially significant level of noise, while the other is assumed to be clean. We further validate our results for one of the datasets with volcano observatory scientists in a blind-review fashion to demonstrate some of the hypotheses that can arise in a catalogue with a presumably important level of noise. We conclude that the methodology is valid for identifying potentially mislabeled seismic events and can help in assessing the quality of a given catalogue.
KW - Machine learning
KW - Volcanic crises
KW - Volcano catalogues
KW - Volcano observatories
KW - Volcano-seismic events
UR - http://www.scopus.com/inward/record.url?scp=85171897268&partnerID=8YFLogxK
U2 - 10.1007/s00445-023-01674-9
DO - 10.1007/s00445-023-01674-9
M3 - Article
AN - SCOPUS:85171897268
SN - 0258-8900
VL - 85
JO - Bulletin of Volcanology
JF - Bulletin of Volcanology
IS - 10
M1 - 59
ER -