Explaining the GWSkyNet-Multi Machine Learning Classifier Predictions for Gravitational-wave Events

Abstract GWSkyNet-Multi is a machine learning model developed for the classification of candidate gravitational-wave events detected by the LIGO and Virgo observatories. The model uses limited information released in the low-latency Open Public Alerts to produce prediction scores indicating whether...

Full description

Saved in:

Bibliographic Details
Published in:	The Astrophysical journal Vol. 963; no. 2; pp. 98 - 117
Main Authors:	Raza, Nayyer, Chan, Man Leong, Haggard, Daryl, Mahabal, Ashish, McIver, Jess, Abbott, Thomas C., Buffaz, Eitan, Vieira, Nicholas
Format:	Journal Article
Language:	English
Published:	Philadelphia The American Astronomical Society 01-03-2024 IOP Publishing
Subjects:	Astronomical maps Black holes Coherence Convolutional neural networks Deep learning Gravitational wave astronomy Gravitational wave sources Gravitational waves Incoherence LIGO (observatory) Machine learning Neural networks Neutron stars Observatories
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Abstract GWSkyNet-Multi is a machine learning model developed for the classification of candidate gravitational-wave events detected by the LIGO and Virgo observatories. The model uses limited information released in the low-latency Open Public Alerts to produce prediction scores indicating whether an event is a merger of two black holes (BHs), a merger involving a neutron star (NS), or a non-astrophysical glitch. This facilitates time-sensitive decisions about whether to perform electromagnetic follow-up of candidate events during LIGO-Virgo-KAGRA (LVK) observing runs. However, it is not well understood how the model is leveraging the limited information available to make its predictions. As a deep learning neural network, the inner workings of the model can be difficult to interpret, impacting our trust in its validity and robustness. We tackle this issue by systematically perturbing the model and its inputs to explain what underlying features and correlations it has learned for distinguishing the sources. We show that the localization area of the 2D sky maps and the computed coherence versus incoherence Bayes factors are used as strong predictors for distinguishing between real events and glitches. The estimated distance to the source is further used to discriminate between binary BH mergers and mergers involving NSs. We leverage these findings to show that events misclassified by GWSkyNet-Multi in LVK’s third observing run have distinct sky areas, coherence factors, and distance values that influence the predictions and explain these misclassifications. The results help identify the model’s limitations and inform potential avenues for further optimization.
Bibliography:	AAS49017 High-Energy Phenomena and Fundamental Physics
ISSN:	0004-637X 1538-4357
DOI:	10.3847/1538-4357/ad13ea