Explaining the GWSkyNet-Multi Machine Learning Classifier Predictions for Gravitational-wave Events
Published in: | The Astrophysical journal Vol. 963; no. 2; pp. 98 - 117 |
---|---|
Main Authors: | , , , , , , , |
Format: | Journal Article |
Language: | English |
Published: | Philadelphia: The American Astronomical Society; IOP Publishing, 01-03-2024 |
Summary: | GWSkyNet-Multi is a machine learning model developed for the classification of candidate gravitational-wave events detected by the LIGO and Virgo observatories. The model uses limited information released in the low-latency Open Public Alerts to produce prediction scores indicating whether an event is a merger of two black holes (BHs), a merger involving a neutron star (NS), or a non-astrophysical glitch. This facilitates time-sensitive decisions about whether to perform electromagnetic follow-up of candidate events during LIGO-Virgo-KAGRA (LVK) observing runs. However, it is not well understood how the model is leveraging the limited information available to make its predictions. As a deep learning neural network, the inner workings of the model can be difficult to interpret, impacting our trust in its validity and robustness. We tackle this issue by systematically perturbing the model and its inputs to explain what underlying features and correlations it has learned for distinguishing the sources. We show that the localization area of the 2D sky maps and the computed coherence versus incoherence Bayes factors are used as strong predictors for distinguishing between real events and glitches. The estimated distance to the source is further used to discriminate between binary BH mergers and mergers involving NSs. We leverage these findings to show that events misclassified by GWSkyNet-Multi in LVK’s third observing run have distinct sky areas, coherence factors, and distance values that influence the predictions and explain these misclassifications. The results help identify the model’s limitations and inform potential avenues for further optimization. |
---|---|
Bibliography: | AAS49017 High-Energy Phenomena and Fundamental Physics |
ISSN: | 0004-637X 1538-4357 |
DOI: | 10.3847/1538-4357/ad13ea |