Interpretable Transformed ANOVA Approximation on the Example of the Prevention of Forest Fires

The distribution of data points is a key component in machine learning. In most cases, one uses min-max-normalization to obtain nodes in [0, 1] or Z -score normalization for standard normal distributed data. In this paper, we apply transformation ideas in order to design a complete orthonormal syste...

Full description

Saved in:
Bibliographic Details
Published in:Frontiers in applied mathematics and statistics Vol. 8
Main Authors: Potts, Daniel, Schmischke, Michael
Format: Journal Article
Language:English
Published: Frontiers Media S.A 26-01-2022
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The distribution of data points is a key component in machine learning. In most cases, one uses min-max-normalization to obtain nodes in [0, 1] or Z -score normalization for standard normal distributed data. In this paper, we apply transformation ideas in order to design a complete orthonormal system in the L 2 space of functions with the standard normal distribution as integration weight. Subsequently, we are able to apply the explainable ANOVA approximation for this basis and use Z -score transformed data in the method. We demonstrate the applicability of this procedure on the well-known forest fires dataset from the UCI machine learning repository. The attribute ranking obtained from the ANOVA approximation provides us with crucial information about which variables in the dataset are the most important for the detection of fires.
ISSN:2297-4687
2297-4687
DOI:10.3389/fams.2022.795250