Predicting rice phenotypes with meta and multi-target learning

The features in some machine learning datasets can naturally be divided into groups. This is the case with genomic data, where features can be grouped by chromosome. In many applications it is common for these groupings to be ignored, as interactions may exist between features belonging to different...

Full description

Saved in:
Bibliographic Details
Published in:Machine learning Vol. 109; no. 11; pp. 2195 - 2212
Main Authors: Orhobor, Oghenejokpeme I., Alexandrov, Nickolai N., King, Ross D.
Format: Journal Article
Language:English
Published: New York Springer US 01-11-2020
Springer Nature B.V
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The features in some machine learning datasets can naturally be divided into groups. This is the case with genomic data, where features can be grouped by chromosome. In many applications it is common for these groupings to be ignored, as interactions may exist between features belonging to different groups. However, including a group that does not influence a response introduces noise when fitting a model, leading to suboptimal predictive accuracy. Here we present two general frameworks for the generation and combination of meta-features when feature groupings are present. Furthermore, we make comparisons to multi-target learning, given that one is typically interested in predicting multiple phenotypes. We evaluated the frameworks and multi-target learning approaches on a genomic rice dataset where the regression task is to predict plant phenotype. Our results demonstrate that there are use cases for both the meta and multi-target approaches, given that overall, they significantly outperform the base case.
ISSN:0885-6125
1573-0565
1573-0565
DOI:10.1007/s10994-020-05881-9