Search Results - "Advances in data analysis and classification"
-
1
A computationally fast variable importance test for random forests for high-dimensional data
Published in Advances in data analysis and classification (01-12-2018)“…Random forests are a commonly used tool for classification and for ranking candidate predictors based on the so-called variable importance measures. These…”
Get full text
Journal Article -
2
Functional data clustering: a survey
Published in Advances in data analysis and classification (01-09-2014)“…Clustering techniques for functional data are reviewed. Four groups of clustering algorithms for functional data are proposed. The first group consists of…”
Get full text
Journal Article -
3
A comparison of instance-level counterfactual explanation algorithms for behavioral and textual data: SEDC, LIME-C and SHAP-C
Published in Advances in data analysis and classification (01-12-2020)“…Predictive systems based on high-dimensional behavioral and textual data have serious comprehensibility and transparency issues: linear models require…”
Get full text
Journal Article -
4
A novel method for forecasting time series based on fuzzy logic and visibility graph
Published in Advances in data analysis and classification (01-12-2017)“…Time series attracts much attention for its remarkable forecasting potential. This paper discusses how fuzzy logic improves accuracy when forecasting time…”
Get full text
Journal Article -
5
Greedy Gaussian segmentation of multivariate time series
Published in Advances in data analysis and classification (01-09-2019)“…We consider the problem of breaking a multivariate (vector) time series into segments over which the data is well explained as independent samples from a…”
Get full text
Journal Article -
6
Ensemble of optimal trees, random forest and random projection ensemble classification
Published in Advances in data analysis and classification (01-03-2020)“…The predictive performance of a random forest ensemble is highly associated with the strength of individual trees and their diversity. Ensemble of a small…”
Get full text
Journal Article -
7
A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification
Published in Advances in data analysis and classification (01-09-2019)“…The common issues of high-dimensional gene expression data are that many of the genes may not be relevant, and there exists a high correlation among genes…”
Get full text
Journal Article -
8
From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering
Published in Advances in data analysis and classification (2019)“…In model-based clustering mixture models are used to group data points into clusters. A useful concept introduced for Gaussian mixtures by Malsiner Walli et…”
Get full text
Journal Article -
9
Is there a role for statistics in artificial intelligence?
Published in Advances in data analysis and classification (01-12-2022)“…The research on and application of artificial intelligence (AI) has triggered a comprehensive scientific, economic, social and political discussion. Here we…”
Get full text
Journal Article -
10
Ensemble feature selection for high dimensional data: a new method and a comparative study
Published in Advances in data analysis and classification (01-12-2018)“…The curse of dimensionality is based on the fact that high dimensional data is often difficult to work with. A large number of features can increase the noise…”
Get full text
Journal Article -
11
Ensemble of a subset of kNN classifiers
Published in Advances in data analysis and classification (2018)“…Combining multiple classifiers, known as ensemble methods, can give substantial improvement in prediction performance of learning algorithms especially in the…”
Get full text
Journal Article -
12
Minimum adjusted Rand index for two clusterings of a given size
Published in Advances in data analysis and classification (01-03-2023)“…The adjusted Rand index (ARI) is commonly used in cluster analysis to measure the degree of agreement between two data partitions. Since its introduction,…”
Get full text
Journal Article -
13
On mixtures of skew normal and skew $$t$$ -distributions
Published in Advances in data analysis and classification (01-09-2013)Get full text
Journal Article -
14
Threshold-based Naïve Bayes classifier
Published in Advances in data analysis and classification (01-06-2024)“…The Threshold-based Naïve Bayes (Tb-NB) classifier is introduced as a (simple) improved version of the original Naïve Bayes classifier. Tb-NB extracts the…”
Get full text
Journal Article -
15
A principal component method to impute missing values for mixed data
Published in Advances in data analysis and classification (01-03-2016)“…We propose a new method to impute missing values in mixed data sets. It is based on a principal component method, the factorial analysis for mixed data, which…”
Get full text
Journal Article -
16
On discriminating between lognormal and Pareto tail: an unsupervised mixture-based approach
Published in Advances in data analysis and classification (01-06-2024)“…Many stochastic models in economics and finance are described by distributions with a lognormal body. Testing for a possible Pareto tail and estimating the…”
Get full text
Journal Article -
17
Identification of representative trees in random forests based on a new tree-based distance measure
Published in Advances in data analysis and classification (01-06-2024)“…In life sciences, random forests are often used to train predictive models. However, gaining any explanatory insight into the mechanics leading to a specific…”
Get full text
Journal Article -
18
Composite likelihood methods for parsimonious model-based clustering of mixed-type data
Published in Advances in data analysis and classification (01-06-2024)“…In this paper, we propose twelve parsimonious models for clustering mixed-type (ordinal and continuous) data. The dependence among the different types of…”
Get full text
Journal Article -
19
RGA: a unified measure of predictive accuracy
Published in Advances in data analysis and classification (17-01-2024)“…Abstract A key point to assess statistical forecasts is the evaluation of their predictive accuracy. Recently, a new measure, called Rank Graduation Accuracy…”
Get full text
Journal Article -
20
Notes on the H-measure of classifier performance
Published in Advances in data analysis and classification (01-03-2023)“…The H-measure is a classifier performance measure which takes into account the context of application without requiring a rigid value of relative…”
Get full text
Journal Article