Online Machine Learning from Non-stationary Data Streams in the Presence of Concept Drift and Class Imbalance: A Systematic Review

In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause classifier performance degradation. The imbalanced data affects the classifier during concept detection and concept adaptation. In general, for conc...

Full description

Saved in:
Bibliographic Details
Published in:Journal of ICT Vol. 23; no. 1; pp. 105 - 139
Main Authors: Palli, Abdul Sattar, Jaafar, Jafreezal, Gilal, Abdul Rehman, Alsughayyir, Aeshah, Gomes, Heitor Murilo, Alshanqiti, Abdullah, Omar, Mazni
Format: Journal Article
Language:English
Published: UUM Press 01-01-2024
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause classifier performance degradation. The imbalanced data affects the classifier during concept detection and concept adaptation. In general, for concept detection, a separate mechanism is added in parallel with the classifier to detect the concept drift called a drift detector. For concept adaptation, the classifier updates itself or trains a new classifier to replace the older one. In case, the data stream faces a class imbalance issue, the classifier may not properly adapt to the latest concept. In this survey, we study how the existing work addresses the issues of class imbalance and concept drift while learning from nonstationarydata streams. We further highlight the limitation of existing work and challenges caused by other factors of class imbalance alongwith concept drift in data stream classification. Results of our survey found that, out of 1110 studies, by using our inclusion and exclusion criteria, we were able to narrow the pool of articles down to 35 that directly addressed our study objectives. The study found that issues such as multiple concept drift types, dynamic class imbalance ratio, and multi-class imbalance in presence of concept drift are still open for further research. We also observed that, while major research efforts have been dedicated to resolving concept drift and class imbalance, not much attention has been given to with-in-class imbalance, rear examples, and borderline instances when they exist with concept drift in multi-class data. This paper concludes with some suggested future directions.
AbstractList In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause classifier performance degradation. The imbalanced data affects the classifier during concept detection and concept adaptation. In general, for concept detection, a separate mechanism is added in parallel with the classifier to detect the concept drift called a drift detector. For concept adaptation, the classifier updates itself or trains a new classifier to replace the older one. In case, the data stream faces a class imbalance issue, the classifier may not properly adapt to the latest concept. In this survey, we study how the existing work addresses the issues of class imbalance and concept drift while learning from nonstationarydata streams. We further highlight the limitation of existing work and challenges caused by other factors of class imbalance alongwith concept drift in data stream classification. Results of our survey found that, out of 1110 studies, by using our inclusion and exclusion criteria, we were able to narrow the pool of articles down to 35 that directly addressed our study objectives. The study found that issues such as multiple concept drift types, dynamic class imbalance ratio, and multi-class imbalance in presence of concept drift are still open for further research. We also observed that, while major research efforts have been dedicated to resolving concept drift and class imbalance, not much attention has been given to with-in-class imbalance, rear examples, and borderline instances when they exist with concept drift in multi-class data. This paper concludes with some suggested future directions.
In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause classifier performance degradation. The imbalanced data affects the classifier during concept detection and concept adaptation. In general, for concept detection, a separate mechanism is added in parallel with the classifier to detect the concept drift called a drift detector. For concept adaptation, the classifier updates itself or trains a new classifier to replace the older one. In case, the data stream faces a class imbalance issue, the classifier may not properly adapt to the latest concept. In this survey, we study how the existing work addresses the issues of class imbalance and concept drift while learning from nonstationary data streams. We further highlight the limitation of existing work and challenges caused by other factors of class imbalance along with concept drift in data stream classification. Results of our survey found that, out of 1110 studies, by using our inclusion and exclusion criteria, we were able to narrow the pool of articles down to 35 that directly addressed our study objectives. The study found that issues such as multiple concept drift types, dynamic class imbalance ratio, and multi-class imbalance in presence of concept drift are still open for further research. We also observed that, while major research efforts have been dedicated to resolving concept drift and class imbalance, not much attention has been given to with-in-class imbalance, rear examples, and borderline instances when they exist with concept drift in multi-class data. This paper concludes with some suggested future directions.
Author Alsughayyir, Aeshah
Alshanqiti, Abdullah
Gomes, Heitor Murilo
Jaafar, Jafreezal
Omar, Mazni
Palli, Abdul Sattar
Gilal, Abdul Rehman
Author_xml – sequence: 1
  givenname: Abdul Sattar
  surname: Palli
  fullname: Palli, Abdul Sattar
– sequence: 2
  givenname: Jafreezal
  surname: Jaafar
  fullname: Jaafar, Jafreezal
– sequence: 3
  givenname: Abdul Rehman
  surname: Gilal
  fullname: Gilal, Abdul Rehman
– sequence: 4
  givenname: Aeshah
  surname: Alsughayyir
  fullname: Alsughayyir, Aeshah
– sequence: 5
  givenname: Heitor Murilo
  surname: Gomes
  fullname: Gomes, Heitor Murilo
– sequence: 6
  givenname: Abdullah
  surname: Alshanqiti
  fullname: Alshanqiti, Abdullah
– sequence: 7
  givenname: Mazni
  surname: Omar
  fullname: Omar, Mazni
BookMark eNo9kUtvFDEQhC0UJJaQO0f_gVn8mBnb3KINj5UWgghI3Ky2p53MasaObCsoV355nAToS5W6pE_dqtfkJKaIhLzlbCuFNuzdcfZVMNFvhdzy7fCCbATXrJN6FCdkw0c1dD3vf70iZ6UcWZtx4P0oNuTPZVzmiPQL-JtHPSDkOMdrGnJa6dcUu1KhzilCvqcXUIFe1YywFjpHWm-QfstYMHqkKdBdaua20os8h0ohTnS3QCl0vzpYoGXv6Tm9ui8V18b09Dvezfj7DXkZYCl49ldPyc-PH37sPneHy0_73fmh85KZ2sEEElw_Bg0aGDdsVNogauPEhFqpFrngm-VOGdk76Jl34B33cvLaDfKU7J-5U4Kjvc3z2n6yCWb7tEj52kJuZy1ouZi8UhKdwIZhaMIolVHcw-hRmdBY7JnlcyolY_jP48w-VWL_VWKFtNwO8gGQAYOf
CitedBy_id crossref_primary_10_1111_jiec_13509
ContentType Journal Article
DBID AAYXX
CITATION
DOA
DOI 10.32890/jict2024.23.1.5
DatabaseName CrossRef
Directory of Open Access Journals
DatabaseTitle CrossRef
DatabaseTitleList CrossRef

Database_xml – sequence: 1
  dbid: DOA
  name: Directory of Open Access Journals
  url: http://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Journalism & Communications
Engineering
EISSN 2180-3862
EndPage 139
ExternalDocumentID oai_doaj_org_article_12dc773eb2ea400e9f637971ca6ce79f
10_32890_jict2024_23_1_5
GroupedDBID .4S
.DC
AAYXX
ADDVE
AFKRA
ALMA_UNASSIGNED_HOLDINGS
ARAPS
ARCSS
BENPR
BGLVJ
CCPQU
CITATION
EDO
EOJEC
GROUPED_DOAJ
HCIFZ
K7-
OBODZ
OK1
PIMPY
RNS
TH9
TUS
ID FETCH-LOGICAL-c309t-ada3ab46f8a8a01906789ee89b2de877b46bfcde81b7934ba40cbacb1c3dc8b53
IEDL.DBID DOA
ISSN 1675-414X
IngestDate Tue Oct 22 15:13:21 EDT 2024
Thu Nov 21 22:39:09 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c309t-ada3ab46f8a8a01906789ee89b2de877b46bfcde81b7934ba40cbacb1c3dc8b53
OpenAccessLink https://doaj.org/article/12dc773eb2ea400e9f637971ca6ce79f
PageCount 35
ParticipantIDs doaj_primary_oai_doaj_org_article_12dc773eb2ea400e9f637971ca6ce79f
crossref_primary_10_32890_jict2024_23_1_5
PublicationCentury 2000
PublicationDate 2024-01-01
PublicationDateYYYYMMDD 2024-01-01
PublicationDate_xml – month: 01
  year: 2024
  text: 2024-01-01
  day: 01
PublicationDecade 2020
PublicationTitle Journal of ICT
PublicationYear 2024
Publisher UUM Press
Publisher_xml – name: UUM Press
SSID ssj0000651462
Score 2.3244877
Snippet In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause...
SourceID doaj
crossref
SourceType Open Website
Aggregation Database
StartPage 105
SubjectTerms Class Imbalance
Concept Adaptation
Concept Drift
Data Streams
Non-stationary
Title Online Machine Learning from Non-stationary Data Streams in the Presence of Concept Drift and Class Imbalance: A Systematic Review
URI https://doaj.org/article/12dc773eb2ea400e9f637971ca6ce79f
Volume 23
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV07b9swECbaTM1QNG6LPJrihqJABzmmKJpiN7dOkA4NCqQFsgnHV-AAlgNLHrrml-dIKoYzdekmSBRB3J3I-3R33zH2yTqhgzOu4FwRQNHaFaYSvqgk7cgoZXAp2-LyWl3d1PPzSJOzbfUVc8IyPXAW3BkvnVVKEAD0SPbmdZgKpRW3OLVe6ZB238l0B0zlPZgcgdRNlJNHXFS8uskxShEDa2d3C9sT6q_GpRjzsXx2Ju1Q96cz5uINez04hzDLizpgL3w7Yvs7lIEjdjIMWnRL-AzPyju6t-whM4fCz5Qi6WFgT72FWEUCV6u26HLoHdd_YY49QoxK47KDRQvkCsKvVI1kPawCTZ4qGmG-XoQesHWQOmjCj6WJ-ZDWf4UZXG-ZoCGHGd6xPxfnv79fFkOXhcKKie4LdCjQVNNQY42xspyOL-19rU3pfK0UPTLB0iU39C1XhrRgDVrDrXC2NlK8Z3vtqvWHDCI6cjLgBCWS9MuasCXpiV6sDaKsjtiXJzk395lMoyEQknTSPOmkKUXDG3nEvkVFbMdFGux0g4yjGYyj-ZdxHP-PSU7Yq7iu_N_lA9vr1xt_yl52bvMxGd0j3cfdJg
link.rule.ids 315,782,786,866,2106,27933,27934
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Online+Machine+Learning+from+Non-stationary+Data+Streams+in+the+Presence+of+Concept+Drift+and+Class+Imbalance%3A+A+Systematic+Review&rft.jtitle=Journal+of+ICT&rft.au=Palli%2C+Abdul+Sattar&rft.au=Jaafar%2C+Jafreezal&rft.au=Gilal%2C+Abdul+Rehman&rft.au=Alsughayyir%2C+Aeshah&rft.date=2024-01-01&rft.issn=1675-414X&rft.eissn=2180-3862&rft.volume=23&rft.issue=1&rft.spage=105&rft.epage=139&rft_id=info:doi/10.32890%2Fjict2024.23.1.5&rft.externalDBID=n%2Fa&rft.externalDocID=10_32890_jict2024_23_1_5
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1675-414X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1675-414X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1675-414X&client=summon