Online Machine Learning from Non-stationary Data Streams in the Presence of Concept Drift and Class Imbalance: A Systematic Review
In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause classifier performance degradation. The imbalanced data affects the classifier during concept detection and concept adaptation. In general, for conc...
Saved in:
Published in: | Journal of ICT Vol. 23; no. 1; pp. 105 - 139 |
---|---|
Main Authors: | , , , , , , |
Format: | Journal Article |
Language: | English |
Published: |
UUM Press
01-01-2024
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Abstract | In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause classifier performance degradation. The imbalanced data affects the classifier during concept detection and concept adaptation. In general, for concept detection, a separate mechanism is added in parallel with the classifier to detect the concept drift called a drift detector. For concept adaptation, the classifier updates itself or trains a new classifier to replace the older one. In case, the data stream faces a class imbalance issue, the classifier may not properly adapt to the latest concept. In this survey, we study how the existing work addresses the issues of class imbalance and concept drift while learning from nonstationarydata streams. We further highlight the limitation of existing work and challenges caused by other factors of class imbalance alongwith concept drift in data stream classification. Results of our survey found that, out of 1110 studies, by using our inclusion and exclusion criteria, we were able to narrow the pool of articles down to 35 that directly addressed our study objectives. The study found that issues such as multiple concept drift types, dynamic class imbalance ratio, and multi-class imbalance in presence of concept drift are still open for further research. We also observed that, while major research efforts have been dedicated to resolving concept drift and class imbalance, not much attention has been given to with-in-class imbalance, rear examples, and borderline instances when they exist with concept drift in multi-class data. This paper concludes with some suggested future directions. |
---|---|
AbstractList | In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause classifier performance degradation. The imbalanced data affects the classifier during concept detection and concept adaptation. In general, for concept detection, a separate mechanism is added in parallel with the classifier to detect the concept drift called a drift detector. For concept adaptation, the classifier updates itself or trains a new classifier to replace the older one. In case, the data stream faces a class imbalance issue, the classifier may not properly adapt to the latest concept. In this survey, we study how the existing work addresses the issues of class imbalance and concept drift while learning from nonstationarydata streams. We further highlight the limitation of existing work and challenges caused by other factors of class imbalance alongwith concept drift in data stream classification. Results of our survey found that, out of 1110 studies, by using our inclusion and exclusion criteria, we were able to narrow the pool of articles down to 35 that directly addressed our study objectives. The study found that issues such as multiple concept drift types, dynamic class imbalance ratio, and multi-class imbalance in presence of concept drift are still open for further research. We also observed that, while major research efforts have been dedicated to resolving concept drift and class imbalance, not much attention has been given to with-in-class imbalance, rear examples, and borderline instances when they exist with concept drift in multi-class data. This paper concludes with some suggested future directions. In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause classifier performance degradation. The imbalanced data affects the classifier during concept detection and concept adaptation. In general, for concept detection, a separate mechanism is added in parallel with the classifier to detect the concept drift called a drift detector. For concept adaptation, the classifier updates itself or trains a new classifier to replace the older one. In case, the data stream faces a class imbalance issue, the classifier may not properly adapt to the latest concept. In this survey, we study how the existing work addresses the issues of class imbalance and concept drift while learning from nonstationary data streams. We further highlight the limitation of existing work and challenges caused by other factors of class imbalance along with concept drift in data stream classification. Results of our survey found that, out of 1110 studies, by using our inclusion and exclusion criteria, we were able to narrow the pool of articles down to 35 that directly addressed our study objectives. The study found that issues such as multiple concept drift types, dynamic class imbalance ratio, and multi-class imbalance in presence of concept drift are still open for further research. We also observed that, while major research efforts have been dedicated to resolving concept drift and class imbalance, not much attention has been given to with-in-class imbalance, rear examples, and borderline instances when they exist with concept drift in multi-class data. This paper concludes with some suggested future directions. |
Author | Alsughayyir, Aeshah Alshanqiti, Abdullah Gomes, Heitor Murilo Jaafar, Jafreezal Omar, Mazni Palli, Abdul Sattar Gilal, Abdul Rehman |
Author_xml | – sequence: 1 givenname: Abdul Sattar surname: Palli fullname: Palli, Abdul Sattar – sequence: 2 givenname: Jafreezal surname: Jaafar fullname: Jaafar, Jafreezal – sequence: 3 givenname: Abdul Rehman surname: Gilal fullname: Gilal, Abdul Rehman – sequence: 4 givenname: Aeshah surname: Alsughayyir fullname: Alsughayyir, Aeshah – sequence: 5 givenname: Heitor Murilo surname: Gomes fullname: Gomes, Heitor Murilo – sequence: 6 givenname: Abdullah surname: Alshanqiti fullname: Alshanqiti, Abdullah – sequence: 7 givenname: Mazni surname: Omar fullname: Omar, Mazni |
BookMark | eNo9kUtvFDEQhC0UJJaQO0f_gVn8mBnb3KINj5UWgghI3Ky2p53MasaObCsoV355nAToS5W6pE_dqtfkJKaIhLzlbCuFNuzdcfZVMNFvhdzy7fCCbATXrJN6FCdkw0c1dD3vf70iZ6UcWZtx4P0oNuTPZVzmiPQL-JtHPSDkOMdrGnJa6dcUu1KhzilCvqcXUIFe1YywFjpHWm-QfstYMHqkKdBdaua20os8h0ohTnS3QCl0vzpYoGXv6Tm9ui8V18b09Dvezfj7DXkZYCl49ldPyc-PH37sPneHy0_73fmh85KZ2sEEElw_Bg0aGDdsVNogauPEhFqpFrngm-VOGdk76Jl34B33cvLaDfKU7J-5U4Kjvc3z2n6yCWb7tEj52kJuZy1ouZi8UhKdwIZhaMIolVHcw-hRmdBY7JnlcyolY_jP48w-VWL_VWKFtNwO8gGQAYOf |
CitedBy_id | crossref_primary_10_1111_jiec_13509 |
ContentType | Journal Article |
DBID | AAYXX CITATION DOA |
DOI | 10.32890/jict2024.23.1.5 |
DatabaseName | CrossRef Directory of Open Access Journals |
DatabaseTitle | CrossRef |
DatabaseTitleList | CrossRef |
Database_xml | – sequence: 1 dbid: DOA name: Directory of Open Access Journals url: http://www.doaj.org/ sourceTypes: Open Website |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Journalism & Communications Engineering |
EISSN | 2180-3862 |
EndPage | 139 |
ExternalDocumentID | oai_doaj_org_article_12dc773eb2ea400e9f637971ca6ce79f 10_32890_jict2024_23_1_5 |
GroupedDBID | .4S .DC AAYXX ADDVE AFKRA ALMA_UNASSIGNED_HOLDINGS ARAPS ARCSS BENPR BGLVJ CCPQU CITATION EDO EOJEC GROUPED_DOAJ HCIFZ K7- OBODZ OK1 PIMPY RNS TH9 TUS |
ID | FETCH-LOGICAL-c309t-ada3ab46f8a8a01906789ee89b2de877b46bfcde81b7934ba40cbacb1c3dc8b53 |
IEDL.DBID | DOA |
ISSN | 1675-414X |
IngestDate | Tue Oct 22 15:13:21 EDT 2024 Thu Nov 21 22:39:09 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c309t-ada3ab46f8a8a01906789ee89b2de877b46bfcde81b7934ba40cbacb1c3dc8b53 |
OpenAccessLink | https://doaj.org/article/12dc773eb2ea400e9f637971ca6ce79f |
PageCount | 35 |
ParticipantIDs | doaj_primary_oai_doaj_org_article_12dc773eb2ea400e9f637971ca6ce79f crossref_primary_10_32890_jict2024_23_1_5 |
PublicationCentury | 2000 |
PublicationDate | 2024-01-01 |
PublicationDateYYYYMMDD | 2024-01-01 |
PublicationDate_xml | – month: 01 year: 2024 text: 2024-01-01 day: 01 |
PublicationDecade | 2020 |
PublicationTitle | Journal of ICT |
PublicationYear | 2024 |
Publisher | UUM Press |
Publisher_xml | – name: UUM Press |
SSID | ssj0000651462 |
Score | 2.3244877 |
Snippet | In IoT environment applications generate continuous non-stationary data streams with in-built problems of concept drift and class imbalance which cause... |
SourceID | doaj crossref |
SourceType | Open Website Aggregation Database |
StartPage | 105 |
SubjectTerms | Class Imbalance Concept Adaptation Concept Drift Data Streams Non-stationary |
Title | Online Machine Learning from Non-stationary Data Streams in the Presence of Concept Drift and Class Imbalance: A Systematic Review |
URI | https://doaj.org/article/12dc773eb2ea400e9f637971ca6ce79f |
Volume | 23 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV07b9swECbaTM1QNG6LPJrihqJABzmmKJpiN7dOkA4NCqQFsgnHV-AAlgNLHrrml-dIKoYzdekmSBRB3J3I-3R33zH2yTqhgzOu4FwRQNHaFaYSvqgk7cgoZXAp2-LyWl3d1PPzSJOzbfUVc8IyPXAW3BkvnVVKEAD0SPbmdZgKpRW3OLVe6ZB238l0B0zlPZgcgdRNlJNHXFS8uskxShEDa2d3C9sT6q_GpRjzsXx2Ju1Q96cz5uINez04hzDLizpgL3w7Yvs7lIEjdjIMWnRL-AzPyju6t-whM4fCz5Qi6WFgT72FWEUCV6u26HLoHdd_YY49QoxK47KDRQvkCsKvVI1kPawCTZ4qGmG-XoQesHWQOmjCj6WJ-ZDWf4UZXG-ZoCGHGd6xPxfnv79fFkOXhcKKie4LdCjQVNNQY42xspyOL-19rU3pfK0UPTLB0iU39C1XhrRgDVrDrXC2NlK8Z3vtqvWHDCI6cjLgBCWS9MuasCXpiV6sDaKsjtiXJzk395lMoyEQknTSPOmkKUXDG3nEvkVFbMdFGux0g4yjGYyj-ZdxHP-PSU7Yq7iu_N_lA9vr1xt_yl52bvMxGd0j3cfdJg |
link.rule.ids | 315,782,786,866,2106,27933,27934 |
linkProvider | Directory of Open Access Journals |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Online+Machine+Learning+from+Non-stationary+Data+Streams+in+the+Presence+of+Concept+Drift+and+Class+Imbalance%3A+A+Systematic+Review&rft.jtitle=Journal+of+ICT&rft.au=Palli%2C+Abdul+Sattar&rft.au=Jaafar%2C+Jafreezal&rft.au=Gilal%2C+Abdul+Rehman&rft.au=Alsughayyir%2C+Aeshah&rft.date=2024-01-01&rft.issn=1675-414X&rft.eissn=2180-3862&rft.volume=23&rft.issue=1&rft.spage=105&rft.epage=139&rft_id=info:doi/10.32890%2Fjict2024.23.1.5&rft.externalDBID=n%2Fa&rft.externalDocID=10_32890_jict2024_23_1_5 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1675-414X&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1675-414X&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1675-414X&client=summon |