Comparing Resampling Techniques in Stroke Prediction with Machine and Deep Learning

Bibliographic Details
Published in:2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS) pp. 1415 - 1420
Main Authors: Thanka, M Roshni, Ram, Kommu Sri, Gandu, Shalem Preetham, Edwin, E Bijolin, Ebenezer, V, Joy, Priscilla
Format: Conference Proceeding
Language:English
Published: IEEE 14-06-2023
Abstract Cerebrovascular accident (CVA), commonly known as a stroke, is a major cause of morbidity and mortality worldwide. Recent techniques in stroke prediction include the application of machine learning and deep learning algorithms, the integration of multimodal data, and the use of advanced feature selection methods. Challenges in stroke prediction systems include class imbalance, limited and heterogeneous data availability, the interpretability of black-box models, and generalizability across diverse populations. The objective of this study is to address these challenges and enhance the accuracy and reliability of stroke prediction models. Early detection and management of stroke risk factors can reduce the incidence and severity of stroke. Recently, machine learning approaches have been applied to estimate stroke risk from patient data. This study uses three machine learning algorithms and an artificial neural network (ANN) model to predict stroke incidence from a dataset containing 17 variables. The ANN model was optimized using the RandomSearch hyperparameter tuning technique and was trained and tested on both the original imbalanced dataset and six resampled datasets, each generated with a different technique to address the class imbalance problem. The results indicate that the ANN model performed well on both the original dataset and the resampled datasets. The model achieved its highest accuracy, 99.3%, on the dataset resampled using the SMOTE+RandomUnderSampling technique. The study suggests that the resampling techniques employed were effective in improving the performance of the ANN model, especially in dealing with class imbalance in the dataset. These outcomes suggest that the ANN model has the potential to be used as a predictive tool for stroke incidence. Further study is needed, however, to confirm the effectiveness of the model on larger, more diverse datasets and to evaluate its generalizability to other populations.
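The SMOTE+RandomUnderSampling combination named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no code, so the function names, the `over_ratio` and `k` parameters, and the toy two-feature data below are all assumptions made for illustration. The sketch uses only the Python standard library; in practice a library such as imbalanced-learn, which provides `SMOTE` and `RandomUnderSampler`, would more likely be used.

```python
import random
from collections import Counter


def smote(minority, n_new, k=3, rng=None):
    """Create n_new synthetic points, each interpolated between a minority
    sample and one of its k nearest minority neighbours (the SMOTE idea)."""
    rng = rng or random.Random(0)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority points by squared Euclidean distance.
        others = minority[:i] + minority[i + 1:]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(a + gap * (b - a) for a, b in zip(base, neighbour))
        )
    return synthetic


def smote_then_undersample(X, y, minority_label=1, over_ratio=0.5, k=3, seed=0):
    """Hypothetical helper: oversample the minority class up to
    over_ratio * |majority| with SMOTE, then randomly drop majority
    samples until both classes have the same size."""
    rng = random.Random(seed)
    minority = [x for x, label in zip(X, y) if label == minority_label]
    majority = [(x, label) for x, label in zip(X, y) if label != minority_label]
    target = max(len(minority), int(over_ratio * len(majority)))
    minority = minority + smote(minority, target - len(minority), k, rng)
    # Random undersampling: keep a random subset of the majority class.
    kept = rng.sample(majority, min(len(majority), len(minority)))
    X_res = minority + [x for x, _ in kept]
    y_res = [minority_label] * len(minority) + [label for _, label in kept]
    return X_res, y_res


# Toy imbalanced data (assumed, not the paper's dataset): 95 negatives, 5 positives.
data_rng = random.Random(42)
X = [(data_rng.gauss(0, 1), data_rng.gauss(0, 1)) for _ in range(95)]
X += [(data_rng.gauss(3, 1), data_rng.gauss(3, 1)) for _ in range(5)]
y = [0] * 95 + [1] * 5

X_res, y_res = smote_then_undersample(X, y)
print(Counter(y), "->", Counter(y_res))
```

Resampling of this kind is normally applied to the training split only, so that synthetic points never leak into the data used to report accuracy; whether the study did so is not stated in the abstract.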
Author_xml – sequence: 1
  givenname: M Roshni
  surname: Thanka
  fullname: Thanka, M Roshni
  email: roshni@karunya.edu
  organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114
– sequence: 2
  givenname: Kommu Sri
  surname: Ram
  fullname: Ram, Kommu Sri
  email: kommusriram@karunya.edu.in
  organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114
– sequence: 3
  givenname: Shalem Preetham
  surname: Gandu
  fullname: Gandu, Shalem Preetham
  email: gandushalem@karunya.edu.in
  organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114
– sequence: 4
  givenname: E Bijolin
  surname: Edwin
  fullname: Edwin, E Bijolin
  email: bijolin@karunya.edu
  organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114
– sequence: 5
  givenname: V
  surname: Ebenezer
  fullname: Ebenezer, V
  email: ebenezerv@karunya.edu
  organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114
– sequence: 6
  givenname: Priscilla
  surname: Joy
  fullname: Joy, Priscilla
  email: priscillajoy@karunya.edu
  organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114
DOI 10.1109/ICSCSS57650.2023.10169237
EISBN 9798350333602
EndPage 1420
ExternalDocumentID 10169237
Genre orig-research
IsPeerReviewed false
IsScholarly false
Language English
PageCount 6
PublicationDate 2023-June-14
PublicationTitle 2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS)
PublicationTitleAbbrev ICSCSS
PublicationYear 2023
Publisher IEEE
Publisher_xml – name: IEEE
StartPage 1415
SubjectTerms artificial neural network
Artificial neural networks
Cerebrovascular accident
Deep learning
hyperparameter tuning
machine learning
Machine learning algorithms
Prediction algorithms
Predictive models
RandomUnderSampling
resampling techniques
Sociology
stroke
Stroke (medical condition)
Tomek links
Title Comparing Resampling Techniques in Stroke Prediction with Machine and Deep Learning
URI https://ieeexplore.ieee.org/document/10169237