Comparing Resampling Techniques in Stroke Prediction with Machine and Deep Learning
Published in: | 2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS) pp. 1415 - 1420 |
---|---|
Main Authors: | Thanka, M Roshni; Ram, Kommu Sri; Gandu, Shalem Preetham; Edwin, E Bijolin; Ebenezer, V; Joy, Priscilla |
Format: | Conference Proceeding |
Language: | English |
Published: | IEEE, 14-06-2023 |
Subjects: | |
Abstract | Cerebrovascular accident (CVA), commonly known as stroke, is a major cause of morbidity and mortality worldwide. Recent techniques in stroke prediction include the application of machine learning and deep learning algorithms, the integration of multimodal data, and the use of advanced feature selection methods. Challenges in stroke prediction systems include class imbalance, limited and heterogeneous data availability, interpretability of black-box models, and generalizability across diverse populations. The objective of this study is to address these challenges and improve the accuracy and reliability of stroke prediction models. Prompt detection and management of stroke risk factors can reduce the incidence and severity of stroke, and machine learning approaches have recently been applied to estimate stroke risk from patient data. This study uses three machine learning algorithms and an artificial neural network (ANN) model to predict stroke incidence from a dataset containing 17 variables. The ANN model was optimized using the RandomSearch hyperparameter tuning technique and was trained and tested on both the original imbalanced dataset and six resampled datasets, each generated with a different technique to address the class imbalance problem. The results indicate that the ANN model performed well on the original dataset and on the resampled datasets, and achieved higher accuracy, 99.3%, on the dataset resampled using the SMOTE+RandomUnderSampling technique. The study suggests that the resampling techniques employed were effective in improving the performance of the ANN model, especially in handling class imbalance in the dataset. These outcomes suggest that the ANN model has potential as a predictive tool for stroke incidence. Further study is needed, however, to confirm the effectiveness of the model on larger, more diverse datasets and to evaluate its generalizability to other populations. |
---|---|
Author | Ebenezer, V; Edwin, E Bijolin; Gandu, Shalem Preetham; Ram, Kommu Sri; Thanka, M Roshni; Joy, Priscilla |
Author_xml | – sequence: 1 givenname: M Roshni surname: Thanka fullname: Thanka, M Roshni email: roshni@karunya.edu organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114 – sequence: 2 givenname: Kommu Sri surname: Ram fullname: Ram, Kommu Sri email: kommusriram@karunya.edu.in organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114 – sequence: 3 givenname: Shalem Preetham surname: Gandu fullname: Gandu, Shalem Preetham email: gandushalem@karunya.edu.in organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114 – sequence: 4 givenname: E Bijolin surname: Edwin fullname: Edwin, E Bijolin email: bijolin@karunya.edu organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114 – sequence: 5 givenname: V surname: Ebenezer fullname: Ebenezer, V email: ebenezerv@karunya.edu organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114 – sequence: 6 givenname: Priscilla surname: Joy fullname: Joy, Priscilla email: priscillajoy@karunya.edu organization: Karunya Institute of Technology and Sciences,Computer Science and Engineering,Coimbatore,India,641114 |
ContentType | Conference Proceeding |
DOI | 10.1109/ICSCSS57650.2023.10169237 |
EISBN | 9798350333602 |
EndPage | 1420 |
ExternalDocumentID | 10169237 |
Genre | orig-research |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
PageCount | 6 |
PublicationCentury | 2000 |
PublicationDate | 2023-June-14 |
PublicationDateYYYYMMDD | 2023-06-14 |
PublicationDate_xml | – month: 06 year: 2023 text: 2023-June-14 day: 14 |
PublicationDecade | 2020 |
PublicationTitle | 2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS) |
PublicationTitleAbbrev | ICSCSS |
PublicationYear | 2023 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1415 |
SubjectTerms | artificial neural network; Artificial neural networks; Cerebrovascular accident; Deep learning; hyperparameter tuning; machine learning; Machine learning algorithms; Prediction algorithms; Predictive models; RandomUnderSampling; resampling techniques; Sociology; stroke; Stroke (medical condition); Tomek links |
Title | Comparing Resampling Techniques in Stroke Prediction with Machine and Deep Learning |
URI | https://ieeexplore.ieee.org/document/10169237 |
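The abstract names SMOTE+RandomUnderSampling as the resampling combination on which the ANN reached 99.3% accuracy, but this record gives no implementation details. The following is a minimal, standard-library sketch of the general idea only, not the authors' code: the minority class is first oversampled by interpolating between a minority point and one of its nearest minority neighbours (the core SMOTE step), and the majority class is then randomly undersampled. The function names, the `over_ratio` and `under_ratio` parameters, the value of `k`, and the toy data are all assumptions made for illustration. In practice a maintained library such as imbalanced-learn would normally be used instead.

```python
import math
import random


def smote(minority, n_new, k=5, rng=None):
    """Return n_new synthetic points, each interpolated between a random
    minority point and one of its k nearest minority neighbours."""
    rng = rng or random.Random()
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Nearest neighbours are searched within the minority class only.
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(tuple(b + gap * (n - b) for b, n in zip(base, neighbour)))
    return synthetic


def random_undersample(majority, n_keep, rng=None):
    """Keep a random subset of the majority class, without replacement."""
    rng = rng or random.Random()
    return rng.sample(majority, min(n_keep, len(majority)))


def smote_then_undersample(majority, minority, over_ratio=0.5,
                           under_ratio=1.0, k=5, seed=0):
    """Hypothetical helper. over_ratio is the minority/majority ratio wanted
    after SMOTE; under_ratio is the ratio wanted after undersampling."""
    rng = random.Random(seed)
    target_minority = max(len(minority), int(over_ratio * len(majority)))
    minority_out = list(minority) + smote(
        minority, target_minority - len(minority), k, rng)
    target_majority = int(len(minority_out) / under_ratio)
    majority_out = random_undersample(majority, target_majority, rng)
    return majority_out, minority_out


# Toy two-feature data (an assumption; the study's dataset has 17 variables).
data_rng = random.Random(42)
majority = [(data_rng.gauss(0, 1), data_rng.gauss(0, 1)) for _ in range(200)]
minority = [(data_rng.gauss(3, 1), data_rng.gauss(3, 1)) for _ in range(10)]
maj, mino = smote_then_undersample(majority, minority)
print(len(maj), len(mino))
```

Resampling of this kind is normally applied to the training split only, so that the test set keeps the original class distribution; whether the study followed that protocol is not stated in this record.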