SLA based healthcare big data analysis and computing in cloud network

Large volume of multi-structured and low-latency patient data are generated in healthcare services, which is achallenging task to process and analyze within the Service Level Agreement (SLA). In this paper, a Parallel Semi-Naive Bayes (PSNB) based probabilistic method is used to process the healthca...

Full description

Saved in:
Bibliographic Details
Published in:Journal of parallel and distributed computing Vol. 119; pp. 121 - 135
Main Authors: Sahoo, Prasan Kumar, Mohapatra, Suvendu Kumar, Wu, Shih-Lin
Format: Journal Article
Language:English
Published: Elsevier Inc 01-09-2018
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Large volume of multi-structured and low-latency patient data are generated in healthcare services, which is achallenging task to process and analyze within the Service Level Agreement (SLA). In this paper, a Parallel Semi-Naive Bayes (PSNB) based probabilistic method is used to process the healthcare big data in cloud for future health condition prediction. In order to improve the accuracy of PSNB method, a Modified Conjunctive Attribute (MCA) algorithm is proposed for reducing the dimension. Emergency condition of the patient is considered by setting a global priority among the patients and an Optimal Data Distribution (ODD) algorithm is proposed to position both batch and streaming patient data into the Spark nodes. Further, a Dynamic Job Scheduling (DJS) algorithm is designed to schedule the jobs efficiently to the most suitable nodes for processing the data taking SLA into account. Our proposed PSNB algorithm provides better accuracy of 87.8% for both batch and streaming data, which is 12.8% higher than the original Naive–Bayes (NB) algorithm and can conveniently be employed in various patient monitoring applications. •SLA based healthcare big data analytic architecture is designed for Spark platform.•Ranking of patient’s data is made to improve the processing speed.•Efficient data distribution mechanism is designed to allocate batch and streaming data.•Priority based job allocation algorithm is designed.•Analysis and prediction of future health condition of the patients is designed.
ISSN:0743-7315
1096-0848
DOI:10.1016/j.jpdc.2018.04.006