A Novel Approach in Clustering Algorithm to Evaluate the Performance of Regression Analysis

This paper, introduced a new methodology to raise the metric of a journal's impact. This method is depending on finding clusters from SC Imago database and creates datasets utilizing a modified k-means clustering algorithm. Farther, developing of linear regression analysis on these datasets is...

Full description

Saved in:
Bibliographic Details
Published in:2018 IEEE 8th International Advance Computing Conference (IACC) pp. 47 - 52
Main Authors: S., Govinda Rao, P., Varaprasada Rao, R., Rambabu, Reddy P., Chandra Sekahar
Format: Conference Proceeding
Language:English
Published: IEEE 01-12-2018
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper, introduced a new methodology to raise the metric of a journal's impact. This method is depending on finding clusters from SC Imago database and creates datasets utilizing a modified k-means clustering algorithm. Farther, developing of linear regression analysis on these datasets is perplexed by seeing index values are dependent variables and citation parameters as independent variables result in assessing contributing factors to increase bibliometric index of any journal. next step, cluster quality metrics enforced to evaluate the perfectness of fit of the cluster such as homogeneity score, completeness score, V measure, accommodated rand score and silhouette coefficient. The output of modified k-means algorithm on a dataset of 1445 journals resulted in 3 clusters (k=3). Each cluster data clustered depending on the title. The regression analysis states that the publisher who desires to enhance his journal bibliometric indexes should deliberate the advice conferred, in this work, bring large number of paper submissions to their journal especially. Almost four indices which are of main importance in the publisher industry having been used this. The analysis ensure in strong advantage as the testing of output produced including regression parameters clarified with the identification of outliers by the inclusion of relative error calculation. Accordingly, seeing the suggestive features with increase or decrease in TD3, TC3, CD3, CD2 and RD values, the publisher would profit from raising their respective bibliometric index.
ISSN:2473-3571
DOI:10.1109/IADCC.2018.8692130