Unsupervised Machine Learning Via Transfer Learning and k-Means Clustering to Classify Materials Image Data
Unsupervised machine learning offers significant opportunities for extracting knowledge from unlabeled datasets and for achieving maximum machine learning performance. This paper demonstrates how to construct, use, and evaluate a high-performance unsupervised machine learning system for classifying...
Saved in:
Published in: | Integrating materials and manufacturing innovation Vol. 10; no. 2; pp. 231 - 244 |
---|---|
Main Authors: | , |
Format: | Journal Article |
Language: | English |
Published: |
Cham
Springer International Publishing
01-06-2021
Springer Nature B.V |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Unsupervised machine learning offers significant opportunities for extracting knowledge from unlabeled datasets and for achieving maximum machine learning performance. This paper demonstrates how to construct, use, and evaluate a high-performance unsupervised machine learning system for classifying images in a popular microstructural dataset. The Northeastern University Steel Surface Defects Database includes micrographs of six different defects observed on hot-rolled steel in a format that is convenient for training and evaluating models for image classification. We use the VGG16 convolutional neural network pre-trained on the ImageNet dataset of natural images to extract feature representations for each micrograph. After applying principal component analysis to extract signal from the feature descriptors, we use
k
-means clustering to classify the images without needing labeled training data. The approach achieves 99.4% ± 0.16% accuracy, and the resulting model can be used to classify new images without retraining. This approach demonstrates an improvement in both performance and utility compared to a previous study. A sensitivity analysis is conducted to better understand the influence of each step on the classification performance. The results provide insight toward applying unsupervised machine learning techniques to problems of interest in materials science. |
---|---|
ISSN: | 2193-9764 2193-9772 |
DOI: | 10.1007/s40192-021-00205-8 |