A Weakly Supervised Learning Framework Utilizing Enhanced Class Activation Map for Object Detection in Construction Sites

Computer vision has emerged as a promising tool for improving safety at construction sites through automatic scene recognition. However, traditional approaches require significant labor-intensive and time-consuming efforts for annotations. Although weakly supervised learning has the advantage of loc...

Full description

Saved in:
Bibliographic Details
Published in:IEEE access Vol. 12; pp. 99989 - 100004
Main Authors: Yang, Jaehun, Lee, Eunju, Kwon, Junehyoung, Lee, Dongmin, Kim, Youngbin, Park, Chansik, Lee, Doyeop
Format: Journal Article
Language:English
Published: Piscataway IEEE 2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Computer vision has emerged as a promising tool for improving safety at construction sites through automatic scene recognition. However, traditional approaches require significant labor-intensive and time-consuming efforts for annotations. Although weakly supervised learning has the advantage of localizing objects without location information, the conventional class activation map (CAM) technique struggles with small and linear object localization and background noise at construction sites. For effective scene recognition related to construction safety, the spatial relationships between precisely localized objects are crucial. Due to the limitations of traditional CAM techniques in localizing small and linear objects, using CAM for construction safety monitoring remains inaccurate. Therefore, this study proposes a weakly supervised learning approach with an improved CAM for enhancing object detection in construction sites. The improved CAM localizes objects of various scales and is robust against background interference. Spatial relationships between localized objects are employed to determine the status of scenes for construction safety monitoring. Experiment using datasets associated with falls from ladders (FFL) demonstrates that the improved CAM surpasses the traditional CAM in mIoU (mean intersection over union) for the object localization performance as well as in the accuracy and F1-Score for recognizing unsafe scenes. This demonstrates the robust potential of employing CAM as a weakly supervised learning strategy, underlining its substantial feasibility for preventing hazards in construction sites. The proposed framework can minimize annotation efforts, demonstrating the potential of CAM as a viable computer vision technique for efficiently detecting hazards at construction sites.
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2024.3423697