Why is the Winner the Best?

International benchmarking competitions have become fundamental for the comparative performance assessment of image analysis methods. However, little attention has been given to investigating what can be learnt from these competitions. Do they really generate scientific progress? What are common and...

Full description

Saved in:
Bibliographic Details
Published in:2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) pp. 19955 - 19966
Main Authors: Reinke, A., Tizabi, M. D., Isensee, F., Adler, T. J., Ali, S., Aubreville, M., Baid, U., Bakas, S., Bano, S., Bernal, J., Bodenstedt, S., Casella, A., Cheplygina, V., Daum, M., De Bruijne, M., Depeursinge, A., Dorent, R., Egger, J., Ellis, D. G., Engelhardt, S., Ghatwary, N., Girard, G., Godau, P., Hansen, L., Heinrich, M., Heller, N., Hering, A., Huaulme, A., Kavur, A. E., Kozubek, M., Li, J., Li, H., Ma, J., Martin-Isla, C., Menze, B., Noble, A., Oreiller, V., Padoy, N., Pati, S., Payette, K., Radsch, T., Rafael-Patino, J., Bawa, V. Singh, Speidel, S., Sudre, C. H., Van Wijnen, K., Wagner, M., Wei, D., Yamlahi, A., Yap, M. H., Yuan, C., Zenk, M., Zimmerer, D., Brtingel, R., Dou, Q., Ezhov, I., Friedrich, C. M., Fuller, C., Galdran, A., Faura, A. Garcia, Grammatikopoulou, M., Hong, S., Jahanifar, M., Jang, I., Kadkhodamohammadi, A., Kang, I., Kondo, S., Kuijf, H., Li, M., Luu, M., Martincic, T., Morais, P., Naser, M. A., Oliveira, B., Owen, D., Pang, S., Park, S., Plotka, S., Puybareau, E., Rajpoot, N., Ryu, K., Shephard, A., Shi, P., Stepec, D., Tochon, G., Urien, H., Vilaca, J. L., Wahid, K. A., Wang, J., Wang, L., Wang, X., Wodzinski, M., Xia, F., Xiong, Z., Yang, S., Yang, Y., Zhao, Z., Jager, P. F., Kopp-Schneider, A., Maier-Hein, L.
Format: Conference Proceeding
Language:English
Published: IEEE 01-06-2023
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:International benchmarking competitions have become fundamental for the comparative performance assessment of image analysis methods. However, little attention has been given to investigating what can be learnt from these competitions. Do they really generate scientific progress? What are common and successful participation strategies? What makes a solution superior to a competing method? To address this gap in the literature, we performed a multicenter study with all 80 competitions that were conducted in the scope of IEEE ISBI 2021 and MICCAI 2021. Statistical analyses performed based on comprehensive descriptions of the submitted algorithms linked to their rank as well as the underlying participation strategies revealed common characteristics of winning solutions. These typically include the use of multi-task learning (63%) and/or multi-stage pipelines (61%), and a focus on augmentation (100%), image preprocessing (97%), data curation (79%), and post-processing (66%). The "typical" lead of a winning team is a computer scientist with a doctoral degree, five years of experience in biomedical image analysis, and four years of experience in deep learning. Two core general development strategies stood out for highly-ranked teams: the reflection of the metrics in the method design and the focus on analyzing and handling failure cases. According to the organizers, 43% of the winning algorithms exceeded the state of the art but only 11% completely solved the respective domain problem. The insights of our study could help researchers (1) improve algorithm development strategies when approaching new problems, and (2) focus on open research questions revealed by this work.
ISSN:2575-7075
DOI:10.1109/CVPR52729.2023.01911