Model vs system level testing of autonomous driving systems: a replication and extension study


Bibliographic Details
Published in:	Empirical software engineering : an international journal Vol. 28; no. 3; p. 73
Main Authors:	Stocco, Andrea; Pulfer, Brian; Tonella, Paolo
Format:	Journal Article
Language:	English
Published:	New York: Springer US, 01-05-2023; Springer Nature B.V.
Subjects:	Compilers; Computer Science; Failure; Failure analysis; Interpreters; Model testing; Neural networks; On-line systems; Programming Languages; Radio control; Simulation; Simulators; Software Engineering/Programming and Operating Systems; Validity; System testing; DNN testing; Deep neural networks; Autonomous driving

Description
Summary:	Offline model-level testing of autonomous driving software is much cheaper, faster, and more diversified than in-field, online system-level testing. Hence, researchers have empirically compared model-level and system-level testing using driving simulators. They reported the general usefulness of simulators at reproducing the same conditions experienced in-field, but also some inadequacy of model-level testing at exposing failures that are observable only in online mode. In this work, we replicate the reference study on model vs system-level testing of autonomous vehicles while reconsidering several of its assumptions. These assumptions relate to several threats to validity affecting the original study, which motivated additional analysis and the development of techniques to mitigate them. Moreover, we extend the replicated study by evaluating the original findings on a physical, radio-controlled autonomous vehicle. Our results show that simulator-based testing of autonomous driving systems yields predictions that are close to those obtained on real-world datasets when neural-based translation is used to mitigate the reality gap induced by the simulation platform. On the other hand, model-level testing failures are in line with those experienced at the system level, both in simulated and physical environments, when considering the pre-failure site, similar-looking images, and accurate labels.
ISSN:	1382-3256 1573-7616
DOI:	10.1007/s10664-023-10306-x