Synthesising large, low cost and diverse datasets for robust semantic segmentation in self-driving tasks

Robust scene understanding algorithms are essential for the success of autonomous navigation. Unfortunately the supervised learning of semantic segmentation requires large and diverse datasets. For certain self-driving tasks like navigating a robot inside an industrial facility no datasets are freely available and the manual annotation of large datasets is impracticable for smaller development teams. Although approaches exist to automatically generate synthetic data, they are either too computational expensive, demand a huge preparation effort or miss a large variety of features. This paper presents a novel framework to generate synthetic datasets with great variance for low computing demand that are easily adaptable to different self-driving tasks (Available at https://github.com/cITIcar/SAD-Generator). As demonstration this approach was applied to a semantic segmentation task on a miniature road with random obstacles, lane markings and disturbing artefacts. Training thus synthesized data in a U-Net and and later fine-tuning it with a small amount of manually annotated data, improved pixel accuracy (PA) by 2.5 percentage points and mean intersection over union (mIoU) by 11.19 percentage points.

Description

The captions of figures 7 and 9 do not match the figures and have been revised in the 2nd edition.

2nd revised edition:
https://doi.org/10.25819/ubsi/10510

This article presents a framework to artificially generate computer vision datasets with great variance for low computing demand that is easily adaptable to different semantic segmentation tasks.

The source code for this article is available on Github (https://github.com/cITIcar/SAD-Generator).

DOI

10.25819/ubsi/10111

URN

nbn:de:hbz:467-22008

URI

https://dspace.ub.uni-siegen.de/handle/ubsi/2200

License

http://creativecommons.org/licenses/by/4.0/

File(s)