Efficient semantic segmentation through dense upscaling convolutions
Conference Publication ResearchOnline@JCU
Abstract
Semantic segmentation is the classification of each pixel in an image as belonging to an object; the resulting pixel map is widely used in many fields. Fields where this technology is being actively researched include medicine, agriculture and robotics. For uses where resources or power are restricted, such as robotics, or where large numbers of images must be processed, efficiency can be key to the feasibility of a technique. Applications that require real-time processing also need fast and efficient methods, especially where collision avoidance or safety may be involved. We take a combination of existing semantic segmentation methods and improve their efficiency by replacing the decoder network in ERFNet with a method based upon Dense Upscaling Convolutions. We then add a novel layer that allows fine tuning of the decoder channel depth and therefore of the efficiency of the network. Our proposed modification achieves a 20-30% improvement in efficiency on moderate hardware (Nvidia GTX 960) over the original ERFNet and an additional 10% efficiency over the original Dense Upscaling Convolution. We perform a series of experiments to determine viable hyperparameters for the modification and measure the efficiency and accuracy over a range of image sizes, demonstrating the viability of our approach.
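As a rough illustration of the idea behind Dense Upscaling Convolutions mentioned in the abstract: a convolution at low resolution emits r*r channels per output class, and those channels are then rearranged into a map r times larger in each spatial dimension, with no learned upsampling layers in between. The sketch below shows only that rearrangement step in plain Python. The function name `depth_to_space`, the channel ordering, and the toy input are assumptions made for illustration; the abstract does not specify the paper's decoder or its channel-depth layer, and neither is reproduced here.

```python
def depth_to_space(x, r):
    """Rearrange a nested list shaped [C*r*r][H][W] into [C][H*r][W*r].

    Assumed channel ordering (hypothetical, chosen for this sketch):
    input channel c*r*r + i*r + j fills output position (y*r + i, z*r + j).
    """
    channels_in = len(x)
    if channels_in % (r * r) != 0:
        raise ValueError("channel count must be divisible by r*r")
    c_out = channels_in // (r * r)
    h, w = len(x[0]), len(x[0][0])
    # Pre-allocate the high-resolution output maps.
    out = [[[0] * (w * r) for _ in range(h * r)] for _ in range(c_out)]
    for c in range(c_out):
        for i in range(r):
            for j in range(r):
                src = x[c * r * r + i * r + j]
                for y in range(h):
                    for z in range(w):
                        out[c][y * r + i][z * r + j] = src[y][z]
    return out


# Toy input: 4 channels of a 1x2 map with upscale factor 2,
# giving a single 2x4 output map.
low_res = [[[1, 2]], [[3, 4]], [[5, 6]], [[7, 8]]]
high_res = depth_to_space(low_res, 2)
print(high_res)  # [[[1, 3, 2, 4], [5, 7, 6, 8]]]
```

Because the rearrangement itself has no parameters, the cost of such a decoder is dominated by the convolution that produces the C*r*r channels, which is presumably why adjusting the decoder channel depth affects efficiency.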
Journal
N/A
Publication Name
ICSIM'20: 3rd International Conference on Software Engineering and Information Management
Volume
N/A
ISBN/ISSN
978-1-4503-7690-7
Edition
N/A
Issue
N/A
Pages Count
5
Location
Sydney, NSW, Australia
Publisher
Association for Computing Machinery
Publisher Url
N/A
Publisher Location
New York, NY, USA
Publish Date
N/A
Url
N/A
Date
N/A
EISSN
N/A
DOI
10.1145/3378936.3378941