Multi-task Perception for Autonomous Driving

Xiaodan Liang 1
Xiwen Liang 1
Hang Xu 2
Publication typeBook Chapter
Publication date2023-10-06
SJR
CiteScore2.7
Impact factor
ISSN21916586, 21916594
Abstract
Multi-task learning has become a popular paradigm to tackle multiple tasks simultaneously with less inference time and computation resources. Recently, many self-supervised pre-training methods have been proposed and they have achieved impressive performance on a range of computer vision tasks. However, their generalization ability to multi-task scenarios is yet to be explored. Besides, most multi-task algorithms are designed for specific tasks usually not within the scope of autonomous driving, which makes it difficult to compare state-of-the-art multi-task learning methods in autonomous driving. In this chapter, we divide the multi-task perception into 2D perception and 3D perception in autonomous driving. For 2D perception, we extensively investigate the transfer ability of various self-supervised methods and reproduce multiple popular multi-task methods. Then we introduce a simple and effective pretrain-adapt-finetune paradigm for multi-task learning and a novel adapter named LV-Adapter which reuses powerful knowledge from the Contrastive Language-Image Pre-training (CLIP) model pre-trained on image-text pairs. We further present an effective multi-task framework for autonomous driving, GT-Prompt, which learns general prompts and generates task-specific prompts to guide the model to capture task-invariant and task-specific information. For 3D perception, we investigate both multi-modality fusion and multi-task learning, and introduce an effective multi-level gradient calibration learning framework across tasks and modalities during optimization.
Found 
Found 

Top-30

Journals

1
Advanced Engineering Informatics
1 publication, 33.33%
Lecture Notes in Computer Science
1 publication, 33.33%
IEEE Transactions on Pattern Analysis and Machine Intelligence
1 publication, 33.33%
1

Publishers

1
Elsevier
1 publication, 33.33%
Springer Nature
1 publication, 33.33%
Institute of Electrical and Electronics Engineers (IEEE)
1 publication, 33.33%
1
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
3
Share
Cite this
GOST |
Cite this
GOST Copy
Liang X., Liang X., Xu H. Multi-task Perception for Autonomous Driving // Advances in Computer Vision and Pattern Recognition. 2023. pp. 281-321.
GOST all authors (up to 50) Copy
Liang X., Liang X., Xu H. Multi-task Perception for Autonomous Driving // Advances in Computer Vision and Pattern Recognition. 2023. pp. 281-321.
RIS |
Cite this
RIS Copy
TY - GENERIC
DO - 10.1007/978-981-99-4287-9_9
UR - https://doi.org/10.1007/978-981-99-4287-9_9
TI - Multi-task Perception for Autonomous Driving
T2 - Advances in Computer Vision and Pattern Recognition
AU - Liang, Xiaodan
AU - Liang, Xiwen
AU - Xu, Hang
PY - 2023
DA - 2023/10/06
PB - Springer Nature
SP - 281-321
SN - 2191-6586
SN - 2191-6594
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@incollection{2023_Liang,
author = {Xiaodan Liang and Xiwen Liang and Hang Xu},
title = {Multi-task Perception for Autonomous Driving},
publisher = {Springer Nature},
year = {2023},
pages = {281--321},
month = {oct}
}