Refining Action Segmentation with Hierarchical Video Representations

Publication typeProceedings Article
Publication date2021-10-01
Abstract
In this paper, we propose Hierarchical Action Segmentation Refiner (HASR), which can refine temporal action segmentation results from various models by understanding the overall context of a given video in a hierarchical way. When a backbone model for action segmentation estimates how the given video can be segmented, our model extracts segment-level representations based on frame-level features, and extracts a video-level representation based on the segment-level representations. Based on these hierarchical representations, our model can refer to the overall context of the entire video, and predict how the segment labels that are out of context should be corrected. Our HASR can be plugged into various action segmentation models (MS-TCN, SSTDA, ASRF), and improve the performance of state-of-the-art models based on three challenging datasets (GTEA, 50Salads, and Breakfast). For example, in 50Salads dataset, the segmental edit score improves from 67.9% to 77.4% (MS-TCN), from 75.8% to 77.3% (SSTDA), from 79.3% to 81.0% (ASRF). In addition, our model can refine the segmentation result from the unseen backbone model, which was not referred to when training HASR. This generalization performance would make HASR be an effective tool for boosting up the existing approaches for temporal action segmentation. Our code is available at https://github.com/cotton-ahn/HASR_iccv2021.
Found 
Found 

Top-30

Journals

1
2
3
4
Lecture Notes in Computer Science
4 publications, 8%
Pattern Recognition
3 publications, 6%
Mathematics
2 publications, 4%
IEEE Transactions on Pattern Analysis and Machine Intelligence
2 publications, 4%
Energies
1 publication, 2%
Multimedia Systems
1 publication, 2%
Neural Processing Letters
1 publication, 2%
IEEE Transactions on Industrial Informatics
1 publication, 2%
Multimedia Tools and Applications
1 publication, 2%
Machine Vision and Applications
1 publication, 2%
Applied Intelligence
1 publication, 2%
IEEE Transactions on Circuits and Systems for Video Technology
1 publication, 2%
Applied Sciences (Switzerland)
1 publication, 2%
International journal of computer assisted radiology and surgery
1 publication, 2%
Signal, Image and Video Processing
1 publication, 2%
Neural Computing and Applications
1 publication, 2%
IISE Transactions
1 publication, 2%
IEEE Open Journal of Signal Processing
1 publication, 2%
Image and Vision Computing
1 publication, 2%
IEEE Transactions on Neural Networks and Learning Systems
1 publication, 2%
Computers and Graphics
1 publication, 2%
1
2
3
4

Publishers

5
10
15
20
25
30
Institute of Electrical and Electronics Engineers (IEEE)
26 publications, 52%
Springer Nature
12 publications, 24%
Elsevier
5 publications, 10%
MDPI
4 publications, 8%
Taylor & Francis
1 publication, 2%
Association for Computing Machinery (ACM)
1 publication, 2%
5
10
15
20
25
30
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
50
Share