volume 129 pages 108764

Maximization and restoration: Action segmentation through dilation passing and temporal reconstruction

Publication typeJournal Article
Publication date2022-09-01
scimago Q1
wos Q1
SJR2.058
CiteScore15.8
Impact factor7.6
ISSN00313203, 18735142
Artificial Intelligence
Software
Signal Processing
Computer Vision and Pattern Recognition
Abstract
• We propose a divide-and-conquer method that first maximizes frame accuracy and then reconstructs the features to reduce over-segmentation. • Dilation passing network propagates long- and short-range features enabling better understanding of the relation between frames. • Temporal reconstruction network uses a convolutional encoder-decoder to capture local context for temporal consistency among frames. • Our model achieves meaningful results over the state-of-the-art models on three challenging datasets. Action segmentation aims to split videos into segments of different actions. Recent work focuses on dealing with long-range dependencies of long, untrimmed videos, but still suffers from over-segmentation and performance saturation due to increased model complexity. This paper addresses the aforementioned issues through a divide-and-conquer strategy that first maximizes the frame-wise classification accuracy of the model and then reduces the over-segmentation errors. This strategy is implemented with the Dilation Passing and Reconstruction Network, composed of the Dilation Passing Network, which primarily aims to increase accuracy by propagating information of different dilations, and the Temporal Reconstruction Network, which reduces over-segmentation errors by temporally encoding and decoding the output features from the Dilation Passing Network. We also propose a weighted temporal mean squared error loss that further reduces over-segmentation. Through evaluations on the 50Salads, GTEA, and Breakfast datasets, we show that our model achieves significant results compared to existing state-of-the-art models.
Found 
Found 

Top-30

Journals

1
2
3
4
5
6
Pattern Recognition
6 publications, 17.65%
Lecture Notes in Computer Science
2 publications, 5.88%
IEEE Transactions on Pattern Analysis and Machine Intelligence
2 publications, 5.88%
Engineering Applications of Artificial Intelligence
2 publications, 5.88%
Multimedia Tools and Applications
1 publication, 2.94%
Applied Intelligence
1 publication, 2.94%
Mathematics
1 publication, 2.94%
International journal of computer assisted radiology and surgery
1 publication, 2.94%
ACM Transactions on Multimedia Computing, Communications and Applications
1 publication, 2.94%
Neural Computing and Applications
1 publication, 2.94%
Nature Communications
1 publication, 2.94%
Computers and Graphics
1 publication, 2.94%
IEEE Transactions on Multimedia
1 publication, 2.94%
IEEE Transactions on Image Processing
1 publication, 2.94%
1
2
3
4
5
6

Publishers

2
4
6
8
10
12
14
Institute of Electrical and Electronics Engineers (IEEE)
13 publications, 38.24%
Elsevier
9 publications, 26.47%
Springer Nature
7 publications, 20.59%
Association for Computing Machinery (ACM)
2 publications, 5.88%
MDPI
1 publication, 2.94%
2
4
6
8
10
12
14
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
34
Share
Cite this
GOST |
Cite this
GOST Copy
Park J. et al. Maximization and restoration: Action segmentation through dilation passing and temporal reconstruction // Pattern Recognition. 2022. Vol. 129. p. 108764.
GOST all authors (up to 50) Copy
Park J., Kim D., Huh S., Jo S. Maximization and restoration: Action segmentation through dilation passing and temporal reconstruction // Pattern Recognition. 2022. Vol. 129. p. 108764.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1016/j.patcog.2022.108764
UR - https://doi.org/10.1016/j.patcog.2022.108764
TI - Maximization and restoration: Action segmentation through dilation passing and temporal reconstruction
T2 - Pattern Recognition
AU - Park, Junyong
AU - Kim, Daekyum
AU - Huh, Sejoon
AU - Jo, Sungho
PY - 2022
DA - 2022/09/01
PB - Elsevier
SP - 108764
VL - 129
SN - 0031-3203
SN - 1873-5142
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@article{2022_Park,
author = {Junyong Park and Daekyum Kim and Sejoon Huh and Sungho Jo},
title = {Maximization and restoration: Action segmentation through dilation passing and temporal reconstruction},
journal = {Pattern Recognition},
year = {2022},
volume = {129},
publisher = {Elsevier},
month = {sep},
url = {https://doi.org/10.1016/j.patcog.2022.108764},
pages = {108764},
doi = {10.1016/j.patcog.2022.108764}
}
Profiles