Open Access
,
pages 226-239
Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs
Itthisak Phueaksri
1, 2
,
Marc A Kastner
1, 3
,
Yasutomo Kawanishi
1, 2
,
Takahiro Komamizu
1
,
Ichiro Ide
1
Publication type: Book Chapter
Publication date: 2025-01-01
scimago Q2
SJR: 0.352
CiteScore: 2.4
Impact factor: —
ISSN: 03029743, 16113349, 18612075, 18612083
Abstract
VIsual STorytelling (VIST) is a task that transforms a sequence of images into narrative text stories. A narrative story requires an understanding of the contexts and relationships among images. Our study introduces a story generation process that emphasizes creating a coherent narrative by constructing both image and narrative contexts to control the coherence. First, the image contexts are generated from the content of individual images, using image features and scene graphs that detail the elements of the images. Second, the narrative context is generated by focusing on the overall image sequence. Ensuring that each caption fits within the overall story maintaining continuity and coherence. We also introduce a narrative concept summary, which is external knowledge represented as a knowledge graph. This summary encapsulates the narrative concept of an image sequence to enhance the understanding of its overall content. Following this, both image and narrative contexts are used to generate a coherent and engaging narrative. This framework is based on Long Short-Term Memory (LSTM) with an attention mechanism. We evaluate the proposed method using the VIST dataset, and the results highlight the importance of understanding the context of an image sequence in generating coherent and engaging stories. The study demonstrates the significance of incorporating narrative context into the generation process to ensure the coherence of the generated narrative.
Found
Nothing found, try to update filter.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
0
Total citations:
0
Cite this
GOST |
RIS |
BibTex
Cite this
GOST
Copy
Phueaksri I. et al. Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs // Lecture Notes in Computer Science. 2025. pp. 226-239.
GOST all authors (up to 50)
Copy
Phueaksri I., Kastner M. A., Kawanishi Y., Komamizu T., Ide I. Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs // Lecture Notes in Computer Science. 2025. pp. 226-239.
Cite this
RIS
Copy
TY - GENERIC
DO - 10.1007/978-981-96-2071-5_17
UR - https://link.springer.com/10.1007/978-981-96-2071-5_17
TI - Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs
T2 - Lecture Notes in Computer Science
AU - Phueaksri, Itthisak
AU - Kastner, Marc A
AU - Kawanishi, Yasutomo
AU - Komamizu, Takahiro
AU - Ide, Ichiro
PY - 2025
DA - 2025/01/01
PB - Springer Nature
SP - 226-239
SN - 0302-9743
SN - 1611-3349
SN - 1861-2075
SN - 1861-2083
ER -
Cite this
BibTex (up to 50 authors)
Copy
@incollection{2025_Phueaksri,
author = {Itthisak Phueaksri and Marc A Kastner and Yasutomo Kawanishi and Takahiro Komamizu and Ichiro Ide},
title = {Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs},
publisher = {Springer Nature},
year = {2025},
pages = {226--239},
month = {jan}
}
Profiles