Open Access
Open access

Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs

Publication typeBook Chapter
Publication date2025-01-01
scimago Q2
SJR0.352
CiteScore2.4
Impact factor
ISSN03029743, 16113349, 18612075, 18612083
Abstract
VIsual STorytelling (VIST) is a task that transforms a sequence of images into narrative text stories. A narrative story requires an understanding of the contexts and relationships among images. Our study introduces a story generation process that emphasizes creating a coherent narrative by constructing both image and narrative contexts to control the coherence. First, the image contexts are generated from the content of individual images, using image features and scene graphs that detail the elements of the images. Second, the narrative context is generated by focusing on the overall image sequence. Ensuring that each caption fits within the overall story maintaining continuity and coherence. We also introduce a narrative concept summary, which is external knowledge represented as a knowledge graph. This summary encapsulates the narrative concept of an image sequence to enhance the understanding of its overall content. Following this, both image and narrative contexts are used to generate a coherent and engaging narrative. This framework is based on Long Short-Term Memory (LSTM) with an attention mechanism. We evaluate the proposed method using the VIST dataset, and the results highlight the importance of understanding the context of an image sequence in generating coherent and engaging stories. The study demonstrates the significance of incorporating narrative context into the generation process to ensure the coherence of the generated narrative.
Found 

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
0
Share
Cite this
GOST |
Cite this
GOST Copy
Phueaksri I. et al. Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs // Lecture Notes in Computer Science. 2025. pp. 226-239.
GOST all authors (up to 50) Copy
Phueaksri I., Kastner M. A., Kawanishi Y., Komamizu T., Ide I. Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs // Lecture Notes in Computer Science. 2025. pp. 226-239.
RIS |
Cite this
RIS Copy
TY - GENERIC
DO - 10.1007/978-981-96-2071-5_17
UR - https://link.springer.com/10.1007/978-981-96-2071-5_17
TI - Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs
T2 - Lecture Notes in Computer Science
AU - Phueaksri, Itthisak
AU - Kastner, Marc A
AU - Kawanishi, Yasutomo
AU - Komamizu, Takahiro
AU - Ide, Ichiro
PY - 2025
DA - 2025/01/01
PB - Springer Nature
SP - 226-239
SN - 0302-9743
SN - 1611-3349
SN - 1861-2075
SN - 1861-2083
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@incollection{2025_Phueaksri,
author = {Itthisak Phueaksri and Marc A Kastner and Yasutomo Kawanishi and Takahiro Komamizu and Ichiro Ide},
title = {Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs},
publisher = {Springer Nature},
year = {2025},
pages = {226--239},
month = {jan}
}