Open Access

Lecture Notes in Computer Science

, pages 226-239

Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs

Itthisak Phueaksri ^{1, 2}

Marc A Kastner ^{1, 3}

Yasutomo Kawanishi ^{1, 2}

Takahiro Komamizu ¹

Ichiro Ide ¹

Hide authors affiliations Show authors affiliations: 3 affiliations

Nagoya University, Nagoya, Japan |

RIKEN, Kyoto, Japan |

Hiroshima City University, Hiroshima, Japan |

Publication type: Book Chapter

Publication date: 2025-01-01

Springer Nature

Lecture Notes in Computer Science

scimago Q2

SJR: 0.352

CiteScore: 2.4

Impact factor: —

ISSN: 03029743, 16113349, 18612075, 18612083

DOI: 10.1007/978-981-96-2071-5_17

Copy DOI

Abstract

VIsual STorytelling (VIST) is a task that transforms a sequence of images into narrative text stories. A narrative story requires an understanding of the contexts and relationships among images. Our study introduces a story generation process that emphasizes creating a coherent narrative by constructing both image and narrative contexts to control the coherence. First, the image contexts are generated from the content of individual images, using image features and scene graphs that detail the elements of the images. Second, the narrative context is generated by focusing on the overall image sequence. Ensuring that each caption fits within the overall story maintaining continuity and coherence. We also introduce a narrative concept summary, which is external knowledge represented as a knowledge graph. This summary encapsulates the narrative concept of an image sequence to enhance the understanding of its overall content. Following this, both image and narrative contexts are used to generate a coherent and engaging narrative. This framework is based on Long Short-Term Memory (LSTM) with an attention mechanism. We evaluate the proposed method using the VIST dataset, and the results highlight the importance of understanding the context of an image sequence in generating coherent and engaging stories. The study demonstrates the significance of incorporating narrative context into the generation process to ensure the coherence of the generated narrative.

Found

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.

Metrics

Cite this

GOST |

Cite this

GOST Copy

Phueaksri I. et al. Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs // Lecture Notes in Computer Science. 2025. pp. 226-239.

GOST all authors (up to 50) Copy

Phueaksri I., Kastner M. A., Kawanishi Y., Komamizu T., Ide I. Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs // Lecture Notes in Computer Science. 2025. pp. 226-239.

RIS |

Cite this

RIS Copy

TY - GENERIC

DO - 10.1007/978-981-96-2071-5_17

UR - https://link.springer.com/10.1007/978-981-96-2071-5_17

TI - Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs

T2 - Lecture Notes in Computer Science

AU - Phueaksri, Itthisak

AU - Kastner, Marc A

AU - Kawanishi, Yasutomo

AU - Komamizu, Takahiro

AU - Ide, Ichiro

PY - 2025

DA - 2025/01/01

PB - Springer Nature

SP - 226-239

SN - 0302-9743

SN - 1611-3349

SN - 1861-2075

SN - 1861-2083

ER -

BibTex

Cite this

BibTex (up to 50 authors) Copy

@incollection{2025_Phueaksri,

author = {Itthisak Phueaksri and Marc A Kastner and Yasutomo Kawanishi and Takahiro Komamizu and Ichiro Ide},

title = {Towards Visual Storytelling by Understanding Narrative Context Through Scene-Graphs},

publisher = {Springer Nature},

year = {2025},

pages = {226--239},

month = {jan}

}

Publisher

Springer Nature

Journal

Lecture Notes in Computer Science

scimago Q2

SJR

0.352

CiteScore

2.4

Impact factor

—

ISSN

03029743 (Print)

16113349 (Electronic)

18612075 (Print)

18612083 (Electronic)

Profiles

Yasutomo Kawanishi