volume 56 issue 3 pages 1-58

A Primer on Seq2Seq Models for Generative Chatbots

Vincenzo Scotti 1
Licia Sbattella 1
Roberto C. Tedesco 1
Publication typeJournal Article
Publication date2023-10-06
scimago Q1
wos Q1
SJR5.797
CiteScore51.6
Impact factor28.0
ISSN03600300, 15577341
Theoretical Computer Science
General Computer Science
Abstract

The recent spread of Deep Learning-based solutions for Artificial Intelligence and the development of Large Language Models has pushed forwards significantly the Natural Language Processing area. The approach has quickly evolved in the last ten years, deeply affecting NLP, from low-level text pre-processing tasks –such as tokenisation or POS tagging– to high-level, complex NLP applications like machine translation and chatbots. This article examines recent trends in the development of open-domain data-driven generative chatbots, focusing on the Seq2Seq architectures. Such architectures are compatible with multiple learning approaches, ranging from supervised to reinforcement and, in the last years, allowed to realise very engaging open-domain chatbots. Not only do these architectures allow to directly output the next turn in a conversation but, to some extent, they also allow to control the style or content of the response. To offer a complete view on the subject, we examine possible architecture implementations as well as training and evaluation approaches. Additionally, we provide information about the openly available corpora to train and evaluate such models and about the current and past chatbot competitions. Finally, we present some insights on possible future directions, given the current research status.

Found 
Found 

Top-30

Journals

1
Algorithms
1 publication, 6.25%
International Journal of Speech Technology
1 publication, 6.25%
ACM Computing Surveys
1 publication, 6.25%
Journal of Medical Internet Research
1 publication, 6.25%
Machines
1 publication, 6.25%
Procedia Computer Science
1 publication, 6.25%
IEEE Transactions on Software Engineering
1 publication, 6.25%
1

Publishers

1
2
3
4
5
6
7
8
9
Institute of Electrical and Electronics Engineers (IEEE)
9 publications, 56.25%
Association for Computing Machinery (ACM)
2 publications, 12.5%
MDPI
2 publications, 12.5%
Springer Nature
1 publication, 6.25%
JMIR Publications
1 publication, 6.25%
Elsevier
1 publication, 6.25%
1
2
3
4
5
6
7
8
9
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
16
Share
Cite this
GOST |
Cite this
GOST Copy
Scotti V., Sbattella L., Tedesco R. C. A Primer on Seq2Seq Models for Generative Chatbots // ACM Computing Surveys. 2023. Vol. 56. No. 3. pp. 1-58.
GOST all authors (up to 50) Copy
Scotti V., Sbattella L., Tedesco R. C. A Primer on Seq2Seq Models for Generative Chatbots // ACM Computing Surveys. 2023. Vol. 56. No. 3. pp. 1-58.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1145/3604281
UR - https://doi.org/10.1145/3604281
TI - A Primer on Seq2Seq Models for Generative Chatbots
T2 - ACM Computing Surveys
AU - Scotti, Vincenzo
AU - Sbattella, Licia
AU - Tedesco, Roberto C.
PY - 2023
DA - 2023/10/06
PB - Association for Computing Machinery (ACM)
SP - 1-58
IS - 3
VL - 56
SN - 0360-0300
SN - 1557-7341
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2023_Scotti,
author = {Vincenzo Scotti and Licia Sbattella and Roberto C. Tedesco},
title = {A Primer on Seq2Seq Models for Generative Chatbots},
journal = {ACM Computing Surveys},
year = {2023},
volume = {56},
publisher = {Association for Computing Machinery (ACM)},
month = {oct},
url = {https://doi.org/10.1145/3604281},
number = {3},
pages = {1--58},
doi = {10.1145/3604281}
}
MLA
Cite this
MLA Copy
Scotti, Vincenzo, et al. “A Primer on Seq2Seq Models for Generative Chatbots.” ACM Computing Surveys, vol. 56, no. 3, Oct. 2023, pp. 1-58. https://doi.org/10.1145/3604281.