The Rare Word Issue in Natural Language Generation: A Character-Based Solution

Giovanni Bonetta; Marco Roberti; Rossella Cancelliere; Patrick Gallinari

doi:10.3390/informatics8010020

The Rare Word Issue in Natural Language Generation: A Character-Based Solution

Informatics ◽

10.3390/informatics8010020 ◽

2021 ◽

Vol 8 (1) ◽

pp. 20

Author(s):

Giovanni Bonetta ◽

Marco Roberti ◽

Rossella Cancelliere ◽

Patrick Gallinari

Keyword(s):

Natural Language ◽

Error Probability ◽

Essential Feature ◽

Proper Names ◽

Neural Model ◽

Natural Language Generation ◽

Training Phase ◽

Tabular Data ◽

Rare Word ◽

Language Generation

In this paper, we analyze the problem of generating fluent English utterances from tabular data, focusing on the development of a sequence-to-sequence neural model which shows two major features: the ability to read and generate character-wise, and the ability to switch between generating and copying characters from the input: an essential feature when inputs contain rare words like proper names, telephone numbers, or foreign words. Working with characters instead of words is a challenge that can bring problems such as increasing the difficulty of the training phase and a bigger error probability during inference. Nevertheless, our work shows that these issues can be solved and efforts are repaid by the creation of a fully end-to-end system, whose inputs and outputs are not constrained to be part of a predefined vocabulary, like in word-based models. Furthermore, our copying technique is integrated with an innovative shift mechanism, which enhances the ability to produce outputs directly from inputs. We assess performance on the E2E dataset, the benchmark used for the E2E NLG challenge, and on a modified version of it, created to highlight the rare word copying capabilities of our model. The results demonstrate clear improvements over the baseline and promising performance compared to recent techniques in the literature.

Download Full-text

Computing Accurate Grammatical Feedback in a Virtual Writing Conference for German-Speaking Elementary-School Children: An Approach Based on Natural Language Generation

CALICO Journal ◽

10.1558/cj.v26i3.626-643 ◽

2013 ◽

Vol 26 (3) ◽

pp. 626-643 ◽

Cited By ~ 1

Author(s):

Karin Harbusch ◽

Gergana Itsova ◽

Ulrich Koch ◽

Christine Kühner

Keyword(s):

Elementary School ◽

Natural Language ◽

School Children ◽

Elementary School Children ◽

Natural Language Generation ◽

Language Generation ◽

Writing Conference ◽

German Speaking

Download Full-text

Why Business Intelligence Needs Artificial Intelligence (AI) and Advanced Natural Language Generation (NLG)

Journal of Environmental Science Computer Science and Engineering & Technology ◽

10.24214/jecet.b.6.4.266274 ◽

2017 ◽

Vol 6 (4) ◽

Keyword(s):

Artificial Intelligence ◽

Natural Language ◽

Business Intelligence ◽

Natural Language Generation ◽

Language Generation

Download Full-text

Proceedings of the 8th European workshop on Natural Language Generation - EWNLG '01

10.3115/1117840 ◽

2001 ◽

Keyword(s):

Natural Language ◽

Natural Language Generation ◽

Language Generation ◽

European Workshop

Download Full-text

Proceedings of the Fifth International Natural Language Generation Conference on - INLG '08

10.3115/1708322 ◽

2008 ◽

Keyword(s):

Natural Language ◽

Natural Language Generation ◽

Language Generation

Download Full-text

The errors analysis of natural language generation — A case study of Topic-to-Essay generation

2020 16th International Conference on Computational Intelligence and Security (CIS) ◽

10.1109/cis52066.2020.00027 ◽

2020 ◽

Author(s):

Ping Cai ◽

Xingyuan Chen ◽

Hongjun Wang ◽

Peng Jin

Keyword(s):

Natural Language ◽

Natural Language Generation ◽

Language Generation ◽

Errors Analysis

Download Full-text

Towards more effective online environmental information provision through tailored Natural Language Generation: Profiles of Scottish river user groups and an evaluative online experiment

The Science of The Total Environment ◽

10.1016/j.scitotenv.2019.03.440 ◽

2019 ◽

Vol 673 ◽

pp. 643-655 ◽

Cited By ~ 1

Author(s):

Koen Arts ◽

Christopher J.A. Macleod ◽

Antonio A.R. Ioris ◽

Xiwu Han ◽

Somayajulu Sripada ◽

...

Keyword(s):

Natural Language ◽

Information Provision ◽

Natural Language Generation ◽

Environmental Information ◽

Online Experiment ◽

Language Generation ◽

User Groups

Download Full-text

ProBot – A Procedure Chatbot for Digital Procedural Adherence

Proceedings of the Human Factors and Ergonomics Society Annual Meeting ◽

10.1177/1071181320641054 ◽

2020 ◽

Vol 64 (1) ◽

pp. 224-228

Author(s):

Nilesh Ade ◽

Noor Quddus ◽

Trent Parker ◽

S.Camille Peres

Keyword(s):

Artificial Intelligence ◽

Deep Learning ◽

Natural Language ◽

Industry 4.0 ◽

Natural Language Generation ◽

Conversational Agent ◽

Safe Distance ◽

Process Industries ◽

Language Generation ◽

Plastic Pellets

One of the major implications of Industry 4.0 will be the application of digital procedures in process industries. Digital procedures are procedures that are accessed through a smart gadget such as a tablet or a phone. However, like paper-based procedures their usability is limited by their access. The issue of accessibility is magnified in tasks such as loading a hopper car with plastic pellets wherein the operators typically place the procedure at a safe distance from the worksite. This drawback can be tackled in the case of digital procedures using artificial intelligence-based voice enabled conversational agent (chatbot). As a part of this study, we have developed a chatbot for assisting digital procedure adherence. The chatbot is trained using the possible set of queries from the operator and text from the digital procedures through deep learning and provides responses using natural language generation. The testing of the chatbot is performed using a simulated conversation with an operator performing the task of loading a hopper car.

Download Full-text