CATS: Customizable Abstractive Topic-based Summarization

Seyed Ali Bahrainian; George Zerveas; Fabio Crestani; Carsten Eickhoff

doi:10.1145/3464299

CATS: Customizable Abstractive Topic-based Summarization

ACM Transactions on Information Systems ◽

10.1145/3464299 ◽

2022 ◽

Vol 40 (1) ◽

pp. 1-24

Author(s):

Seyed Ali Bahrainian ◽

George Zerveas ◽

Fabio Crestani ◽

Carsten Eickhoff

Keyword(s):

Computer Science ◽

State Of The Art ◽

Original Text ◽

Learning Method ◽

Source Text ◽

Resource Setting ◽

Low Resource Setting ◽

Topic Distribution ◽

Latent Topic ◽

Abstractive Summarization

Neural sequence-to-sequence models are the state-of-the-art approach used in abstractive summarization of textual documents, useful for producing condensed versions of source text narratives without being restricted to using only words from the original text. Despite the advances in abstractive summarization, custom generation of summaries (e.g., towards a user’s preference) remains unexplored. In this article, we present CATS, an abstractive neural summarization model that summarizes content in a sequence-to-sequence fashion while also introducing a new mechanism to control the underlying latent topic distribution of the produced summaries. We empirically illustrate the efficacy of our model in producing customized summaries and present findings that facilitate the design of such systems. We use the well-known CNN/DailyMail dataset to evaluate our model. Furthermore, we present a transfer-learning method and demonstrate the effectiveness of our approach in a low resource setting, i.e., abstractive summarization of meetings minutes, where combining the main available meetings’ transcripts datasets, AMI and International Computer Science Institute(ICSI) , results in merely a few hundred training documents.

Download Full-text

Skeleton to Abstraction: An Attentive Information Extraction Schema for Enhancing the Saliency of Text Summarization

Information ◽

10.3390/info9090217 ◽

2018 ◽

Vol 9 (9) ◽

pp. 217 ◽

Cited By ~ 1

Author(s):

Xiujuan Xiang ◽

Guangluan Xu ◽

Xingyu Fu ◽

Yang Wei ◽

Li Jin ◽

...

Keyword(s):

Information Extraction ◽

Full Text ◽

State Of The Art ◽

Irrelevant Information ◽

Source Text ◽

Daily Mail ◽

Human Evaluation ◽

Proposed Model ◽

Abstractive Summarization ◽

Extraction Model

Current popular abstractive summarization is based on an attentional encoder-decoder framework. Based on the architecture, the decoder generates a summary according to the full text that often results in the decoder being interfered by some irrelevant information, thereby causing the generated summaries to suffer from low saliency. Besides, we have observed the process of people writing summaries and find that they write a summary based on the necessary information rather than the full text. Thus, in order to enhance the saliency of the abstractive summarization, we propose an attentive information extraction model. It consists of a multi-layer perceptron (MLP) gated unit that pays more attention to the important information of the source text and a similarity module to encourage high similarity between the reference summary and the important information. Before the summary decoder, the MLP and the similarity module work together to extract the important information for the decoder, thus obtaining the skeleton of the source text. This effectively reduces the interference of irrelevant information to the decoder, therefore improving the saliency of the summary. Our proposed model was tested on CNN/Daily Mail and DUC-2004 datasets, and achieved a 42.01 ROUGE-1 f-score and 33.94 ROUGE-1, recall respectively. The result outperforms the state-of-the-art abstractive model on the same dataset. In addition, by subjective human evaluation, the saliency of the generated summaries was further enhanced.

Download Full-text

A Survey on Low-Resource Neural Machine Translation

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/629 ◽

2021 ◽

Author(s):

Rui Wang ◽

Xu Tan ◽

Renqian Luo ◽

Tao Qin ◽

Tie-Yan Liu

Keyword(s):

Machine Translation ◽

Large Scale ◽

State Of The Art ◽

Neural Machine Translation ◽

Modal Data ◽

Low Resource ◽

Resource Setting ◽

Low Resource Setting ◽

Parallel Data ◽

Target Languages

Neural approaches have achieved state-of-the-art accuracy on machine translation but suffer from the high cost of collecting large scale parallel data. Thus, a lot of research has been conducted for neural machine translation (NMT) with very limited parallel data, i.e., the low-resource setting. In this paper, we provide a survey for low-resource NMT and classify related works into three categories according to the auxiliary data they used: (1) exploiting monolingual data of source and/or target languages, (2) exploiting data from auxiliary languages, and (3) exploiting multi-modal data. We hope that our survey can help researchers to better understand this field and inspire them to design better algorithms, and help industry practitioners to choose appropriate algorithms for their applications.

Download Full-text

Extremely Low-Resource Text Simplification with Pre-trained Transformer Language Model

International Journal of Asian Language Processing ◽

10.1142/s2717554520500010 ◽

2020 ◽

Vol 30 (01) ◽

pp. 2050001

Author(s):

Takumi Maruyama ◽

Kazuhide Yamamoto

Keyword(s):

Machine Translation ◽

Large Scale ◽

State Of The Art ◽

Language Model ◽

Fine Tuning ◽

Neural Machine Translation ◽

Low Resource ◽

Resource Setting ◽

Text Simplification ◽

Low Resource Setting

Inspired by machine translation task, recent text simplification approaches regard a task as a monolingual text-to-text generation, and neural machine translation models have significantly improved the performance of simplification tasks. Although such models require a large-scale parallel corpus, such corpora for text simplification are very few in number and smaller in size compared to machine translation task. Therefore, we have attempted to facilitate the training of simplification rewritings using pre-training from a large-scale monolingual corpus such as Wikipedia articles. In addition, we propose a translation language model to seamlessly conduct a fine-tuning of text simplification from the pre-training of the language model. The experimental results show that the translation language model substantially outperforms a state-of-the-art model under a low-resource setting. In addition, a pre-trained translation language model with only 3000 supervised examples can achieve a performance comparable to that of the state-of-the-art model using 30,000 supervised examples.

Download Full-text

Initial Clinical Experience With a State-of-the-Art Linear Accelerator for Radiotherapy in a Low-Resource Setting: The First 35 Patients Treated Via a Guatemalan-American Partnership

International Journal of Radiation Oncology*Biology*Physics ◽

10.1016/j.ijrobp.2020.07.2505 ◽

2020 ◽

Vol 108 (3) ◽

pp. e427-e428

Author(s):

K. Lee ◽

A. Velarde ◽

K.D. Najera ◽

L. Sobrevilla ◽

E. Palacios ◽

...

Keyword(s):

Clinical Experience ◽

Linear Accelerator ◽

State Of The Art ◽

Low Resource ◽

Resource Setting ◽

Low Resource Setting ◽

Initial Clinical Experience

Download Full-text

Evaluation of a New Protocol of Insulin Dose Adjustment in a Low-Resource Setting

Diabetes ◽

10.2337/db18-93-lb ◽

2018 ◽

Vol 67 (Supplement 1) ◽

pp. 93-LB

Author(s):

EDDY JEAN BAPTISTE ◽

PHILIPPE LARCO ◽

MARIE-NANCY CHARLES LARCO ◽

JULIA E. VON OETTINGEN ◽

EDDLYS DUBOIS ◽

...

Keyword(s):

Dose Adjustment ◽

Insulin Dose ◽

Low Resource ◽

Resource Setting ◽

Low Resource Setting

Download Full-text

Quality indicators and post operative outcome of ERCP performed in a low resource setting ; can quality indicators from developed settings be applied?

10.26226/morressier.59a6b34bd462b80290b557f5 ◽

2017 ◽

Author(s):

Supun Kulatunge

Keyword(s):

Quality Indicators ◽

Low Resource ◽

Resource Setting ◽

Low Resource Setting ◽

Operative Outcome

Download Full-text

Right paraduodenal hernia with extensive bowel gangrene treated with staged surgery: a Bogota bag followed by resection in a low-resource setting

BMJ Case Reports ◽

10.1136/bcr-2020-239250 ◽

2021 ◽

Vol 14 (4) ◽

pp. e239250

Author(s):

Vijay Anand Ismavel ◽

Moloti Kichu ◽

David Paul Hechhula ◽

Rebecca Yanadi

Keyword(s):

Plastic Material ◽

Successful Outcome ◽

Paraduodenal Hernia ◽

Transparent Plastic ◽

Low Resource ◽

Entire Small Bowel ◽

Resource Setting ◽

Low Resource Setting ◽

Right Paraduodenal Hernia ◽

Bowel Gangrene

We report a case of right paraduodenal hernia with strangulation of almost the entire small bowel at presentation. Since resection of all bowel of doubtful viability would have resulted in too little residual length to sustain life, a Bogota bag was fashioned using transparent plastic material from an urine drainage bag and the patient monitored intensively for 18 hours. At re-laparotomy, clear demarcation lines had formed with adequate length of viable bowel (100 cm) and resection with anastomosis was done with a good outcome on follow-up, 9 months after surgery. Our description of a rare cause of strangulated intestinal obstruction and a novel method of maximising length of viable bowel is reported for its successful outcome in a low-resource setting.

Download Full-text

Humanitarian Surgical Missions in Times of COVID-19: Recommendations to Safely Return to a Sub-Saharan Africa Low-Resource Setting

World Journal of Surgery ◽

10.1007/s00268-021-06001-x ◽

2021 ◽

Author(s):

Víctor Lopez-Lopez ◽

Ana Morales ◽

Elisa García-Vazquez ◽

Miguel González ◽

Quiteria Hernandez ◽

...

Keyword(s):

Sub Saharan Africa ◽

Low Resource ◽

Resource Setting ◽

Low Resource Setting ◽

Sub Saharan

Download Full-text

Vulvar cancer: surgical management and survival trends in a low resource setting

Journal of the Egyptian National Cancer Institute ◽

10.1186/s43046-019-0015-y ◽

2020 ◽

Vol 32 (1) ◽

Author(s):

Navin Kumar ◽

Mukur Dipi Ray ◽

D. N. Sharma ◽

Rambha Pandey ◽

Kanak Lata ◽

...

Keyword(s):

Surgical Management ◽

Vulvar Cancer ◽

Low Resource ◽

Resource Setting ◽

Low Resource Setting

Download Full-text

Uptake of next‐generation sequencing in children with end‐stage renal disease secondary to focal segmental glomerulosclerosis and parental decision for kidney transplantation—Experience from a low resource setting: A Retrospective Cohort Study

Pediatric Transplantation ◽

10.1111/petr.13960 ◽

2020 ◽

Author(s):

Rajiv Sinha ◽

Subhankar Sarkar ◽

Kausik Mandal ◽

Yincent Tse

Keyword(s):

Cohort Study ◽

Kidney Transplantation ◽

Next Generation Sequencing ◽

End Stage Renal Disease ◽

Retrospective Cohort ◽

Resource Setting ◽

Low Resource Setting ◽

Stage Renal Disease ◽

End Stage ◽

Generation Sequencing

Download Full-text