DOZEN: Cross-Domain Zero Shot Named Entity Recognition with Knowledge Graph

Building the Knowledge Graph for Zakat (KGZ) in Indonesian Language

ASM Science Journal ◽

10.32802/asmscj.2021.758 ◽

2021 ◽

Vol 16 ◽

pp. 1-10

Author(s):

Husni Teja Sukmana ◽

JM Muslimin ◽

Asep Fajar Firmansyah ◽

Lee Kyung Oh

Keyword(s):

Named Entity Recognition ◽

General Purpose ◽

Entity Recognition ◽

Basic Knowledge ◽

Knowledge Graph ◽

Specific Domain ◽

Named Entity ◽

Description Framework ◽

General Source ◽

Domain Information

In Indonesia, philanthropy is closely identified with Zakat. Zakat constitutes a specific domain because it has its own body of knowledge. This research studies a knowledge graph for the Zakat domain in Indonesia, called KGZ. Work in this area is still rare, making KGZ the first knowledge graph for Zakat in Indonesia. It is designed to provide basic knowledge on Zakat and on managing Zakat in Indonesia. Building KGZ raises two issues. First, existing Indonesian named entity recognition (NER) is unrestricted and general-purpose, with data obtained from general sources such as news. Second, there is no NER dataset for the Zakat domain. We define four steps to build KGZ: data acquisition, extraction of entities and their relationships, mapping to an ontology, and deployment of the knowledge graph with visualizations. This research contributes a knowledge graph for Zakat (KGZ) and an NER model for Zakat, called KGZ-NER. We defined 17 new named entity classes related to Zakat, with 272 entities and 169 relationships, and provide labelled datasets for KGZ-NER that are publicly accessible. We applied the Indonesian-Open Domain Information Extractor framework to identify relationships between entities. We then modelled the information using the Resource Description Framework (RDF) to build the knowledge base for KGZ and stored it in GraphDB, a product from Ontotext. The NER model achieves a precision of 0.7641, a recall of 0.4544, and an F1-score of 0.5655. KGZ needs more data to cover all knowledge of Zakat and its management in Indonesia, and sufficient resources are required for future work.
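
As a quick check on how the reported scores relate, recall that the F1-score is the harmonic mean of precision and recall. The sketch below applies that standard formula to the precision and recall quoted above; the function name `f1_score` is illustrative and not taken from the paper. The result (about 0.570) is close to, but not identical with, the reported 0.5655. One possible explanation, which the abstract does not confirm, is that the reported figure was averaged over entity classes rather than computed from the aggregate precision and recall.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as quoted in the KGZ-NER abstract.
kgz_f1 = f1_score(0.7641, 0.4544)
print(round(kgz_f1, 4))
```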

Cross-Domain and Semisupervised Named Entity Recognition in Chinese Social Media: A Unified Model

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2018.2856625 ◽

2018 ◽

Vol 26 (11) ◽

pp. 2142-2152 ◽

Cited By ~ 8

Author(s):

Jingjing Xu ◽

Hangfeng He ◽

Xu Sun ◽

Xuancheng Ren ◽

Sujian Li

Keyword(s):

Social Media ◽

Named Entity Recognition ◽

Unified Model ◽

Entity Recognition ◽

Named Entity ◽

Cross Domain ◽

Chinese Social Media

Deep Learning-Based Named Entity Recognition and Knowledge Graph Construction for Geological Hazards

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9010015 ◽

2019 ◽

Vol 9 (1) ◽

pp. 15 ◽

Cited By ~ 2

Author(s):

Runyu Fan ◽

Lizhe Wang ◽

Jining Yan ◽

Weijing Song ◽

Yingqian Zhu ◽

...

Keyword(s):

Deep Learning ◽

Large Scale ◽

Conditional Random Field ◽

Named Entity Recognition ◽

Entity Recognition ◽

Knowledge Graph ◽

Geological Hazard ◽

Geological Hazards ◽

Named Entity ◽

Corpus Construction

Constructing a knowledge graph of geological hazards literature can facilitate the reuse of that literature and provide a reference for geological hazard governance. Named entity recognition (NER), a core technology for constructing a geological hazard knowledge graph, faces the challenge that named entities in geological hazard literature are diverse in form, ambiguous in semantics, and uncertain in context. This makes it difficult to design practical features for NER classification. To address this problem, this paper proposes a deep learning-based NER model, the deep, multi-branch BiGRU-CRF model, which combines a multi-branch bidirectional gated recurrent unit (BiGRU) layer with a conditional random field (CRF) model. In an end-to-end, supervised process, the proposed model automatically learns and transforms features with the multi-branch BiGRU layer and refines the output with a CRF layer. We also propose a pattern-based corpus construction method to build the corpus needed by the model. Experimental results indicate that the proposed model outperforms state-of-the-art models. Using the proposed model, we constructed a large-scale geological hazard literature knowledge graph containing 34,457 entity nodes and 84,561 relations.
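
To illustrate what a CRF layer adds on top of per-token scores, the following is a minimal sketch of Viterbi decoding for a linear-chain CRF. It is not the paper's implementation: the tag names (`O`, `B-HAZ`, `I-HAZ`), the emission scores (which in a BiGRU-CRF would come from the recurrent layer), and the transition scores are all hypothetical values chosen for the example.

```python
from typing import Dict, List


def viterbi_decode(emissions: List[Dict[str, float]],
                   transitions: Dict[str, Dict[str, float]],
                   tags: List[str]) -> List[str]:
    """Return the highest-scoring tag sequence for a linear-chain CRF."""
    # best[t][tag]: best score of any path that ends in `tag` at position t.
    best = [{tag: emissions[0][tag] for tag in tags}]
    back = []  # back[t - 1][tag]: previous tag on that best path
    for t in range(1, len(emissions)):
        scores, pointers = {}, {}
        for cur in tags:
            prev = max(tags, key=lambda p: best[-1][p] + transitions[p][cur])
            scores[cur] = best[-1][prev] + transitions[prev][cur] + emissions[t][cur]
            pointers[cur] = prev
        best.append(scores)
        back.append(pointers)
    # Follow the back-pointers from the best final tag.
    path = [max(tags, key=lambda tag: best[-1][tag])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]


TAGS = ["O", "B-HAZ", "I-HAZ"]  # hypothetical BIO tag set
# All transitions score 0 except O -> I-HAZ, which is strongly penalised
# because an "inside" tag should not follow an "outside" tag.
TRANSITIONS = {p: {c: 0.0 for c in TAGS} for p in TAGS}
TRANSITIONS["O"]["I-HAZ"] = -10.0

# Hypothetical per-token scores for a three-token sentence.
EMISSIONS = [
    {"O": 2.0, "B-HAZ": 0.5, "I-HAZ": 0.1},
    {"O": 0.2, "B-HAZ": 1.0, "I-HAZ": 1.2},
    {"O": 0.3, "B-HAZ": 0.2, "I-HAZ": 1.5},
]

decoded = viterbi_decode(EMISSIONS, TRANSITIONS, TAGS)
print(decoded)  # ['O', 'B-HAZ', 'I-HAZ']
```

Picking the top-scoring tag for each token independently would give `O, I-HAZ, I-HAZ`, which is not a valid BIO sequence; the transition penalty steers the decoder to `B-HAZ` for the second token instead.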

Transfer joint embedding for cross-domain named entity recognition

ACM Transactions on Information Systems ◽

10.1145/2457465.2457467 ◽

2013 ◽

Vol 31 (2) ◽

pp. 1-27 ◽

Cited By ~ 5

Author(s):

Sinno Jialin Pan ◽

Zhiqiang Toh ◽

Jian Su

Keyword(s):

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

Cross Domain ◽

Joint Embedding

Design and Evaluation of a Prescription Drug Monitoring Program for Chinese Patent Medicine based on Knowledge Graph

Evidence-based Complementary and Alternative Medicine ◽

10.1155/2021/9970063 ◽

2021 ◽

Vol 2021 ◽

pp. 1-8

Author(s):

Wangping Xiong ◽

Jun Cao ◽

Xian Zhou ◽

Jianqiang Du ◽

Bin Nie ◽

...

Keyword(s):

Prescription Drug ◽

Drug Monitoring ◽

Named Entity Recognition ◽

Monitoring Program ◽

Entity Recognition ◽

Knowledge Graph ◽

Named Entity ◽

Patent Medicines ◽

Prescription Drug Monitoring Program ◽

Chinese Patent

Background. Chinese patent medicines are increasingly used clinically, and a prescription drug monitoring program is an effective tool to promote drug safety and maintain health. Methods. We constructed a prescription drug monitoring program for Chinese patent medicines based on knowledge graphs. First, we extracted key information about Chinese patent medicines, diseases, and symptoms from a domain-specific corpus using information extraction. Second, based on the extracted entities and relationships, we constructed a knowledge graph to form a rule base for monitoring. Then, a named entity recognition model extracted the key information from the electronic medical record to be monitored and matched it against the knowledge graph to monitor the Chinese patent medicines in the prescription. Results. Named entity recognition based on the pretrained model achieved an F1 value of 83.3% on the Chinese patent medicines dataset. On the basis of entity recognition technology and the knowledge graph, we implemented a prescription drug monitoring program for Chinese patent medicines. The program's accuracy in monitoring combined medication of three or more drugs increased from 68% to 86.4%. The accuracy of drug control monitoring increased from 70% to 97%. The response time for conflicting prescriptions with two drugs was shortened from 1.3 s to 0.8 s. The response time for conflicting prescriptions with three or more drugs was shortened from 5.2 s to 1.4 s. Conclusions. The program constructed in this study responds quickly and improves the efficiency of prescription monitoring. It is of great significance for ensuring the safety of patients' medication.
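
The matching step described in the Methods can be pictured as a lookup of recognised medicine names against a rule base. The sketch below is only an illustration under stated assumptions: the rule base is reduced to a set of conflicting pairs, and the medicine names and the `find_conflicts` helper are hypothetical placeholders rather than anything taken from the paper.

```python
from itertools import combinations
from typing import FrozenSet, List, Set, Tuple

# Hypothetical rule base: unordered pairs of medicines that should not be
# combined. In the paper this knowledge lives in a knowledge graph.
CONFLICTS: Set[FrozenSet[str]] = {
    frozenset({"medicine_a", "medicine_b"}),
    frozenset({"medicine_b", "medicine_d"}),
}


def find_conflicts(prescribed: List[str]) -> List[Tuple[str, str]]:
    """Return every conflicting pair among the prescribed medicines."""
    found = []
    # Deduplicate and sort so the result order is deterministic.
    for first, second in combinations(sorted(set(prescribed)), 2):
        if frozenset({first, second}) in CONFLICTS:
            found.append((first, second))
    return found


# Names as they might be extracted from a medical record by an NER model.
alerts = find_conflicts(["medicine_d", "medicine_a", "medicine_b"])
print(alerts)
```

Checking all pairs grows quadratically with the number of prescribed drugs, which gives some intuition for why prescriptions with three or more drugs take longer to check than two-drug prescriptions.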

Neural Adaptation Layers for Cross-domain Named Entity Recognition

10.18653/v1/d18-1226 ◽

2018 ◽

Cited By ~ 7

Author(s):

Bill Yuchen Lin ◽

Wei Lu

Keyword(s):

Named Entity Recognition ◽

Neural Adaptation ◽

Entity Recognition ◽

Named Entity ◽

Cross Domain

DeNERT-KG: Named Entity and Relation Extraction Model Using DQN, Knowledge Graph, and BERT

Applied Sciences ◽

10.3390/app10186429 ◽

2020 ◽

Vol 10 (18) ◽

pp. 6429

Author(s):

SungMin Yang ◽

SoYeop Yoo ◽

OkRan Jeong

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Language Model ◽

Named Entity Recognition ◽

Relation Extraction ◽

Entity Recognition ◽

Knowledge Graph ◽

Named Entity ◽

Artificial Intelligence Technology

Alongside studies on artificial intelligence technology, research is being actively carried out in the field of natural language processing to understand and process people's language, in other words, natural language. For computers to learn on their own, the ability to understand natural language is very important. The field of natural language processing involves a wide variety of tasks, but we focus on named entity recognition and relation extraction, which are considered the most important for understanding sentences. We propose DeNERT-KG, a model that extracts the subject, the object, and their relationship in order to grasp the meaning inherent in a sentence. The named entity recognition (NER) model for extracting the subject and object is built on the BERT language model and a Deep Q-Network, and a knowledge graph is applied for relation extraction. Using the DeNERT-KG model, it is possible to extract the subject, type of subject, object, type of object, and relationship from a sentence, and we verify the model through experiments.

Named Entity Recognition in Traditional Chinese Medicine Clinical Cases Combining BiLSTM-CRF with Knowledge Graph

Knowledge Science, Engineering and Management - Lecture Notes in Computer Science ◽

10.1007/978-3-030-29551-6_48 ◽

2019 ◽

pp. 537-548 ◽

Cited By ~ 2

Author(s):

Zhe Jin ◽

Yin Zhang ◽

Haodan Kuang ◽

Liang Yao ◽

Wenjin Zhang ◽

...

Keyword(s):

Chinese Medicine ◽

Traditional Chinese Medicine ◽

Named Entity Recognition ◽

Entity Recognition ◽

Knowledge Graph ◽

Named Entity ◽

Clinical Cases

PDALN: Progressive Domain Adaptation over a Pre-trained Model for Low-Resource Cross-Domain Named Entity Recognition

10.18653/v1/2021.emnlp-main.442 ◽

2021 ◽

Author(s):

Tao Zhang ◽

Congying Xia ◽

Philip S. Yu ◽

Zhiwei Liu ◽

Shu Zhao

Keyword(s):

Domain Adaptation ◽

Named Entity Recognition ◽

Entity Recognition ◽

Low Resource ◽

Named Entity ◽

Cross Domain

Understanding Horizon 2020 Data: A Knowledge Graph-Based Approach

Applied Sciences ◽

10.3390/app112311425 ◽

2021 ◽

Vol 11 (23) ◽

pp. 11425

Author(s):

Nikolaos Giarelis ◽

Nikos Karacapilidis

Keyword(s):

Named Entity Recognition ◽

Data Representation ◽

Entity Recognition ◽

Knowledge Graph ◽

Graph Database ◽

Keyphrase Extraction ◽

Graph Analytics ◽

Aggregated Data ◽

Horizon 2020 ◽

Named Entity

This paper aims to meaningfully analyse the Horizon 2020 data in the EU's CORDIS repository and, accordingly, to offer evidence and insights that help organizations form consortia to prepare and submit winning research proposals to forthcoming calls. The analysis is performed on aggregated data concerning 32,090 funded projects, the 34,295 organizations that participated in them, and the 87,067 public deliverables produced. The data is modelled through a knowledge graph-based approach that aims to semantically capture existing relationships and reveal hidden information. The main contribution of this work lies in the proper utilization and orchestration of keyphrase extraction and named entity recognition models, together with meaningful graph analytics on top of an efficient graph database. The proposed approach enables users to ask complex questions about the interconnection of various entities related to previously funded research projects. A set of representative queries demonstrating our data representation and analysis approach is given at the end of the paper.
