Research study for a Master's Degree in Data Science & AI, completed in August 2022.

ShinThant3010/Deep-Learning-based-KG-for-Covid19-Vaccination

Deep-Learning-based-KG-Construction-for-Covid19-Vaccination

Master's research and NII internship research, supervised by Dr. Chutiporn Anutariya (Asian Institute of Technology, Thailand), Prof. Frederic Andreas (National Institute of Informatics, Japan), and Dr. Teeradaj Racharak (Japan Advanced Institute of Science and Technology, JAIST).

Project Description

This research constructs a content-based knowledge graph from up-to-date, vaccine-specific literature in the CORD-19 dataset.

Dataset

  • Dataset for Knowledge Graph Extraction: CORD-19
  • Dataset for NER: CORD-NER

Methodology

  1. Dataset Preparation: Latent Dirichlet Allocation (LDA) is used to focus on vaccine-specific information
  2. Named Entity Recognition (NER): fine-tuning DistilBERT [Fine-tuned Dataset: https://dx.doi.org/10.21227/m7gj-ks21]
  3. Relation Extraction: verb phrase extraction, followed by relation clustering through synset grouping (using synset information from the WordNet database)
  4. Triple Construction and Analysis: triples are built from the extracted entities and relations, then analysed by entity type and by relation
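
As a rough illustration of steps 3 and 4, the sketch below groups relation verbs under a shared label and assembles (head, relation, tail) triples. It is not the repository's code. The names `TOY_SYNSET_GROUPS`, `cluster_relation`, `build_triples`, and `count_by_relation` are hypothetical, the group labels are made up, and the small dictionary only stands in for a real WordNet synset lookup. The example sentences are invented for demonstration and are not results from the study.

```python
from collections import Counter

# ASSUMPTION: a hand-written stand-in for WordNet synset information.
# Each verb lemma maps to a made-up group label; a real pipeline would
# derive these groups from the WordNet database instead.
TOY_SYNSET_GROUPS = {
    "induce": "GROUP_INDUCE",
    "stimulate": "GROUP_INDUCE",
    "cause": "GROUP_INDUCE",
    "prevent": "GROUP_PREVENT",
    "forestall": "GROUP_PREVENT",
}


def cluster_relation(verb):
    """Return the group label for a verb; unknown verbs keep their own lemma."""
    lemma = verb.lower()
    return TOY_SYNSET_GROUPS.get(lemma, lemma)


def build_triples(extractions):
    """Convert (head entity, verb, tail entity) tuples into clustered triples."""
    return [(head, cluster_relation(verb), tail) for head, verb, tail in extractions]


def count_by_relation(triples):
    """Relation-based analysis: count how many triples use each relation group."""
    return Counter(relation for _, relation, _ in triples)


# Invented example input, assumed to come from the NER and verb phrase steps.
extractions = [
    ("mRNA vaccine", "induce", "antibody response"),
    ("adjuvant", "Stimulate", "immune response"),
    ("vaccination", "prevent", "severe disease"),
    ("booster dose", "reduce", "infection risk"),
]

triples = build_triples(extractions)
relation_counts = count_by_relation(triples)

for triple in triples:
    print(triple)
print(dict(relation_counts))
```

Verbs missing from the lookup, such as "reduce" here, fall back to their own lemma so that no extracted relation is silently dropped. An entity type-based analysis could follow the same counting pattern if each entity carried its NER label.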