Master's research and NII internship research, supervised by Dr. Chutiporn Anutariya (Asian Institute of Technology, Thailand), Prof. Frederic Andreas (National Institute of Informatics, Japan), and Dr. Teeradaj Racharak (Japan Advanced Institute of Science and Technology (JAIST)).
The research constructs a content-based knowledge graph from up-to-date, vaccine-specific literature in the CORD-19 dataset.
- Dataset Preparation: Latent Dirichlet Allocation (LDA) to narrow the corpus to vaccine-specific information
- Named Entity Recognition (NER): fine-tuning DistilBERT [Fine-tuned Dataset: https://dx.doi.org/10.21227/m7gj-ks21]
- Relation Extraction: verb phrase extraction; Relation Clustering: synset grouping (using synset information from the WordNet database)
- Triple Construction: from the extracted entities and relations; Analysis: entity type-based and relation-based
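
The relation clustering, triple construction, and relation-based analysis steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `TOY_GROUPS` table is a hypothetical stand-in for WordNet synset lookups, the function names are invented for this sketch, and the sample extractions are made up rather than taken from CORD-19. Verbs are assumed to be lemmatized already.

```python
from collections import Counter

# Hypothetical stand-in for WordNet: maps a verb lemma to a relation group.
# The real pipeline is described as using WordNet synset information instead.
TOY_GROUPS = {
    "induce": "CAUSE",
    "cause": "CAUSE",
    "trigger": "CAUSE",
    "prevent": "PREVENT",
    "block": "PREVENT",
}


def canonical_relation(verb):
    """Return the group label for a verb, or the lowercased verb if it is unknown."""
    lemma = verb.lower()
    return TOY_GROUPS.get(lemma, lemma)


def build_triples(extractions):
    """Map (subject, verb, object) extractions to deduplicated triples
    whose relation is the group label, preserving first-seen order."""
    seen = set()
    triples = []
    for subj, verb, obj in extractions:
        triple = (subj, canonical_relation(verb), obj)
        if triple not in seen:
            seen.add(triple)
            triples.append(triple)
    return triples


# Illustrative inputs only (assumed output of the NER and verb phrase steps).
extractions = [
    ("vaccine A", "induce", "antibody response"),
    ("vaccine A", "trigger", "antibody response"),
    ("antibody", "block", "viral entry"),
    ("adjuvant", "enhance", "immune response"),
]

triples = build_triples(extractions)

# Relation-based analysis: how many triples fall under each relation group.
relation_counts = Counter(relation for _, relation, _ in triples)

for triple in triples:
    print(triple)
print(relation_counts)
```

Grouping before deduplication means that "induce" and "trigger" between the same entity pair collapse into a single triple, which is the intended effect of clustering synonymous relations.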