Logo image
Topic Discovery for Biomedical Corpus Using MeSH Embeddings
Conference proceeding

Topic Discovery for Biomedical Corpus Using MeSH Embeddings

Guangxu Xun, Kishlay Jha, Ye Yuan and Aidong Zhang
2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), pp.1-4
05/2019
DOI: 10.1109/BHI.2019.8834559

View Online

Abstract

Discovering latent topics from biomedical documents has become a pivotal task in many biomedical text mining applications. Medical Subject Headings (MeSH) terms, which are curated by human experts, provide highly precise keyword representations for biomedical documents. However, the performance of conventional topic models on MeSH documents is usually unsatisfying due to the limited length of individual MeSH documents. In this paper, we propose a novel topic model for MeSH documents using MeSH embeddings. The proposed topic model is able to overcome the lack of context information problem in MeSH documents by 1) exploiting the rich term-level co-occurrence patterns instead of the sparse document-level co-occurrence patterns, and 2) incorporating additional MeSH semantics in MeSH embeddings learned from a large external biomedical knowledge base. Experimental result on a real-world biomedical dataset shows the efficacy of the proposed model in discovering coherent topics from MeSH documents.
Biological system modeling Biomedical topic discovery Coherence Diabetes Diseases Gaussian distribution Insulin MeSH embeddings Semantics

Details

Metrics

10 Record Views
Logo image