Journal article
Advanced graph and sequence neural networks for molecular property prediction and drug discovery
BIOINFORMATICS, Vol.38(9), pp.2579-2586
04/28/2022
DOI: 10.1093/bioinformatics/btac112
PMID: 35179547
Abstract
Motivation: Properties of molecules are indicative of their functions and thus are useful in many applications. With the advances of deep-learning methods, computational approaches for predicting molecular properties are gaining increasing momentum. However, customized, advanced methods and comprehensive tools for this task are currently lacking.
Results: Here, we develop a suite of comprehensive machine-learning methods and tools spanning different computational models, molecular representations and loss functions for molecular property prediction and drug discovery. Specifically, we represent molecules as both graphs and sequences. Built on these representations, we develop novel deep models for learning from molecular graphs and sequences. In order to learn effectively from highly imbalanced datasets, we develop advanced loss functions that optimize areas under precision-recall curves (PRCs) and receiver operating characteristic (ROC) curves. Altogether, our work not only serves as a comprehensive tool, but also contributes toward developing novel and advanced graph and sequence-learning methodologies. Results on both online and offline antibiotics discovery and molecular property prediction tasks show that our methods achieve consistent improvements over prior methods. In particular, our methods achieve #1 ranking in terms of both ROC-AUC (area under curve) and PRC-AUC on the AI Cures open challenge for drug discovery related to COVID-19.
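As an illustration of what optimizing the area under the ROC curve can mean in practice, the following is a minimal sketch, not the authors' loss functions. It assumes binary labels and real-valued scores, computes the exact ROC-AUC as the fraction of correctly ranked (positive, negative) pairs, and pairs it with a generic squared-hinge pairwise surrogate of the kind often used because the exact AUC is not differentiable. The function names and the `margin` parameter are hypothetical.

```python
def split_by_label(scores, labels):
    """Separate scores into those of positives (label 1) and negatives (label 0)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    return pos, neg


def roc_auc(scores, labels):
    """Exact ROC-AUC: share of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half correct.
    """
    pos, neg = split_by_label(scores, labels)
    correct = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(pos) * len(neg))


def pairwise_auc_surrogate(scores, labels, margin=1.0):
    """Squared-hinge surrogate for 1 - AUC (an assumed, generic choice).

    Each (positive, negative) pair is penalized unless the positive score
    exceeds the negative score by at least `margin`.
    """
    pos, neg = split_by_label(scores, labels)
    loss = sum(max(0.0, margin - (p - n)) ** 2 for p in pos for n in neg)
    return loss / (len(pos) * len(neg))


# Toy imbalanced example: one active molecule among four candidates.
scores = [0.9, 0.2, 0.4, 0.1]
labels = [1, 0, 0, 0]
auc = roc_auc(scores, labels)
loss = pairwise_auc_surrogate(scores, labels)
```

Because both quantities are averaged over positive-negative pairs rather than over individual samples, a rare positive class carries as much weight as the abundant negative class, which is the usual motivation for AUC-style objectives on highly imbalanced data.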
Details
- Title: Subtitle
- Advanced graph and sequence neural networks for molecular property prediction and drug discovery
- Creators
- Zhengyang Wang - Texas A&M University; Meng Liu - Texas A&M University; Youzhi Luo - Texas A&M University; Zhao Xu - Texas A&M University; Yaochen Xie - Texas A&M University; Limei Wang - Texas A&M University; Lei Cai - Texas A&M University; Qi Qi - Univ Iowa, Dept Comp Sci, Iowa City, IA 52242 USA; Zhuoning Yuan - Univ Iowa, Dept Comp Sci, Iowa City, IA 52242 USA; Tianbao Yang - Univ Iowa, Dept Comp Sci, Iowa City, IA 52242 USA; Shuiwang Ji - Texas A&M University
- Resource Type
- Journal article
- Publication Details
- BIOINFORMATICS, Vol.38(9), pp.2579-2586
- DOI
- 10.1093/bioinformatics/btac112
- PMID
- 35179547
- NLM abbreviation
- Bioinformatics
- ISSN
- 1367-4803
- eISSN
- 1460-2059
- Publisher
- Oxford Univ Press
- Number of pages
- 8
- Grant note
- DBI-1922969; IIS-1908198; IIS-1955189; 1933212; 1844403 / National Science Foundation (NSF)
- Language
- English
- Date published
- 04/28/2022
- Academic Unit
- Computer Science
- Record Identifier
- 9984259430002771