Data Sources
SgTxGNN integrates data from multiple authoritative sources to provide comprehensive drug repurposing predictions and evidence.
Primary Sources
TxGNN Model
Source: Harvard Medical School, Zitnik Lab
The core prediction engine, published in Nature Medicine (2023).
Singapore HSA
Source: Health Sciences Authority, Singapore
Drug registration data for Singapore-approved medications.
Evidence Sources
ClinicalTrials.gov
Source: U.S. National Library of Medicine
Global registry of clinical trials.
PubMed
Source: U.S. National Library of Medicine
Biomedical literature database.
DrugBank
Source: University of Alberta
Comprehensive drug and target database.
- Website
- Data used under academic license
Data Processing
Drug Mapping
- HSA drug names extracted from registration data
- Normalized using chemical name standardization
- Mapped to DrugBank identifiers
- Successfully mapped: 745 drugs
Prediction Generation
- DrugBank IDs matched to TxGNN knowledge graph
- Knowledge Graph predictions generated
- Deep Learning predictions generated
- Results combined and deduplicated
Update Schedule
| Source | Last Updated | Frequency |
|---|---|---|
| TxGNN Model | 2023 | As published |
| HSA Data | March 2026 | Quarterly |
| DrugBank | 2025 | Annual |
Licensing
- TxGNN: Academic use permitted
- HSA Data: Open Government License
- DrugBank: Academic license
- PubMed/ClinicalTrials: Public domain
Citation
When using SgTxGNN data, please cite:
- This platform (see About for citation format)
- Original TxGNN paper (Huang et al., Nature Medicine 2023)
- Relevant data sources as appropriate