Building Standards, Knowledge Sources and Software

On this page:

COVID-19 Open Data Portal

The COVID-19 Open Data Portal provides open access to screening data, animal model data and multi-omics data. Informatics scientists at NCATS are contributing to back-end and front-end development of the website and are helping coordinate the data ingestion into the site. This resource enables a variety of drug repurposing activities and allows researchers to formulate hypotheses, prioritize research opportunities, and speed the search for effective therapies against SARS-CoV-2 and COVID-19.


CURE ID, a website and mobile app, is a collaboration between the U.S. Food and Drug Administration (FDA) and NCATS that gives the global clinical community the opportunity to report novel uses of existing drugs for patients with difficult-to-treat infectious diseases through a website or a smartphone or other mobile device. It was developed with support from the Infectious Diseases Society of America, the Centers for Disease Control and Prevention, and the World Health Organization.

Genetic and Rare Diseases (GARD) Information Center

The Genetic and Rare Diseases (GARD) Information Center provides the public with access to current, reliable and easy-to-understand information about rare or genetic diseases. Informatics scientists organize, synthesize and update the backbone database for GARD. The program is funded by NCATS and the National Human Genome Research Institute.

Drug and Chemical Annotations and Applications

Global Substance Registration System (GSRS)

GSRS provides a common identifier for all of the substances regulated by the FDA that are used in medicinal products. GSRS is the only system that enables ISO 11238 standard compliance, allowing consistent definitions of substances globally, including active substances under clinical investigation. GSRS is a collaborative effort between NCATS and the FDA.

NCATS Inxight Drugs

Inxight Drugs incorporates and unifies manually curated data supplied by the FDA and private companies, and it provides marketing and regulatory status, rigorous drug ingredient definitions, information about biological activity and clinical use, and more. NCATS has developed Inxight data resources to facilitate translational research.


NCATS Predictor provides chemical structure-property and structure-activity models for drug discovery and development. The models are developed from NCATS’ in-house data sets and published literature, and they use innovative deep-learning approaches to make predictions.

Learn more: Novel Consensus Architecture to Improve Performance of Large-Scale Multitask Deep Learning QSAR Models.

Matrix Client

NCATS’ Matrix client is a publicly available web-based interface for analysis and access to NCATS combination screening data.

Thousands of combination pairs can be analyzed with this platform in a robust and cost- and time-effective way, enabling NCATS scientists to quickly narrow down a long list of drug pairs to identify the most effective drug combinations for follow-up and clinical studies.

Tox21 Gateway

The Tox21 Gateway is a web-based interface for the analysis of and access to Tox21 screening data. The Tox21 10K compound library has been screened against approximately 70 cell-based assays in qHTS format, generating approximately 100 million data points. The Gateway contains a suite of tools, including a public website for browsing and downloading Tox21 assay data and compound library annotations, such as analytical QC results; an assay tracking system that stores assay metadata and detailed experimental conditions; a structure-activity analysis tool; and links to Tox21 publications and presentations.

Biological and Molecular Knowledge Sources and Applications


Pharos is a comprehensive, integrated knowledge base for drug discovery and target validation. It was created to help illuminate the uncharacterized or poorly annotated portion of the genome. Pharos is the user interface to the Knowledge Management Center for the Illuminating the Druggable Genome (IDG) program funded by the NIH Common Fund.

Learn more: Pharos: Collating Protein Information to Shed Light on the Druggable Genome.

Relational Database of Metabolic Pathways (RaMP)



The Relational Database of Metabolic Pathways (RaMP) is a publicly available relational database that integrates multiple sources of biological, chemical and analyte (metabolite, protein, gene) annotations. The source code for building the database is available, and a user-friendly application to query the database is provided. RaMP also supports pathway enrichment analysis for multi-omics data input.


BioPlanet is a comprehensive, publicly accessible informatics resource that catalogues annotations and relationships between pathways, healthy and disease states, and targets. BioPlanet integrates pathway annotations from publicly available, manually curated sources that have been subjected to thorough redundancy and consistency cross-evaluation via extensive manual curation. The browser supports interactive browsing; retrieval and analysis of pathways; exploration of pathway connections; and pathway search by gene targets, category and availability of bioactivity assays.


SmartGraph is a predictive network-pharmacology platform that integrates drug-target and protein-protein interactions. Investigators can analyze the perturbation of the protein interaction network caused by single or multiple (small-molecule) agents. SmartGraph enables researchers without an in-depth informatics background to perform analyses by providing powerful network visualization and user-friendly “single click” options to perform complex workflows.