# Open Targets

## Necessary Disclaimers and Legal

The user is responsible for reviewing and complying with the license requirements of the software, notebooks, and data referenced in this documentation.

Users are responsible for the costs associated with analyzing the Open Targets dataset and its storage in their project spaces.

Instance type availability and pricing are subject to the contract between the user or the user’s organization and DNAnexus.

## Citations for Open Targets

The latest publication about Open Targets, [Open Targets Platform: facilitating therapeutic hypotheses building in drug discovery](https://academic.oup.com/nar/article/53/D1/D1467/7917960) (2025), describes recent updates to the Open Targets Platform. More information about Open Targets is available in earlier publications:

1. [The next-generation Open Targets Platform: reimagined, redesigned, rebuilt](https://academic.oup.com/nar/article/51/D1/D1353/6833237?login=false) (2023)
2. [Open Targets Platform: supporting systematic drug–target identification and prioritisation](https://academic.oup.com/nar/article/49/D1/D1302/5983621?login=false) (2021)
3. [Open Targets Platform: new developments and updates two years on](https://academic.oup.com/nar/article/47/D1/D1056/5193331?login=false) (2019)
4. [Open Targets: a platform for therapeutic target identification and validation](https://academic.oup.com/nar/article/45/D1/D985/2605745?login=false) (2017)

Open Targets is a public-private initiative led by the European Bioinformatics Institute (EBI) that comprehensively aggregates public data sources for drug discovery. The official version of the dataset is hosted on the [Open Targets Platform](https://platform.opentargets.org/downloads).

## Overview of the Open Targets Dataset

Open Targets is an integrated data resource that enables the systematic identification and prioritization of therapeutic targets. It combines diverse publicly available datasets with resources generated by the Open Targets consortium to compute and score target–disease associations, helping drive more informed decisions in early drug discovery. By aggregating evidence across genetics, molecular QTLs, somatic variation, expression, pathways, chemical biology, pharmacology, and literature, it provides comprehensive annotation of targets, diseases and drugs within a unified framework.

<img src="/files/Z30oz7N0eOX4oflpBGRL" alt="" height="188" width="624">

The Open Targets Platform integrates data informing multiple steps in the target identification and prioritization process, from assessing the causal and supporting evidence of a target’s role in disease, through target prioritization, to therapeutic hypothesis generation.

Dataset information can be found in the [official Open Targets documentation](https://platform-docs.opentargets.org/). The schema for each dataset can be viewed online in the [Open Targets data downloads](https://platform.opentargets.org/downloads) section.

On DNAnexus, we provide the complete Open Targets Platform release (version 25.09), including 38 datasets across seven major categories (target–disease associations, targets, ontology, genetics, diseases, drugs, and literature). For an overview of this release, see the official [release blog post](https://blog.opentargets.org/open-targets-platform-25-09-release/) and [release notes](https://platform-docs.opentargets.org/release-notes?ref=blog.opentargets.org). In December 2025, Open Targets released a new version (25.12), which is also available on DNAnexus. See the [release notes](https://platform-docs.opentargets.org/release-notes) to learn more about this version.

See the “Where to Access Open Targets” section below to start accessing the dataset.

## Where to Access Open Targets

The following files are available for the Open Targets datasets:

* All 38 datasets were downloaded directly from the official Open Targets FTP repository (version 25.09) and are stored in Parquet format, so they can be queried and analyzed with big data analytics tools such as Spark. The files are available for [AWS US East](https://platform.dnanexus.com/panx/projects/J3JyY6j030gzQypGpk273241/data/OpenTargets/datasets), [AWS Europe (Frankfurt)](https://platform.dnanexus.com/panx/projects/J780j7848VpfB6kJ8p7y29xG/data/OpenTargets/datasets), [AWS Europe (London)](https://platform.dnanexus.com/panx/projects/J780fzpKpb7Gq5X4ZJfBP7QX/data/OpenTargets/datasets), [Azure Amsterdam](https://platform.dnanexus.com/panx/projects/J780gY0B34pvq5X4ZJfBP7YP/data/OpenTargets/datasets), and [Azure US (West)](https://platform.dnanexus.com/panx/projects/J780v289Z00G4Kx14b188ybj/data/OpenTargets/datasets).
* Notebooks that showcase how to query and integrate datasets from Open Targets are available on the platform for [AWS US East](https://platform.dnanexus.com/panx/projects/J3JyY6j030gzQypGpk273241/data/OpenTargets/notebooks), [AWS Europe (Frankfurt)](https://platform.dnanexus.com/panx/projects/J780j7848VpfB6kJ8p7y29xG/data/OpenTargets/notebooks), [AWS Europe (London)](https://platform.dnanexus.com/panx/projects/J780fzpKpb7Gq5X4ZJfBP7QX/data/OpenTargets/notebooks), [Azure Amsterdam](https://platform.dnanexus.com/panx/projects/J780gY0B34pvq5X4ZJfBP7YP/data/OpenTargets/notebooks), and [Azure US (West)](https://platform.dnanexus.com/panx/projects/J780v289Z00G4Kx14b188ybj/data/OpenTargets/notebooks). The notebook files have the `.ipynb` extension.

To use the dataset and notebooks, copy them into your own project space. The steps are described in the "Copying Data and Notebooks into a Project" section below.
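Because each dataset is stored as Parquet, loading one into Spark amounts to pointing the reader at the dataset's folder. The following is a minimal sketch, not the method used in the provided notebooks: the helper names `dataset_path` and `load_dataset` are hypothetical, and both the root path and the dataset folder name are assumptions that depend on how the copied data is made visible to your Spark session.

```python
from posixpath import join as join_path


def dataset_path(name, root):
    """Return the folder holding the Parquet part files for one dataset.

    `root` is wherever the copied datasets folder is visible to Spark
    (an assumption; it depends on your project layout and environment).
    """
    return join_path(root, name)


def load_dataset(spark, name, root):
    """Read one dataset folder into a Spark DataFrame.

    `spark` is an existing SparkSession, for example the one available
    in a Spark-enabled JupyterLab session.
    """
    return spark.read.parquet(dataset_path(name, root))
```

In a notebook with an active `SparkSession` named `spark`, a call such as `load_dataset(spark, "target", "/path/to/datasets").printSchema()` would print the schema of that dataset, assuming a folder with that name exists under the given root.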

## Running Analyses on Open Targets

### Copying Data and Notebooks into a Project

To use the dataset, copy the data from the public project for your region (linked above) into your own project.

Follow these steps to copy the Open Targets data into a project space:

1. Create a project for your Open Targets dataset, billed to your own organization. Tutorials on how to set up a project can be [found on this page](https://academy.dnanexus.com/overview-of-the-platform/setting-up-a-project).
2. Go to the Resources tab, find the project titled “Public Datasets *Region*”, and select the folder "Open-Targets".
3. Select the data folder and the notebooks.
4. Select "Copy" in the top right menu, and choose the project that you created in Step 1.
5. Go to the project space you created in Step 1 to start exploring the Open Targets dataset and notebooks.
6. To run the JupyterLab notebooks, see the [JupyterLab](https://academy.dnanexus.com/interactivecloudcomputing/jupyterlab/introduction) section of the Academy documentation, including [Running a JupyterLab Notebook](https://academy.dnanexus.com/interactivecloudcomputing/jupyterlab/running_notebook) and [Running a Spark JupyterLab Notebook](https://academy.dnanexus.com/interactivecloudcomputing/jupyterlab/running-a-spark-jupyterlab-notebook).
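For users who prefer the command line, the copy in steps 2 to 4 could in principle also be done with the `dx cp` command from the DNAnexus dx-toolkit, which copies objects and folders between projects. The sketch below only builds and prints such a command. The helper `build_copy_command` is hypothetical, the project IDs are placeholders, and the folder paths are assumptions inferred from the links above, so verify them against the public project for your region before running anything.

```python
import shlex


def build_copy_command(source_project, folders, dest_project, dest_folder="/"):
    """Build a `dx cp` argument list that copies folders between projects."""
    sources = [f"{source_project}:{folder}" for folder in folders]
    return ["dx", "cp", *sources, f"{dest_project}:{dest_folder}"]


command = build_copy_command(
    "project-xxxx",  # placeholder: ID of the public datasets project for your region
    ["/OpenTargets/datasets", "/OpenTargets/notebooks"],  # assumed folder paths
    "project-yyyy",  # placeholder: ID of the project created in Step 1
)

# Print the command for review instead of executing it.
print(shlex.join(command))
```

Printing the command first makes it easy to check the source and destination before running it in a terminal where you are logged in with `dx login`.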

### Example Notebook

We prepared an example notebook that extracts colocalizations for GWAS credible sets associated with autoimmune diseases. The notebook is named “autoimmune\_colocalisations\_spark.ipynb” and is optimized for the [JupyterLab with Spark Cluster](https://platform.dnanexus.com/panx/tool/app/app-J2PfkXj005PyVJQJ133B9k1v) app.

* Instance type: mem1\_ssd1\_v2\_x16
* Before running the notebook, run the command-line instructions provided in the notebook in the terminal.

### Video: Utilizing the Open Targets Dataset on the DNAnexus Platform

{% embed url="<https://youtu.be/JjueXk0K58Y>" %}

