Academy Documentation
  • Usage of Academy Documentation
  • Getting Started
    • Background Information
    • For Apollo Users
    • For Titan Users
    • For Scientists
    • For HPC Users
    • For Experienced Users
  • Cloud Computing
    • General Information
    • Cloud Computing for Scientists
    • Cloud Computing for HPC Users
  • Overview of the Platform
    • Overview of the Platform User Interface
    • Tool Library and App Introduction
  • Billing Access and Orgs
    • Orgs and Account Management
    • Billing and Pricing
  • Cohort Browser
    • Apollo Introduction
    • Overview of the Cohort Browser
    • Combining Cohorts
    • Genomic Variant Browser
    • Somatic Variants
  • JSON
    • Introduction
    • JSON on the Platform
  • Command Line Interface (CLI)
    • Introduction to CLI
    • Advanced CLI
  • Building Applets
    • Introduction
    • Bash
      • Example 1: Word Count (wc)
      • Example 2: fastq_quality_trimmer
      • Example 3: samtools
      • Example 4: cnvkit
      • Example 5: samtools with a Docker Image
    • Python
      • Example 1: Word Count (wc)
      • Example 2: fastq_quality_trimmer
      • Example 3: cnvkit
    • Publishing Applets to Apps
  • Building Workflows
    • Native Workflows
    • WDL
      • Example 1: hello
      • Example 2: Word Count (wc)
      • Example 3: fastq_trimmer
      • Example 4: cnvkit
      • Example 5: workflow
    • Nextflow
      • Resources To Learn Nextflow
      • Overview of Nextflow
      • Nextflow Setup
      • Importing Nf-Core
      • Building Nextflow Applets
      • Error Strategies for Nextflow
      • Job Failures
      • Useful Information
  • Interactive Cloud Computing
    • Cloud Workstation
    • TTYD
    • TTYD vs Cloud Workstation
    • JupyterLab
      • Introduction
      • Running a JupyterLab Notebook
  • Docker
    • Using Docker
    • Creating Docker Snapshots
    • Running Docker with Swiss Army Knife
  • Portals
    • Overview of JSON files for Portals
    • Branding JSON File
    • Home JSON File
    • Navigation JSON File
    • Updating Your Portal
  • AI/ ML Accelerator
    • Data Profiler
      • Introduction to Data Profiler
      • Utilizing Data Profiler Navigator
      • Dataset Level Screen
      • Table Level Screen
      • Column Level Screen
      • Explorer Mode
      • Accessing Data Profiler in ML JupyterLab
    • ML JupyterLab
      • Introduction to ML JupyterLab
      • Launching a ML JupyterLab Job
      • In App Features
      • Getting Started with ML JupyterLab
    • MLflow
      • Introduction to MLflow
      • Getting Started with MLflow
      • Using MLflow Tracking Server
      • Model Registry
      • Using Existing Model
      • Utilizing MLflow in JupyterLab
Powered by GitBook
On this page
  • Introduction to Cohort Browser
  • Adding Tiles
  • Filtering a Dataset
  • Using Cohorts for Comparisons
  • Applications for File Based Analysis
  • Supplemental Information
  • Resources

Was this helpful?

Export as PDF
  1. Cohort Browser

Overview of the Cohort Browser

PreviousApollo IntroductionNextCombining Cohorts

Last updated 4 months ago

Was this helpful?

Please note: in order to use Cohort Browser on the Platform, an Apollo License is needed.

Introduction to Cohort Browser

What is it for?

  • Use cohort browser for browsing, visualizing data, and creating cohorts

  • Access geno-pheno patient records from customer's studies, those of collaborators, and/or public datasets (UK Biobank)

    • Quickly identify phenotype-specific and/or genotype-specific virtual cohorts for GWAS/PheWAS analyses as well as compare cohorts

    • Visualize the cohort data to understand the underlying pheno/geno distribution

High Altitude Tour

Limits on the Cohort Browser

  • Limited to 15 tiles overall in dashboard

  • Limited to 30 columns in Data Preview

  • Add 1-2 tiles at a time, wait for them to refresh before adding more tiles.

Adding Tiles

Simple Tiles

  1. Open Cohort Browser

  2. Select Add tile on the top right corner

  3. Find the characteristic you want as a tile and select "add tile"

  4. Repeat until you select the amount of tiles that you are wanting (up to 15)

2D Plots

  • Used for more advanced comparisons

  • Add comparisons by selecting the first filter, then selecting the "+" sign for a secondary field

  • Then, you will edit in the data field details.

  • Here is the overview of the 2D plots that you can have:

Steps to Create 2D plots

  1. Open Cohort Browser

  2. Select Add tile on the top right corner

  3. Find the characteristic you are wanting to start with and select it, such as biological sex. This is the same step as adding a regular tile, but you will NOT select Add tile.

  4. Instead, add a secondary field by selecting this next to the second characteristic you are wanting to view, such as Age at Recruitment

  5. You will then have options to change the graph with those parameters. It will look something like this

  6. Then, select the add tile button on the bottom right below the new graph. This will add it to the cohort browser.

Filtering a Dataset

Basic Filtering

  • In the Cohort section, select the + Add Filter

  • Search or Select your characteristic

    • Example: Population Characteristics > Baseline Characteristics > Sex

  • Click Add Cohort Filter

  • Make sure "Is Any of" is selected, click on empty field

  • Select details for the characteristic

    • Example: selecting female

  • Your cohort panel will then look like this:

  • Repeat steps as necessary to filter as needed to create your cohort

Inclusive Filtering

By default, filters return all rows belonging to a patient when you query a secondary entity

Option 1: 2 filters at the same "level" but different rows

  • In this example, we are telling the cohort browser to find patients that have taken the medication ipilimumab OR (and with) the medication stop reason being side effects

  • This is because of how the filters were added. Essentially, this example is looking at the the medication list and the medication stop reason at the same time.

Option 2: filters on the same row

  • In this example, we are telling the cohort browser to find patients that have taken the medication ipilimumab AND who stopped taking it due to side effects.

  • This example, essentially, puts them on the same row and views the parameters together as a pair.

And/ Or Functionality

Option 1:

In the example, we use coffee drinkers.

If in the cohort filter we select sex as female AND the coffee consumed is 0, then only females with an instance of 0 for coffee consumed will show.

The differences to note between the and/ or function are highlighted below:

If in the cohort filter we select sex as female OR the coffee consumed is 0, then both females and males with an instance of 0 for coffee consumed will show. This increases the participant size because all the females are included, OR those that have a coffee instance of 0 (could be male OR female).

The differences in this cohort filtering is highlighted below:

Option 2:

  • In this example, we wanted patients with variants that are common or rare germline OR (and with) variants that are somatic AND have a strong evidence level (A/B).

  • This example demonstrates combining AND/ OR filters with data elements from a single table so that you can view what you are wanting in a cohort.

Combining Phenotype and Genotype Filters

  • In this example, we wanted a patient with a Tumor that has an intermediate or high tumor mutation burden OR (and with) a moderate or high impact mutation in APC, TP53, MSH6, PMS2.

  • This example is highlighting that you can combine phenotype and genotype filters so that you can customize your cohort further.

Using Cohorts for Comparisons

  • You can compare 2 cohorts in order to test hypotheses

  • To make a control cohort, you can copy the test cohort and change the filters.

Example 1:

Example 2:

  • For this example, we used coffee drinkers and then everyone else as a comparison

  • The first cohort is based on coffee type. This cohort was about 17,000 phenotypes.

  • The second cohort was all 100,000 phenotypes. We add this by selecting on the top by the name of the dataset to add another cohort in.

  • This allows us to see the 2 of them side by side in the tiles that we have selected. Coffee Drinkers was blue, and all of the 100,00 were purple in the example given.

Analyzing Cohorts with Spark Jupyterab/ dxdata

  • Once data is ingested, they are available as separate Spark databases. Apollo unifies accessing data in these databases through what's called a dataset.

  • A dataset can be thought of as a giant multi-omics matrix.

  • Datasets can be further refined into Cohorts within the Apollo Interface, allowing complex queries across omics type

Applications for File Based Analysis

  • Need applications for datasets/ cohorts

  • They exist on the platform in the tool library

  • This allows for further functionality with Apollo.

  • These apps allow the use of multiple files at a time (such as a cohort) and process them accordingly.

  • Some of these can also help process the dataset further as well.

Supplemental Information

Cohort Browser Overview

Filtering a Dataset

2D Plots with Advanced Filtering

Comparing Cohorts

Resources

To create a support ticket if there are technical issues:

  1. Go to the Help header (same section where Projects and Tools are) inside the platform

  2. Select "Contact Support"

  3. Fill in the Subject and Message to submit a support ticket.

See the for how to use these.

Apollo App documentation
Full Documentation
Cohort Browser Documentation
Chart Types