Introduction
Last updated
Was this helpful?
Last updated
Was this helpful?
Before you begin, review the overview documentation and log onto the
If you have never used a JupyterLab notebook before, please view this information:
We can interact with the platform in several different ways and install software packages in these different environments depending on what we are wanting to use and how we want to use it. Here is what we are explaining in these sections:
Many Data Science Tasks are Interactive
Notebook- based analysis
Exploratory Data Analysis (EDA)
Data Preprocessing/ Cleaning
Implementing new Machine Learning/ Model
Building Workflows
Work can be done on a single machine instance
Main Use Cases:
Python/R
Image Processing
ML
Stata
Working with very large datasets that will not fit in memory on a single instance
Using Apollo and querying a large ingested dataset
Need to use Spark based tools such as dxdata, HAIL or GLOW
From Project List: Tools > JupyterLab
New JupyterLab button (top right)
Name your JupyterLab Environment
Snapshot (if you have one): single_jupyterlab.tar.gz
Project: Select Your Project
Choose your configuration
Choose your instance type
Choose your duration
Select your feature
Click Start Environment
In the JupyterLab list, select the one you are wanting to monitor
On the right had side, info about the job will appear
It will look like this:
You can also view this on the monitor tab by selecting in the info "View this Job in Monitor" (highlighted in gold above) or by using the Monitor Tab in the project space. It will look like this on the Monitor Tab
Running instances may take a while to load as the allocations become available.
Once it says "ready" select open
To create a support ticket if there are technical issues:
Go to the Help header (same section where Projects and Tools are) inside the platform
Select "Contact Support"
Fill in the Subject and Message to submit a support ticket.