Real biological datasets for practicing data analysis and building your Python skills. All datasets are research-grade and used in published studies.
The Cancer Dependency Map is a genome-wide CRISPR screening dataset that measures how dependent cancer cell lines are on each gene for survival. High dependency scores indicate genes that are essential for cancer cell survival.
Complete gene dependency dataset from the Cancer Dependency Map project
Curated subset focusing on breast and myeloid cancers
Click download to save the CSV file to your computer
Use pandas.read_csv() to import the data into your notebook
Follow along with lecture notebooks to discover biological insights
import pandas as pd
df = pd.read_csv('combined_model_crispr_data_filtered.csv')
df.head()