Applying random forest classifiers to single-cell RNAseq data
- Published: 11 Jul 2024
- Learn how to apply machine learning to single-cell data. Random forest is a powerful machine learning classifier and a great tool for analyzing single-cell RNAseq data. In addition to predicting classifications, you can extract the gene importance from the model as a way to identify genes that describe your populations. Here I use several examples to show you how to use the random forest model in Python to do single-cell analyses.
Notebook:
github.com/mousepixels/sanbom...
Reference:
www.nature.com/articles/s4158...
tabula-sapiens-portal.ds.czbi...
0:00 - Intro
1:10 - Basic RF usage
5:05 - Classifying cells in other data
12:15 - Classifying cells in same data
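The core workflow from the description — train a random forest on labeled cells, then read off gene importances — can be sketched in a few lines. This is a minimal illustration on synthetic data (the matrix, labels, and gene names here are all mock stand-ins, not the notebook's data); in practice `X` would be an expression matrix from your AnnData object:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Mock cells x genes counts matrix with known cell-type labels;
# with real data, X would come from e.g. adata.X and y from adata.obs
rng = np.random.default_rng(0)
n_cells, n_genes = 300, 50
X = rng.poisson(1.0, size=(n_cells, n_genes)).astype(float)
y = rng.integers(0, 3, size=n_cells)   # three mock cell types
X[:, 0] += y * 3.0                     # make gene_0 informative
X[:, 1] += (y == 2) * 4.0              # make gene_1 mark type 2

genes = [f"gene_{i}" for i in range(n_genes)]
rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(X, y)

# feature_importances_ gives one score per gene: a simple way to ask
# which genes the forest relied on to separate the populations
top5 = sorted(zip(genes, rf.feature_importances_), key=lambda g: -g[1])[:5]
print(top5)
```

Here the two artificially informative genes should dominate the importance ranking; with real data you would inspect the top-ranked genes as candidate markers for your populations.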
Thanks for your helpful and informative video! Looking forward to some videos about scATAC-seq.
Sometime in the future! I have a few more planned before that and not enough free time. But one day!
Love your work..
Thank you!
Thank you so much!
You're welcome!
Cool video again. Would you be able to make a video on neural networks applied to scRNA-seq?
Thank you so much, amazing video. Can you please tell me where I can get this kind of dataset to try this?
Any single-cell paper should have a data availability (or equivalent) section that contains links to the raw data or counts tables. Or you can search something like NCBI GEO directly. Or you can look at the list of publications on the 10x Genomics website.
Amazing video! Are there any existing pre-trained models which we can directly use to auto-annotate cell types given cell clusters?
Not sure about pre-trained models. There are simple models like SingleR or CellTypist. But if you have a reference dataset you can train a model with scANVI. I have a video on that.
@sanbomics Thanks!
This is amazing! So helpful! I'm looking at applying some of these to publicly available data. How would this workflow change for k-nearest neighbour classification ? What would one need to change to do this ?
I haven't tried KNN for classification in single-cell, but neighborhood graphs are used all the time for unsupervised sc clustering. I'm not sure how well KNN would work without dimension reduction first but you could definitely try it. But dim reduction, like PCA, will require processing of your train/test together. Maybe there is a better way to do dim reduction but keep the train/test independent. RF is pretty flexible with the number of features. TLDR, I don't know, you should try it with only the variable features and see how accurate it is. Please let me know because I am curious!
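One way to address the train/test independence concern raised above is to fit the PCA inside a scikit-learn pipeline, so the reduction is learned from the training cells only and merely applied to the test cells. A minimal sketch with synthetic data standing in for a real expression matrix (all names and dimensions here are illustrative assumptions):

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.model_selection import train_test_split

# Mock cells x genes matrix; replace with your (log-normalized) expression data
rng = np.random.default_rng(1)
X = rng.normal(size=(400, 100))
y = rng.integers(0, 3, size=400)
X[:, :5] += y[:, None] * 2.0  # make a few genes informative

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# PCA is fit on the training split only, then applied to the test split,
# so the two stay independent while KNN still gets a reduced space
knn = make_pipeline(PCA(n_components=20), KNeighborsClassifier(n_neighbors=15))
knn.fit(X_tr, y_tr)
acc = knn.score(X_te, y_te)
print(acc)
```

Whether 15 neighbors or 20 components is sensible depends entirely on your data; treat these as placeholders to tune.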
@sanbomics I tried it. How can I send you the code? I'm trying to do a ROC curve with it as well, but the kernel keeps dying (even when I'm running it on the cluster).
You can upload it to a public GitHub repository. Were you able to fix it? (Sorry, I just saw this; I don't get notifications for responses to my response.)
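For the multi-class ROC question above, computing a one-vs-rest AUC directly from class probabilities is lighter-weight than building full per-class curves, which may help if plotting loops are exhausting memory. A hedged sketch on synthetic data (the data and classifier here are placeholders, not the commenter's code):

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Mock cells x genes matrix with three mock cell types
rng = np.random.default_rng(2)
X = rng.normal(size=(300, 30))
y = rng.integers(0, 3, size=300)
X[:, 0] += y * 1.5  # one informative feature

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_tr, y_tr)

# One-vs-rest macro AUC from the probability matrix, no plotting needed
proba = clf.predict_proba(X_te)
auc = roc_auc_score(y_te, proba, multi_class="ovr")
print(auc)
```

If the full curve is still needed, computing `roc_curve` one class at a time and discarding intermediates may keep memory use down.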
That was cool. Do you use your PC to run ML tasks, or are you using HPC systems?
Usually just my PC, but sometimes an AWS EC2 instance with Nvidia GPUs. Simple models like RF don't take much processing power at all. My PC is decently beefy too, with an Nvidia GPU, 128 GB of memory, and 24 CPU cores.