Skip to content

Analysing 10x genomics PBMC dataset, employing different types of predictive techniques for cellular classification

Notifications You must be signed in to change notification settings

JaydenBeckwith/scRNAseq

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 

Repository files navigation

scRNAseq overview - PBMC analysis

Analysing 10x genomics PBMC dataset, employing different types of predictive techniques for cellular classification

The main libaries that were used in this analysis included - Scanpy, Matplotlib, Numpy, TensorFlow and Scikit-learn. The analysis was broken up into

  1. Preprocessing and Quality Control

Reading the raw data, filtering cells based on quality metrics, and normalizing the data.

  1. Feature Selection

This step identifies highly variable genes in the dataset.

  1. Dimensionality Reduction

This step uses PCA and UMAP to reduce the dimensionality of the dataset.

  1. Clustering

This step identifies clusters of cells based on their gene expression profiles.

  1. Machine Learning classification

Using an autoenconder, random forest and KNN classification techniques with CV and model evaluation

About

Analysing 10x genomics PBMC dataset, employing different types of predictive techniques for cellular classification

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published