Provides curated, ready-to-run notebooks, multi-omics data, and applications specifically designed for bioinformaticians

Notebooks

Geneformer

scGPT

PopV

Visiumccc

Mixscape

NicheNet

CopyKAT

Monocle3

Tangram

Signac

Show all

Premium

Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics

BioTuring

Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics

Single-cell RNA sequencing (scRNA-seq) data have allowed us to investigate cellular heterogeneity and the kinetics of a biological process. Some studies need to understand how cells change state, and corresponding genes during the process, but it is challenging to track the cell development in scRNA-seq protocols. Therefore, a variety of statistical and computational methods have been proposed for lineage inference (or pseudotemporal ordering) to reconstruct the states of cells according to the developmental process from the measured snapshot data. Specifically, lineage refers to an ordered transition of cellular states, where individual cells represent points along. pseudotime is a one-dimensional variable representing each cell’s transcriptional progression toward the terminal state. Slingshot which is one of the methods suggested for lineage reconstruction and pseudotime inference from single-cell gene expression data. In this notebook, we will illustrate an example workflow for cell lineage and pseudotime inference using Slingshot. The notebook is inspired by Slingshot's vignette and modified to demonstrate how the tool works on BioTuring's platform.

Only CPU

slingshot

Identifying tumor cells at the single-cell level using machine learning - inferCNV

BioTuring

Identifying tumor cells at the single-cell level using machine learning - inferCNV

Tumors are complex tissues of cancerous cells surrounded by a heterogeneous cellular microenvironment with which they interact. Single-cell sequencing enables molecular characterization of single cells within the tumor. However, cell annotation—the assignment of cell type or cell state to each sequenced cell—is a challenge, especially identifying tumor cells within single-cell or spatial sequencing experiments. Here, we propose ikarus, a machine learning pipeline aimed at distinguishing tumor cells from normal cells at the single-cell level. We test ikarus on multiple single-cell datasets, showing that it achieves high sensitivity and specificity in multiple experimental contexts. **InferCNV** is a Bayesian method, which agglomerates the expression signal of genomically adjointed genes to ascertain whether there is a gain or loss of a certain larger genomic segment. We have used **inferCNV** to call copy number variations in all samples used in the manuscript.

Only CPU

inferCNV

Spatial charting of single-cell transcriptomes in tissues - celltrek

BioTuring

Spatial charting of single-cell transcriptomes in tissues - celltrek

Single-cell RNA sequencing methods can profile the transcriptomes of single cells but cannot preserve spatial information. Conversely, spatial transcriptomics assays can profile spatial regions in tissue sections but do not have single-cell resolution. Here, Runmin Wei (Siyuan He, Shanshan Bai, Emi Sei, Min Hu, Alastair Thompson, Ken Chen, Savitri Krishnamurthy & Nicholas E. Navin) developed a computational method called CellTrek that combines these two datasets to achieve single-cell spatial mapping through coembedding and metric learning approaches. They benchmarked CellTrek using simulation and in situ hybridization datasets, which demonstrated its accuracy and robustness. They then applied CellTrek to existing mouse brain and kidney datasets and showed that CellTrek can detect topological patterns of different cell types and cell states. They performed single-cell RNA sequencing and spatial transcriptomics experiments on two ductal carcinoma in situ tissues and applied CellTrek to identify tumor subclones that were restricted to different ducts, and specific T-cell states adjacent to the tumor areas.

Only CPU

CellTrek

Hierarchicell: estimating power for tests of differential expression with single-cell data

BioTuring

Hierarchicell: estimating power for tests of differential expression with single-cell data

Power analyses are considered important factors in designing high-quality experiments. However, such analyses remain a challenge in single-cell RNA-seq studies due to the presence of hierarchical structure within the data (Zimmerman et al., 2021). As cells sampled from the same individual share genetic and environmental backgrounds, these cells are more correlated than cells sampled from different individuals. Currently, most power analyses and hypothesis tests (e.g., differential expression) in scRNA-seq data treat cells as if they were independent, thus ignoring the intra-sample correlation, which could lead to incorrect inferences. Hierarchicell (Zimmerman, K.D. and Langefeld, C.D., 2021) is an R package proposed to estimate power for testing hypotheses of differential expression in scRNA-seq data while considering the hierarchical correlation structure that exists in the data. The method offers four important categories of functions: data loading and cleaning, empirical estimation of distributions, simulating expression data, and computing type 1 error or power. In this notebook, we will illustrate an example workflow of Hierarchicell. The notebook is inspired by Hierarchicell's vignette and modified to demonstrate how the tool works on BioTuring's platform.

Only CPU

Hierarchicell

Trends

BioTuring

Bioturing Massive-scale Analysis Solution for CellChat: Running analysis for massive-scale data from Seurat dataset

This tool provides a user-friendly and automated way to analyze large-scale single-cell RNA-seq datasets stored in RDS (Seurat) format. It allows users to run various analysis tools on their data in one command, streamlining the analysis workflow and(More)

Only CPU

CellChat

BioTuring

Webinar scGPT: Towards Building a Foundational Model for Single-Cell Multi-omics Using Generative AI

Generative pre-trained models have demonstrated exceptional success in various fields, including natural language processing and computer vision. In line with this progress, scGPT has been developed as a foundational model tailored specifically for t(More)

Required GPU

scgpt

Seurat

BioTuring

CopyKAT: Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes

Classification of tumor and normal cells in the tumor microenvironment from scRNA-seq data is an ongoing challenge in human cancer study. Copy number karyotyping of aneuploid tumors (***copyKAT***) (Gao, Ruli, et al., 2021) is a method proposed f(More)

Only CPU

copykat

Seurat

BioTuring

Webinar Geneformer: a deep learning model for exploring gene networks

Geneformer is a foundation transformer model pretrained on a large-scale corpus of ~30 million single cell transcriptomes to enable context-aware predictions in settings with limited data in network biology. Here, we will demonstrate a basic workflow(More)

Required GPU

Seurat

Geneformer

BioTuring

Inference and analysis of cell-cell communication using CellChat

Understanding global communications among cells requires accurate representation of cell-cell signaling links and effective systems-level analyses of those links. We construct a database of interactions among ligands, receptors and their cofactor(More)

Required GPU

CellChat

BioTuring

BioTuring Data Converter: Seurat <=> Scanpy for single-cell data transcriptomic and spatial transcriptomics

This notebook illustrates how to convert data from a Seurat object into a Scanpy annotation data and a Scanpy annotation data into a Seurat object using the BioStudio data transformation library (currently under development). It facilitates continued(More)

Only CPU

Scanpy

Seurat

BioTuring

Monorail-pipeline and Recount3

Monorail can be used to process local and/or private data, allowing results to be directly compared to any study in recount3. Taken together, Monorail-pipeline tools help biologists maximize the utility of publicly available RNA-seq data, especially (More)

Only CPU

recount3

BioTuring

Deep learning and alignment of spatially resolved single-cell transcriptomes with Tangram

Charting an organs’ biological atlas requires us to spatially resolve the entire single-cell transcriptome, and to relate such cellular features to the anatomical scale. Single-cell and single-nucleus RNA-seq (sc/snRNA-seq) can profile cells compre(More)

Required GPU

Tangram

BioTuring

Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata.

SCANPY integrates the analysis possibilities of established R-based frameworks and provides them in a scalable and modular form. Specifically, SCANPY provides preprocessing comparable to SEURAT and CELL RANGER, visualization through TSNE, graph-d(More)

Only CPU

Scanpy

BioTuring

WGCNA: an R package for Weighted Gene Correlation Network Analysis

WGCNA: an R package for Weighted Gene Correlation Network Analysis Correlation networks are increasingly being used in bioinformatics applications. For example, weighted gene co-expression network analysis is a systems biology method for describing (More)

Only CPU

WGCNA

BioTuring

FunPat: Function-based Pattern analysis on RNA-seq time series data

Dynamic expression data, nowadays obtained using high-throughput RNA sequencing (RNA-seq), are essential to monitor transient gene expression changes and to study the dynamics of their transcriptional activity in the cell or response to stimuli. FunP(More)

Only CPU

FunPat

BioTuring

Monocle3 - An analysis toolkit for single-cell RNA-seq

Build single-cell trajectories with the software that introduced **pseudotime**. Find out about cell fate decisions and the genes regulated as they're made. Group and classify your cells based on gene expression. Identify new cell types and states a(More)

Only CPU

Monocle

Seurat

BioTuring

COMMOT: Screening cell-cell communication in spatial transcriptomics via collective optimal transport

In this notebook, we present COMMOT (COMMunication analysis by Optimal Transport) to infer cell-cell communication (CCC) in spatial transcriptomic, a package that infers CCC by simultaneously considering numerous ligand–receptor pairs for either sp(More)

Only CPU

COMMOT

BioTuring

MuSiC: Multi-subject Single-cell Deconvolution

Knowledge of cell type composition in disease relevant tissues is an important step towards the identification of cellular targets of disease. MuSiC is a method that utilizes cell-type specific gene expression from single-cell RNA sequencing (RNA-seq(More)

Only CPU

MuSiC

BioTuring

DWLS: Gene Expression Deconvolution Using Dampened Weighted Least Squares

Dampened weighted least squares (DWLS) is an estimation method for gene expression deconvolution, in which the cell-type composition of a bulk RNA-seq data set is computationally inferred. This method corrects common biases towards cell types that ar(More)

Only CPU

DWLS

Notebooks
Bioturing Massive-scale Analysis Solution for CellChat: Running analysis for massive-scale data from Seurat dataset Only CPU CellChat More
scGPT: Towards Building a Foundational Model for Single-Cell Multi-omics Using Generative AI Required GPU scgpt Seurat More
CopyKAT: Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes Only CPU copykat Seurat More
Geneformer: a deep learning model for exploring gene networks Required GPU Seurat Geneformer More
Inference and analysis of cell-cell communication using CellChat Required GPU CellChat More
BioTuring Data Converter: Seurat <=> Scanpy for single-cell data transcriptomic and spatial transcriptomics Only CPU Scanpy Seurat More
Monorail-pipeline and Recount3 Only CPU recount3 More
Deep learning and alignment of spatially resolved single-cell transcriptomes with Tangram Required GPU Tangram More
Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata. Only CPU Scanpy More
WGCNA: an R package for Weighted Gene Correlation Network Analysis Only CPU WGCNA More
FunPat: Function-based Pattern analysis on RNA-seq time series data Only CPU FunPat More
Monocle3 - An analysis toolkit for single-cell RNA-seq Only CPU Monocle Seurat More
COMMOT: Screening cell-cell communication in spatial transcriptomics via collective optimal transport Only CPU COMMOT More
MuSiC: Multi-subject Single-cell Deconvolution Only CPU MuSiC More
DWLS: Gene Expression Deconvolution Using Dampened Weighted Least Squares Only CPU DWLS More

...