Skip to content

Brodinlab/ME-CFS_study

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ME/CFS INMEST Study

ME/CFS study from Swedish clinical cohort (INMEST): single-level analyses and omics integration.

Table of contents

General info

This project used multiple omics data:

  • Plasma protein expression (Olink - NPX values)
  • Cell abundance (CyTOF - Grid cell abundance)
  • mRNA sequencing (VST counts)

Single-level analyses and omics-integration was performed in order to characterize this heterogeneous ME/CFS cohort and the effects of the INMEST treatment throughout the clinical trial.

Dependencies

Project is created with:

  • RStudio version: 3.6.0
  • Python version: 2.7
  • Unix/Linux

Repo description

  • DESeq2/ contains script to convert Kallisto estimates and run DESeq2
  • MOFA/ contains script to train MOFA model and some functions to uncover sources of variation
  • Figures/ contains scripts used for analyses and figures displayed in paper
  • MEM_clean.R used for mixed-effect modelling
  • gsea_clean.R used for gene set enrichment analyses and enrichment plot

Single-level analyses

Grid

  • Grid is a supervised learning algorithm based on t-SNE implementation. It uses the manual classification of cells in CyTOF samples and then the automatic classification of cells in new samples through the use of machine learning techniques based on these manual classifications.
  • To run Grid, install it using:
$ pip install cellgrid

DESeq2 (Differential Gene Expression Analysis)

  • The principles behind DESeq2 is described in Love et al. (2014)
  • Used Kallisto outputs (estimates) that were converted into read counts using tximport before running DESeq2 with deseq_run.R

GSEA (Gene Set Enrichment Analyses)

  • clusterProfiler::gseGO() used in gsea_clean.R to run hierarchical gsea and plot
  • Also, extracts results from GSEA used in GO_plot.R

Mixed-Effect modelling

  • Partial-bayesian mixed-effect modelling to account for covariates
  • MEM_clean.R extracts confounding variables for downstream use in GSEA for example, after modelling

Omics integration

MOFA

  • Multi-Omics Factor Analysis (MOFA) was used in this study in order to deconvolute the main sources of variation in the differents sets of data mentioned above. For more information, read their published Methods paper Argelaguet et al. (2018).
  • MOFA is publicly accessible here: https://github.com/bioFAM/MOFA

Figures

Spearman Correlation

  • spearman_corrmatrix.R uses cell abundance dataframe built from Grid that is sub-setted by active treatments for correlation
  • Option to re-order one matrix in accordance to the other for comparison purposes

GO plots

  • Post GSEA, GO_plot.R extracts set of genes associated with GO terms to plot
  • Original counts used to calculate median expression and log2 ratio for active treatments (covariate)
  • Solely, significant features of MEM used

Volcano MEM

  • Volcano_MEM_clean.R reads in generated MEM table built in MEM_clean.R to build volcano plots of features with significant ones highlighted and a top sub-set of them labeled

Disease Tolerance Network

  • visNetwork_clean.R reads in an edge and node file manipulated from TTRUST v2 output to build disease tolerance network with color gradient based on log2 expression and size of nodes from number of regulatory interactions

Setup

To run this project, install it locally using devtools:

$ install.packages('devtools')
$ library(devtools)
$ install_github('Brodinlab/ME-CFS_study')
$ library(ME-CFS_study)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages