The Vacc-SeqQC project: Benchmarking RNA-Seq for clinical vaccine studies

Over the last decade, the field of systems vaccinology has emerged, in which high throughput transcriptomics and other omics assays are used to probe changes of the innate and adaptive immune system in response to vaccination. The goal of this study was to benchmark key technical and analytical para...

Full description

Saved in:
Bibliographic Details
Published in:Frontiers in immunology Vol. 13; p. 1093242
Main Authors: Goll, Johannes B, Bosinger, Steven E, Jensen, Travis L, Walum, Hasse, Grimes, Tyler, Tharp, Gregory K, Natrajan, Muktha S, Blazevic, Azra, Head, Richard D, Gelber, Casey E, Steenbergen, Kristen J, Patel, Nirav B, Sanz, Patrick, Rouphael, Nadine G, Anderson, Evan J, Mulligan, Mark J, Hoft, Daniel F
Format: Journal Article
Language:English
Published: Switzerland Frontiers Media S.A 19-01-2023
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Over the last decade, the field of systems vaccinology has emerged, in which high throughput transcriptomics and other omics assays are used to probe changes of the innate and adaptive immune system in response to vaccination. The goal of this study was to benchmark key technical and analytical parameters of RNA sequencing (RNA-seq) in the context of a multi-site, double-blind randomized vaccine clinical trial. We collected longitudinal peripheral blood mononuclear cell (PBMC) samples from 10 subjects before and after vaccination with a live attenuated vaccine and performed RNA-Seq at two different sites using aliquots from the same sample to generate two replicate datasets (5 time points for 50 samples each). We evaluated the impact of (i) filtering lowly-expressed genes, (ii) using external RNA controls, (iii) fold change and false discovery rate (FDR) filtering, (iv) read length, and (v) sequencing depth on differential expressed genes (DEGs) concordance between replicate datasets. Using synthetic mRNA spike-ins, we developed a method for empirically establishing minimal read-count thresholds for maintaining fold change accuracy on a per-experiment basis. We defined a reference PBMC transcriptome by pooling sequence data and established the impact of sequencing depth and gene filtering on transcriptome representation. Lastly, we modeled statistical power to detect DEGs for a range of sample sizes, effect sizes, and sequencing depths. Our results showed that (i) filtering lowly-expressed genes is recommended to improve fold-change accuracy and inter-site agreement, if possible guided by mRNA spike-ins (ii) read length did not have a major impact on DEG detection, (iii) applying fold-change cutoffs for DEG detection reduced inter-set agreement and should be used with caution, if at all, (iv) reduction in sequencing depth had a minimal impact on statistical power but reduced the identifiable fraction of the PBMC transcriptome, (v) after sample size, effect size (i.e. the magnitude of fold change) was the most important driver of statistical power to detect DEG. The results from this study provide RNA sequencing benchmarks and guidelines for planning future similar vaccine studies.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Undefined-3
These authors have contributed equally to this work
This article was submitted to Systems Immunology, a section of the journal Frontiers in Immunology
Edited by: Mahbuba Rahman, McMaster University, Canada
Reviewed by: Samiksha Garse, DY Patil Deemed to be University, India; Mathieu Garand, Washington University in St. Louis, United States; Tengchuan Jin, University of Science and Technology of China, China
ISSN:1664-3224
1664-3224
DOI:10.3389/fimmu.2022.1093242