Introduction

HTAN centers are generating bulk RNA sequencing metadata. This set of metadata attributes is based on the GDC data dictionary for RNA-seq and also includes QC variables coordinated with individual HTAN centers and working groups, with downstream analysis use cases in mind.

Files organized by levels

The files are organized by Levels (a concept borrowed from TCGA / GDC):

Level 1 – Unaligned read data (FASTQ)

Level 2 – Aligned reads (BAM)

Level 3 – Gene and isoform expression (CSV)
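
As an illustration of the level scheme above, here is a minimal Python sketch that guesses a file's level from its name. The `LEVELS` table and the `infer_level` helper are hypothetical names, not part of any HTAN tooling. The suffix lists are also an assumption: the text names only the FASTQ, BAM, and CSV formats, not specific file extensions.

```python
from pathlib import PurePosixPath
from typing import Optional

# Level -> (description, file suffixes).
# Descriptions follow the list above; the suffixes are illustrative
# assumptions, since only the formats FASTQ, BAM, and CSV are named.
LEVELS = {
    1: ("Unaligned read data", (".fastq", ".fastq.gz", ".fq", ".fq.gz")),
    2: ("Aligned reads", (".bam",)),
    3: ("Gene and isoform expression", (".csv",)),
}


def infer_level(filename: str) -> Optional[int]:
    """Return the level whose suffix matches the file name, or None."""
    name = PurePosixPath(filename).name.lower()
    for level, (_description, suffixes) in LEVELS.items():
        # str.endswith accepts a tuple and matches any of its entries.
        if name.endswith(suffixes):
            return level
    return None


if __name__ == "__main__":
    for example in ("sample_R1.fastq.gz", "sample.bam", "expression.csv"):
        print(example, "-> Level", infer_level(example))
```

A file name alone does not establish a file's level; a check like this is at most a convenience for sorting files before their metadata is filled in.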