HTAN centers are generating bulk RNA sequencing metadata. This set of metadata attributes is based on the GDC data dictionary for RNA-seq and also includes QC variables coordinated with individual HTAN centers and working groups, with downstream analysis use cases in mind.
Files organized by levels
The files are organized by levels (a concept borrowed from TCGA / GDC):
Level 1 – Unaligned read data (FASTQ)
Level 2 – Aligned reads (BAM)
Level 3 – Gene and isoform expression (CSV)
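As a rough illustration of the level scheme above, the following sketch guesses a file's level from its name. The function `infer_level`, the suffix table, and the handling of `.gz` compression are assumptions made for this example and are not part of the HTAN data model; in practice a file's level is recorded in its metadata and not derived from the filename.

```python
from pathlib import Path

# Hypothetical suffix-to-level table, based only on the formats named above.
# The ".fq" alias is an assumption added for illustration.
LEVEL_BY_SUFFIX = {
    ".fastq": 1,  # Level 1: unaligned read data
    ".fq": 1,
    ".bam": 2,    # Level 2: aligned reads
    ".csv": 3,    # Level 3: gene and isoform expression
}

# Compression suffixes to ignore when looking for the format suffix (assumed).
COMPRESSION_SUFFIXES = {".gz"}


def infer_level(filename):
    """Return the guessed level (1, 2, or 3) for a filename, or None if unknown."""
    suffixes = [s.lower() for s in Path(filename).suffixes]
    # Strip trailing compression suffixes such as ".gz".
    while suffixes and suffixes[-1] in COMPRESSION_SUFFIXES:
        suffixes.pop()
    if not suffixes:
        return None
    return LEVEL_BY_SUFFIX.get(suffixes[-1])


if __name__ == "__main__":
    for name in ["sample_R1.fastq.gz", "aligned.bam", "expression.csv", "notes.txt"]:
        print(name, "->", infer_level(name))
```

A lookup table keeps the format-to-level correspondence in one place, so it is easy to adjust if a center delivers files under other extensions.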