HTAN Bulk RNA Sequencing Data Standard
Overview
This page describes the data levels, metadata attributes, and file structure for bulk RNA sequencing.
Description of Assay
Bulk RNA sequencing identifies the average gene expression profile of a biological sample.
Metadata Levels
The defined metadata leverages existing common data elements from the Genomic Data Commons (GDC). The HTAN data model currently supports Level 1, 2 and 3 RNA sequencing data:
Level Number |
Definition |
Example Data |
1 |
Unaligned reads |
FASTQ |
2 |
Aligned reads |
BAM |
3 |
Gene level expression, unnormalized |
Gene & isoform expression-level data (.csv) |