HTAN Bulk RNA Sequencing Data Standard

Overview

This page describes the data levels, metadata attributes, and file structure for bulk RNA sequencing.

Description of Assay

Bulk RNA sequencing identifies the average gene expression profile of a biological sample.

Metadata Levels

The defined metadata leverages existing common data elements from the Genomic Data Commons (GDC). The HTAN data model currently supports Level 1, 2 and 3 RNA sequencing data:  

Level Number

Definition

Example Data

1

Unaligned reads

FASTQ

2

Aligned reads

BAM

3

Gene level expression, unnormalized

Gene & isoform expression-level data (.csv)