Classify Nascent RNA Transcripts using Decision Tree • nascentrna

The goal of nascentrna is to make it easy to narrow in on interesting regions of genome activity measured by nascent RNA sequencing by classifying these active regions in relation to known, annotated genes using a decision tree.

Installation

You can install the latest version of nascentrna from GitHub with:

# install.packages("devtools")
devtools::install_github("coregenomics/nascentrna")

Example

Below is a basic example of running the annotate() function to find interesting gene activity.

library(nascentrna)   # annotate()
library(rtracklayer)  # import.bed()
library(TxDb.Hsapiens.UCSC.hg38.knownGene)

# Obtain transcription start sites derived from experimental nascent RNA data.
file_tss <- system.file(package = "dREG.HD", "extdata", "k562.chr21.predictions.bed")
if (file_tss == "") {
  file_tss <- "https://raw.githubusercontent.com/coregenomics/dREG.HD/master/dREG.HD/inst/extdata/k562.chr21.predictions.bed"
}
tss <- import.bed(file_tss)

# Fetch annotated genes near our TSS data.
genes_all <- genes(TxDb.Hsapiens.UCSC.hg38.knownGene)
genes_long <- subsetByOverlaps(genes_all, range(tss * 0.9))
genes <- genes_long[width(genes) < 1e6]

# Find interesting activity by proximity to genes.
annotate(tss, genes)

nascentrna

Installation

Example

Links

License

Developers

Dev status