Schema for modENCODE TopHat Junctions - D. rhopaloa TopHat Splice Junctions from modENCODE RNA-Seq
  Database: DrhoGB2    Primary Table: modencode_tophat    Row Count: 104,281
fieldexampleSQL type info
bin 585smallint(5) unsigned range
chrom AFPP02000013varchar(255) values
chromStart 223int(10) unsigned range
chromEnd 676int(10) unsigned range
name JUNC00000001varchar(255) values
score 2int(10) unsigned range
strand +char(1) values
thickStart 223int(10) unsigned range
thickEnd 676int(10) unsigned range
itemRgb 0int(10) unsigned range
blockCount 2int(10) unsigned range
blockSizes 88,12longblob  
chromStarts 0,441longblob  
expCount 311int(10) unsigned range
expIds 664longblob  

Sample Rows
 
binchromchromStartchromEndnamescorestrandthickStartthickEnditemRgbblockCountblockSizeschromStartsexpCountexpIds
585AFPP02000013223676JUNC000000012+2236760,0,0288,120,441311664
585AFPP02000013229741JUNC000000023+2297410,0,0282,890,423311652
585AFPP02000013258702JUNC000000031+2587020,0,0242,500,394300652
585AFPP02000013792952JUNC000000041-7929520,0,0275,250,135867927
585AFPP02000017383616JUNC000000051+3836160,0,0249,510,182432565
585AFPP02000026541057JUNC0000000686+54105734,139,34296,1190,884150938
585AFPP020000292201021JUNC00000007158-2201021192,96,154298,980,703318923
585AFPP02000030631021JUNC0000000836+63102151,102,2552124,800,878187941
585AFPP020000302541024JUNC0000000922+254102451,102,2552110,830,687364941
585AFPP0200003333224JUNC000000107-332240,0,0293,440,147126180

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

modENCODE TopHat Junctions (modencode_tophat) Track Description
 

Description

This track was created by mapping D. rhopaloa RNA-Seq reads (generated by the modENCODE project) against the D. rhopaloa DrhoGB2 assembly using Bowtie2 and TopHat. The RNA-Seq data were obtained from the NCBI Sequence Read Archive under the BioProject accession number PRJNA63469.

The TopHat junction predictions from the different libraries are filtered and merged together into a single set of predictions. The predictions are color-coded based on the number of reads supporting the junction:

ColorNumber of reads
> 1000
500-999
100-499
50-99
10-49
< 10

References

Chen ZX et al. Comparative validation of the D. melanogaster modENCODE transcriptome annotation. Genome Res. 2014 Jul;24(7):1209-23.

Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009 May 1;25(9):1105-11.