Schema for modENCODE TopHat Junctions - D. kikkawai TopHat Splice Junctions from modENCODE RNA-Seq
  Database: DkikGB2    Primary Table: modencode_tophat    Row Count: 94,510
fieldexampleSQL type info
bin 585smallint(5) unsigned range
chrom AFFH02000026varchar(255) values
chromStart 850int(10) unsigned range
chromEnd 1150int(10) unsigned range
name JUNC00000001varchar(255) values
score 4int(10) unsigned range
strand -char(1) values
thickStart 850int(10) unsigned range
thickEnd 1150int(10) unsigned range
itemRgb 0int(10) unsigned range
blockCount 2int(10) unsigned range
blockSizes 121,120longblob  
chromStarts 0,180longblob  
expCount 971int(10) unsigned range
expIds 1030longblob  

Sample Rows
 
binchromchromStartchromEndnamescorestrandthickStartthickEnditemRgbblockCountblockSizeschromStartsexpCountexpIds
585AFFH020000268501150JUNC000000014-85011500,0,02121,1200,1809711030
585AFFH02000168334565JUNC0000000216-33456551,102,255276,1040,127410461
585AFFH0200021916251817JUNC000000031+162518170,0,0244,560,13616691761
585AFFH0200021916351800JUNC000000041+163518000,0,0234,660,9916691734
585AFFH0200021917291942JUNC0000000512+1729194251,102,255292,330,18018211909
585AFFH0200021920512295JUNC0000000616+2051229551,102,255293,960,14821442199
585AFFH0200021928403032JUNC000000071+284030320,0,0281,190,17329213013
585AFFH0200021928443079JUNC0000000810+2844307951,102,255281,760,15929253003
585AFFH0200021932103471JUNC0000000932+3210347151,102,255299,970,16433093374
585AFFH0200021934673970JUNC000000105+346739700,0,0289,430,46035563927

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

modENCODE TopHat Junctions (modencode_tophat) Track Description
 

Description

This track was created by mapping D. kikkawai RNA-Seq reads (generated by the modENCODE project) against the D. kikkawai DkikGB2 assembly using Bowtie2 and TopHat. The RNA-Seq data were obtained from the NCBI Sequence Read Archive under the BioProject accession number PRJNA63469.

The TopHat junction predictions from the different libraries are filtered and merged together into a single set of predictions. The predictions are color-coded based on the number of reads supporting the junction:

ColorNumber of reads
> 1000
500-999
100-499
50-99
10-49
< 10

References

Chen ZX et al. Comparative validation of the D. melanogaster modENCODE transcriptome annotation. Genome Res. 2014 Jul;24(7):1209-23.

Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009 May 1;25(9):1105-11.