Schema for modENCODE TopHat Junctions - D. takahashii TopHat Splice Junctions from modENCODE RNA-Seq
  Database: DtakGB2    Primary Table: modencode_tophat    Row Count: 110,100
field        example        SQL type               info
bin          585            smallint(5) unsigned   range
chrom        AFFI02000003   varchar(255)           values
chromStart   11221          int(10) unsigned       range
chromEnd     31033          int(10) unsigned       range
name         JUNC00000001   varchar(255)           values
score        1007           int(10) unsigned       range
strand       -              char(1)                values
thickStart   11221          int(10) unsigned       range
thickEnd     31033          int(10) unsigned       range
itemRgb      13118744       int(10) unsigned       range
blockCount   2              int(10) unsigned       range
blockSizes   109,121        longblob
chromStarts  0,19691        longblob
expCount     11330          int(10) unsigned       range
expIds       30912          longblob
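
The bin field is an index column that speeds up range queries. The page does not say how it is computed; the following is a minimal sketch, assuming the table follows the standard UCSC Genome Browser binning scheme (five levels, with the smallest bins spanning 128 kb). The function name bin_from_range is illustrative only.

```python
# Assumption: standard UCSC binning. Offsets of the first bin at each
# level, from the finest level (128 kb bins) to the coarsest (512 Mb).
BIN_OFFSETS = [512 + 64 + 8 + 1, 64 + 8 + 1, 8 + 1, 1, 0]
BIN_FIRST_SHIFT = 17  # 2**17 = 128 kb, size of the finest bins
BIN_NEXT_SHIFT = 3    # each coarser level is 8x larger


def bin_from_range(start: int, end: int) -> int:
    """Return the smallest bin that fully contains [start, end).

    start is 0-based and end is exclusive, as in the table.
    """
    start_bin = start >> BIN_FIRST_SHIFT
    end_bin = (end - 1) >> BIN_FIRST_SHIFT
    for offset in BIN_OFFSETS:
        if start_bin == end_bin:
            return offset + start_bin
        # The range straddles a bin boundary: try the next coarser level.
        start_bin >>= BIN_NEXT_SHIFT
        end_bin >>= BIN_NEXT_SHIFT
    raise ValueError(f"range {start}-{end} is out of bounds for this scheme")


# chromStart and chromEnd from the example column above
print(bin_from_range(11221, 31033))
```

Under this assumption, the example row (chromStart 11221, chromEnd 31033) falls in bin 585, which matches the value shown in the schema.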

Sample Rows
 
bin  chrom         chromStart  chromEnd  name          score  strand  thickStart  thickEnd  itemRgb    blockCount  blockSizes  chromStarts  expCount  expIds
585  AFFI02000003  11221       31033     JUNC00000001  1007   -       11221       31033     200,45,24  2           109,121     0,19691      11330     30912
585  AFFI02000003  11262       32047     JUNC00000002  5      -       11262       32047     0,0,0      2           68,78       0,20707      11330     31969
585  AFFI02000003  11277       11411     JUNC00000003  1      -       11277       11411     0,0,0      2           53,19       0,115        11330     11392
585  AFFI02000003  30833       30986     JUNC00000004  2      -       30833       30986     0,0,0      2           26,74       0,79         30859     30912
585  AFFI02000003  31056       32092     JUNC00000005  1433   -       31056       32092     200,45,24  2           123,123     0,913        31179     31969
585  AFFI02000003  31058       32420     JUNC00000006  3      -       31058       32420     0,0,0      2           121,36      0,1326       31179     32384
585  AFFI02000003  31105       32176     JUNC00000007  4      -       31105       32176     0,0,0      2           74,51       0,1020       31179     32125
585  AFFI02000003  31969       32249     JUNC00000008  78     -       31969       32249     34,139,34  2           91,124      0,156        32060     32125
585  AFFI02000003  32271       32665     JUNC00000009  2      -       32271       32665     0,0,0      2           61,39       0,355        32332     32626
585  AFFI02000003  32433       32673     JUNC00000010  1      -       32433       32673     0,0,0      2           78,47       0,193        32511     32626
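
Each sample row describes a junction as two blocks. The sketch below shows one way to recover the spliced-out interval from chromStart, blockSizes, and chromStarts. It assumes the usual TopHat junction layout, in which the two blocks are the flanking read overhangs and the gap between them is the intron; the page itself does not state this. The function name junction_interval is hypothetical.

```python
from typing import Tuple


def junction_interval(chrom_start: int, block_sizes: str,
                      chrom_starts: str) -> Tuple[int, int]:
    """Return (intron_start, intron_end) as a 0-based, half-open interval.

    block_sizes and chrom_starts are the comma-separated strings stored
    in the table; chrom_starts holds offsets relative to chrom_start.
    """
    sizes = [int(x) for x in block_sizes.split(",") if x]
    offsets = [int(x) for x in chrom_starts.split(",") if x]
    if len(sizes) != 2 or len(offsets) != 2:
        raise ValueError("expected exactly two blocks per junction")
    intron_start = chrom_start + offsets[0] + sizes[0]  # end of left block
    intron_end = chrom_start + offsets[1]               # start of right block
    return intron_start, intron_end


# JUNC00000001 from the sample rows
print(junction_interval(11221, "109,121", "0,19691"))
```

For JUNC00000001 this gives (11330, 30912). In all ten sample rows the two computed values coincide with the expCount and expIds columns.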

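Note: all start coordinates in our database are 0-based, not 1-based. End coordinates are exclusive, so each feature covers the half-open interval from chromStart up to, but not including, chromEnd.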
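
As a small illustration of the note above, the following sketch converts a table row to the 1-based, inclusive position string that genome browsers typically display. It assumes end coordinates are exclusive (the usual convention for tables of this kind), so only the start needs to be shifted. The helper name to_one_based is hypothetical.

```python
def to_one_based(chrom: str, chrom_start: int, chrom_end: int) -> str:
    """Format a 0-based, half-open interval as a 1-based, inclusive position."""
    # Start shifts by one; the exclusive end already equals the
    # 1-based inclusive end, so it is left unchanged.
    return f"{chrom}:{chrom_start + 1}-{chrom_end}"


# JUNC00000001 from the sample rows
print(to_one_based("AFFI02000003", 11221, 31033))
```

The feature length is the same in both conventions: chromEnd - chromStart in 0-based coordinates, or end - start + 1 in 1-based coordinates.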

modENCODE TopHat Junctions (modencode_tophat) Track Description
 

Description

This track was created by mapping D. takahashii RNA-Seq reads (generated by the modENCODE project) against the D. takahashii DtakGB2 assembly using Bowtie2 and TopHat. The RNA-Seq data were obtained from the NCBI Sequence Read Archive under the BioProject accession number PRJNA63469.

The TopHat junction predictions from the different libraries are filtered and merged together into a single set of predictions. The predictions are color-coded based on the number of reads supporting the junction:

Color    Number of reads
         > 1000
         500-999
         100-499
         50-99
         10-49
         < 10
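
The binning above can be sketched as a small lookup. This is an illustration under stated assumptions, not the track's actual code: it assumes the score column holds the supporting read count, and it places a count of exactly 1000 in the top bin, since the listed ranges leave that value unassigned. The names read_count_bin and unpack_rgb are hypothetical. The second helper unpacks an integer itemRgb value, such as the schema example 13118744, into the R,G,B triple shown in the sample rows.

```python
from typing import Tuple


def read_count_bin(reads: int) -> str:
    """Map a supporting read count to the label of its color bin."""
    if reads >= 1000:  # assumption: exactly 1000 joins the top bin
        return "> 1000"
    if reads >= 500:
        return "500-999"
    if reads >= 100:
        return "100-499"
    if reads >= 50:
        return "50-99"
    if reads >= 10:
        return "10-49"
    return "< 10"


def unpack_rgb(packed: int) -> Tuple[int, int, int]:
    """Split a packed 24-bit itemRgb integer into (red, green, blue)."""
    return (packed >> 16) & 0xFF, (packed >> 8) & 0xFF, packed & 0xFF


print(read_count_bin(1007), unpack_rgb(13118744))
```

Only three bins appear in the sample rows: scores 1007 and 1433 carry itemRgb 200,45,24, score 78 carries 34,139,34, and scores below 10 carry 0,0,0.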

References

Chen ZX et al. Comparative validation of the D. melanogaster modENCODE transcriptome annotation. Genome Res. 2014 Jul;24(7):1209-23.

Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009 May 1;25(9):1105-11.