Schema for Head Tissues TopHat Junctions - Junctions predicted by TopHat using modENCODE RNA-Seq from Head Tissues
  Database: DmojImproved    Primary Table: head_junctions    Row Count: 59,141
fieldexampleSQL type info
bin 585smallint(5) unsigned range
chrom improved_6498varchar(255) values
chromStart 52145int(10) unsigned range
chromEnd 53233int(10) unsigned range
name JUNC00000001varchar(255) values
score 7int(10) unsigned range
strand +char(1) values
thickStart 52145int(10) unsigned range
thickEnd 53233int(10) unsigned range
itemRgb 0int(10) unsigned range
blockCount 2int(10) unsigned range
blockSizes 71,70longblob  
chromStarts 0,1018longblob  
expCount 52216int(10) unsigned range
expIds 53163longblob  

Sample Rows
 
binchromchromStartchromEndnamescorestrandthickStartthickEnditemRgbblockCountblockSizeschromStartsexpCountexpIds
585improved_64985214553233JUNC000000017+52145532330,0,0271,700,10185221653163
585improved_64985219353285JUNC000000021+52193532850,0,0223,530,10395221653232
585improved_64985220453219JUNC000000035+52204532190,0,0251,560,9595225553163
585improved_64985224453234JUNC0000000431+522445323451,102,255274,710,9195231853163
585improved_64985239653186JUNC000000052+52396531860,0,0253,230,7675244953163
585improved_64987836297988JUNC000000062+78362979880,0,0258,330,195937842097955
585improved_64988032280484JUNC000000072-80322804840,0,0267,380,1248038980446
585improved_64989805298252JUNC0000000840+980529825251,102,255273,750,1259812598177
585improved_64989805598271JUNC0000000925+980559827151,102,255270,750,1419812598196
586improved_6498207074213119JUNC000000104+2070742131190,0,0269,310,6014207143213088

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Head Tissues TopHat Junctions (head_junctions) Track Description
 

Description

This track was created by mapping D. mojavensis RNA-Seq reads from head tissues against the D. mojavensis DmojImproved genome assembly using TopHat2. The RNA-Seq data were generated by modENCODE project and the raw data is available for download at the NCBI Sequence Read Archive (SRA):

Accession numberDescriptionPlatform
SRX017458 Female heads (rep1) Illumina Genome Analyzer
SRX017459 Female heads (rep2) Illumina Genome Analyzer II
SRX017460 Male heads (rep1) Illumina Genome Analyzer
SRX017461 Male heads (rep2) Illumina Genome Analyzer II

The TopHat junction predictions from the different libraries are filtered and merged together into a single set of predictions. The predictions are color-coded based on the number of reads supporting the junction:

ColorNumber of reads
> 1000
500-999
100-499
50-99
10-49
< 10

Methods

TopHat2 was run against the D. mojavensis genome using the following parameters: -g 1 --b2-very-sensitive --min-intron-length 30 --max-intron-length 150000

References

TopHat is a fast splice junction mapper for RNA-Seq reads. It aligns RNA-Seq reads to mammalian-sized genomes using the ultra high-throughput short read aligner Bowtie, and then analyzes the mapping results to identify splice junctions between exons.

The RNA-Seq data were submitted by the modENCODE project. The original RNA-Seq dataset can be obtained from the NCBI GEO database under the accession number GSE20882.