Schema for RefSeq Genes - BLAT Alignments of NCBI RefSeq Genes
  Database: DsimGB2    Primary Table: refseqRNA    Row Count: 29,275   Data last updated: 2022-10-20
fieldexampleSQL type info
bin 585smallint(5) unsigned range
chrom CM002910varchar(255) values
chromStart 4002int(10) unsigned range
chromEnd 12681int(10) unsigned range
name XM_016179293varchar(255) values
score 1000int(10) unsigned range
strand -char(1) values
thickStart 5294int(10) unsigned range
thickEnd 11255int(10) unsigned range
reserved 0int(10) unsigned range
blockCount 9int(10) unsigned range
blockSizes 1422,109,443,643,106,1180,7...longblob  
chromStarts 0,1485,1856,2371,3579,3754,...longblob  

Sample Rows
 
binchromchromStartchromEndnamescorestrandthickStartthickEndreservedblockCountblockSizeschromStarts
585CM002910400212681XM_0161792931000-529411255091422,109,443,643,106,1180,779,160,279,0,1485,1856,2371,3579,3754,4992,7169,8400,
585CM002910400212681XM_0161792921000-529411255091422,109,443,643,106,1192,779,160,279,0,1485,1856,2371,3579,3742,4992,7169,8400,
585CM0029101306416466XM_0161792911000-131551634405860,193,880,282,527,0,923,1174,2109,2875,
585CM0029101306416466XM_0392911901000-131551634406619,160,193,880,282,527,0,700,923,1174,2109,2875,
585CM0029101306416181XM_0392911911000-131551618105860,193,880,282,81,0,923,1174,2109,3036,
585CM0029101306416466XM_0392911921000-131551634406860,193,96,385,282,527,0,923,1174,1669,2109,2875,
585CM0029101398616466XM_0392911931000-141251634404194,890,282,527,0,242,1187,1953,
585CM0029101658346103XM_0161792801000-17728448950141313,199,438,226,194,87,450,47,193,2547,197,557,102,169,0,1388,1677,2484,3201,3450,8742,9475,9645,10666,13451,14171,28242,29351,
585CM0029101658346930XM_0392911801000-17728448950141313,199,438,226,194,87,2877,47,493,2547,197,557,102,996,0,1388,1677,2484,3201,3450,5264,9475,9645,10666,13451,14171,28242,29351,
585CM0029101658346039XM_0161792841000-17728448950121313,199,438,226,194,87,47,493,197,557,102,105,0,1388,1677,2484,3201,3450,9475,9645,13451,14171,28242,29351,

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

RefSeq Genes (refseqRNA) Track Description
 

Description

D. simulans transcripts from the NCBI RefSeq database were aligned against the D. simulans assembly using BLAT. The thicker boxes denote the coding regions within the transcripts and the thinner boxes denote the untranslated regions.

Methods

D. simulans transcripts were mapped against the D. simulans genome assembly using BLAT with the following parameters:

    -q=rna -fine -minScore=20 -stepSize=5

The transcript alignments are analyzed by pslReps using the following parameter:

    -minCover=0.15 -minAli=0.98 -nearTop=0.001

The alignments are then filtered by pslCDnaFilter using the following parameters:

    -minId=0.95 -minCover=0.15 -localNearBest=0.001 \
    -minQSize=20 -ignoreIntrons -repsAsMatch -ignoreNs -bestOverlap

References

Kent WJ. BLAT — the BLAST-like alignment tool. Genome Res. 2002 Apr;12(4):656-64.

The transcripts were obtained from the NCBI FTP site under the RefSeq assembly accession number GCF_016746395.2.