Schema for genBlastG Genes - genBlastG Gene Predictions
  Database: DyakCAF1    Primary Table: genblastg    Row Count: 30,877   Data last updated: 2024-01-05
field       example                         SQL type              info
bin         585                             smallint(5) unsigned  range
name        gbg_l(2)gl-PE_1                 varchar(255)          values
chrom       chr2L                           varchar(255)          values
strand      -                               char(1)               values
txStart     8643                            int(10) unsigned      range
txEnd       12822                           int(10) unsigned      range
cdsStart    8643                            int(10) unsigned      range
cdsEnd      12822                           int(10) unsigned      range
exonCount   7                               int(10) unsigned      range
exonStarts  8643,8836,9279,9784,10693,1...  longblob
exonEnds    8773,8945,9722,10427,10799,...  longblob
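
The bin column is an index used to speed up range queries; the schema itself does not say how it is derived. As a minimal sketch, assuming this table follows the standard UCSC binning scheme (five levels, with the finest bins spanning 128 kb), the example value can be reproduced from txStart and txEnd. The function name bin_from_range is illustrative and not part of the schema.

```python
# Offset of the first bin at each level, from the finest (128 kb) bins to the
# single coarsest bin, as in the standard UCSC binning scheme. That this table
# uses the standard scheme is an assumption.
BIN_OFFSETS = [512 + 64 + 8 + 1, 64 + 8 + 1, 8 + 1, 1, 0]
BIN_FIRST_SHIFT = 17  # 2**17 bases = 128 kb, the size of the finest bins
BIN_NEXT_SHIFT = 3    # each coarser level has bins 8 times larger


def bin_from_range(start: int, end: int) -> int:
    """Return the smallest bin that fully contains the half-open range [start, end)."""
    start_bin = start >> BIN_FIRST_SHIFT
    end_bin = (end - 1) >> BIN_FIRST_SHIFT
    for offset in BIN_OFFSETS:
        if start_bin == end_bin:
            return offset + start_bin
        start_bin >>= BIN_NEXT_SHIFT
        end_bin >>= BIN_NEXT_SHIFT
    raise ValueError(f"range {start}-{end} is outside the standard binning scheme")


# txStart and txEnd of the example row
print(bin_from_range(8643, 12822))
```

For the example row this prints 585, the value shown in the bin column: the whole gene model falls inside the first 128 kb bin of chr2L.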

Sample Rows
 
bin  name             chrom  strand  txStart  txEnd  cdsStart  cdsEnd  exonCount  exonStarts                                    exonEnds
585  gbg_l(2)gl-PE_1  chr2L  -       8643     12822  8643      12822   7          8643,8836,9279,9784,10693,10856,12106,        8773,8945,9722,10427,10799,12048,12822,
585  gbg_l(2)gl-PF_2  chr2L  -       8643     12822  8643      12822   7          8643,8836,9279,9784,10693,10856,12106,        8773,8945,9722,10427,10799,12048,12822,
585  gbg_l(2)gl-PB_3  chr2L  -       8643     14684  8643      14684   8          8643,8836,9279,9784,10693,10856,12106,14612,  8773,8945,9722,10427,10799,12048,12885,14684,
585  gbg_l(2)gl-PC_4  chr2L  -       8643     14684  8643      14684   8          8643,8836,9279,9784,10693,10856,12106,14600,  8773,8945,9722,10427,10799,12048,12885,14684,
585  gbg_l(2)gl-PG_5  chr2L  -       8643     14684  8643      14684   8          8643,8836,9279,9784,10693,10856,12106,14600,  8773,8945,9722,10427,10799,12048,12885,14684,
585  gbg_l(2)gl-PH_6  chr2L  -       8643     14684  8643      14684   8          8643,8836,9279,9784,10693,10856,12106,14600,  8773,8945,9722,10427,10799,12048,12885,14684,
585  gbg_l(2)gl-PA_7  chr2L  -       8643     14684  8643      14684   8          8643,8836,9279,9784,10693,10856,12106,14600,  8773,8945,9722,10427,10799,12048,12885,14684,
585  gbg_l(2)gl-PI_8  chr2L  -       8643     14684  8643      14684   8          8643,8836,9279,9784,10693,10856,12106,14600,  8773,8945,9722,10427,10799,12048,12885,14684,
585  gbg_l(2)gl-PJ_9  chr2L  -       8643     14684  8643      14684   8          8643,8836,9279,9784,10693,10856,12106,14600,  8773,8945,9722,10427,10799,12048,12885,14684,
585  gbg_l(2)gl-PD_0  chr2L  -       8643     12822  8643      12822   7          8643,8836,9279,9784,10693,10856,12106,        8773,8945,9722,10427,10799,12048,12822,
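
The exonStarts and exonEnds columns hold comma-separated lists with a trailing comma, one entry per exon. A minimal sketch of turning them into exon intervals, using the first sample row as input; the helper name parse_exons is hypothetical, and the bytes handling assumes the database client may return longblob values undecoded.

```python
from typing import List, Tuple, Union


def parse_exons(exon_starts: Union[str, bytes],
                exon_ends: Union[str, bytes]) -> List[Tuple[int, int]]:
    """Pair the exonStarts/exonEnds lists into (start, end) tuples."""
    def to_ints(blob: Union[str, bytes]) -> List[int]:
        # longblob values may arrive as bytes, depending on the client
        text = blob.decode("ascii") if isinstance(blob, bytes) else blob
        # the trailing comma yields an empty last item, which is skipped
        return [int(item) for item in text.split(",") if item]

    starts, ends = to_ints(exon_starts), to_ints(exon_ends)
    if len(starts) != len(ends):
        raise ValueError("exonStarts and exonEnds differ in length")
    return list(zip(starts, ends))


# First sample row, gbg_l(2)gl-PE_1
exons = parse_exons("8643,8836,9279,9784,10693,10856,12106,",
                    "8773,8945,9722,10427,10799,12048,12822,")
exon_lengths = [end - start for start, end in exons]
print(len(exons), sum(exon_lengths))
```

The number of parsed intervals should equal exonCount (7 for this row), which makes a convenient sanity check when loading the table.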

Note: all start coordinates in our database are 0-based, not 1-based. End coordinates are exclusive, so each feature is stored as a half-open interval [start, end).
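
A minimal sketch of converting a table interval to 1-based, fully closed coordinates (as used in GFF output, for example), assuming end coordinates are exclusive as in the usual UCSC half-open convention. The function name to_one_based is illustrative.

```python
from typing import Tuple


def to_one_based(start: int, end: int) -> Tuple[int, int]:
    """Convert a 0-based, half-open interval to a 1-based, inclusive one."""
    # Only the start shifts: the exclusive 0-based end already equals
    # the inclusive 1-based end.
    return start + 1, end


tx_start, tx_end = 8643, 12822  # txStart/txEnd of the first sample row
print(to_one_based(tx_start, tx_end))  # (8644, 12822)
print(tx_end - tx_start)               # feature length in bases: 4179
```

With half-open coordinates the length of a feature is simply end minus start, with no off-by-one correction.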

genBlastG Genes (genblastg) Track Description
 

Description

D. melanogaster proteins from FlyBase were aligned against each scaffold in the D. yakuba (DyakCAF1) assembly, and gene models were then constructed from these alignments using genBlastG.

Methods

D. melanogaster protein sequences were aligned against the D. yakuba (DyakCAF1) genome assembly using WU-TBLASTN with the following parameters:

-B=1000 -V=1000 -hspmax=5000
-hspsepSmax=50000 -hspsepQmax=1000
-altscore='any * -999' -nogaps -hitdist=40
-matrix=BLOSUM80 -Q=12 -R=2 -e 1e-10
-wordmask=seg+xnu -topComboN=1 -links
-notes -warnings -novalidctxok

The WU-TBLASTN results were then analyzed by genBlastG using the following parameters:

-P wublast -r 1 -i 30 -pro -gff -e 1e-10

References

She R, Chu JS, Uyar B, Wang J, Wang K, Chen N. genBlastG: using BLAST searches to build homologous gene models. Bioinformatics. 2011 Aug 1;27(15):2141-3.