Schema for GeMoMa Genes - GeMoMa Gene Predictions
  Database: DhydGB1    Primary Table: GeMoMa    Row Count: 11,329   Data last updated: 2022-10-20
field         example                         SQL type                               info
bin           585                             smallint(5) unsigned                   range
name          FBTR0307895_R0                  varchar(255)                           values
chrom         NWQH01000001                    varchar(255)                           values
strand        +                               char(1)                                values
txStart       25096                           int(10) unsigned                       range
txEnd         32312                           int(10) unsigned                       range
cdsStart      25096                           int(10) unsigned                       range
cdsEnd        32312                           int(10) unsigned                       range
exonCount     11                              int(10) unsigned                       range
exonStarts    25096,25277,25485,26820,278...  longblob
exonEnds      25132,25365,25634,27118,279...  longblob
score         0                               int(11)                                range
name2         gene_0                          varchar(255)                           values
cdsStartStat  unk                             enum('none', 'unk', 'incmpl', 'cmpl')  values
cdsEndStat    unk                             enum('none', 'unk', 'incmpl', 'cmpl')  values
exonFrames    0,0,1,0,1,1,0,1,2,0,0,          longblob

Sample Rows
 
bin  name            chrom         strand  txStart  txEnd   cdsStart  cdsEnd  exonCount  exonStarts  exonEnds  score  name2  cdsStartStat  cdsEndStat  exonFrames
585  FBTR0307895_R0  NWQH01000001  +       25096    32312   25096     32312   11         25096,25277,25485,26820,27805,28045,28584,29834,31514,31673,32213,  25132,25365,25634,27118,27988,28326,28735,30000,31614,32153,32312,  0  gene_0  unk  unk  0,0,1,0,1,1,0,1,2,0,0,
585  FBTR0077609_R0  NWQH01000001  -       38662    43116   38662     43116   6          38662,39239,40325,40605,40856,43006,  39180,40217,40543,40758,41318,43116,  0  gene_126  unk  unk  1,1,2,2,2,0,
585  FBTR0081916_R0  NWQH01000001  -       73243    74676   73243     74676   4          73243,73664,73907,74238,  73600,73850,74177,74676,  0  gene_127  unk  unk  0,0,0,0,
585  FBTR0070277_R0  NWQH01000001  +       97964    102410  97964     102410  5          97964,98511,99458,101979,102221,  98438,98632,99720,102103,102410,  0  gene_1  unk  unk  0,0,1,2,0,
586  FBTR0070279_R0  NWQH01000001  +       151250   152567  151250    152567  1          151250,  152567,  0  gene_2  unk  unk  0,
586  FBTR0089815_R0  NWQH01000001  +       155217   177898  155217    177898  15         155217,155655,155777,155978,164199,165479,165974,167383,167896,168025,168440,168942,169472,169938,176824,  155323,155709,155920,156587,164290,165594,166073,167547,167962,168109,168503,169405,169655,170032,177898,  0  gene_3  unk  unk  0,1,1,0,0,1,2,2,1,1,1,1,2,2,0,
586  FBTR0070272_R0  NWQH01000001  -       157128   161334  157128    161334  5          157128,157884,158646,158853,160814,  157815,158106,158767,158947,161334,  0  gene_128  unk  unk  0,0,2,1,0,
586  FBTR0273400_R0  NWQH01000001  -       166058   166793  166058    166793  3          166058,166513,166777,  166458,166703,166793,  0  gene_129  unk  unk  2,1,0,
586  FBTR0070270_R0  NWQH01000001  -       181289   183211  181289    183211  4          181289,181745,182321,183199,  181682,182256,183151,183211,  0  gene_130  unk  unk  0,2,0,0,
586  FBTR0070283_R0  NWQH01000001  +       185489   185971  185489    185971  2          185489,185750,  185691,185971,  0  gene_4  unk  unk  0,1,

Note: all start coordinates in our database are 0-based, not 1-based. End coordinates are exclusive, so a feature spans the half-open interval [start, end) and its length is end - start.
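
To illustrate the coordinate convention, the following is a minimal Python sketch that parses one sample row and converts its exons to 1-based, inclusive coordinates (the convention used by formats such as GFF). It assumes the usual genePred layout, in which ends are exclusive and the exon lists carry a trailing comma. The names GenePred, parse_int_list, parse_row, and exons_one_based are hypothetical helpers introduced here, not part of the database or of any library.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class GenePred:
    name: str
    chrom: str
    strand: str
    tx_start: int            # 0-based start, as stored in the table
    tx_end: int              # assumed end-exclusive
    exon_starts: List[int]   # 0-based starts
    exon_ends: List[int]     # assumed end-exclusive


def parse_int_list(field: str) -> List[int]:
    """Split a comma-terminated list such as '185489,185750,' into ints."""
    return [int(x) for x in field.split(",") if x]


def parse_row(fields: List[str]) -> GenePred:
    """Build a GenePred from the 16 columns in table order (bin first)."""
    starts = parse_int_list(fields[9])
    ends = parse_int_list(fields[10])
    if not (len(starts) == len(ends) == int(fields[8])):
        raise ValueError("exonCount does not match the exon lists")
    return GenePred(
        name=fields[1], chrom=fields[2], strand=fields[3],
        tx_start=int(fields[4]), tx_end=int(fields[5]),
        exon_starts=starts, exon_ends=ends,
    )


def exons_one_based(gene: GenePred) -> List[Tuple[int, int]]:
    """Convert [start, end) exons to 1-based inclusive (start + 1, end)."""
    return [(s + 1, e) for s, e in zip(gene.exon_starts, gene.exon_ends)]


# Last sample row of the table (FBTR0070283_R0), one string per column.
row = ["586", "FBTR0070283_R0", "NWQH01000001", "+", "185489", "185971",
       "185489", "185971", "2", "185489,185750,", "185691,185971,",
       "0", "gene_4", "unk", "unk", "0,1,"]

gene = parse_row(row)
exon_lengths = [e - s for s, e in zip(gene.exon_starts, gene.exon_ends)]
print(exons_one_based(gene))  # [(185490, 185691), (185751, 185971)]
print(exon_lengths)           # [202, 221]
```

Only the start shifts by one during conversion; the exclusive end already equals the 1-based inclusive end, which is why lengths can be computed as end - start without an off-by-one correction.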

GeMoMa Genes (GeMoMa) Track Description
 

Description

D. melanogaster protein sequences from FlyBase were aligned against each scaffold in the D. hydei (DhydGB1) assembly and the predicted gene models were constructed using GeMoMa.

Methods

D. melanogaster protein sequences were aligned against the D. hydei (DhydGB1) genome assembly using NCBI TBLASTN with the following parameters:

  -evalue 1e-5
  -max_intron_length 100000
  -matrix BLOSUM80
  -gapopen 13
  -gapextend 2
  -soft_masking true
  -db_soft_mask 30
  -best_hit_overhang 0.1
  -best_hit_score_edge 0.1
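
As a rough sketch, the parameters above could be assembled into a TBLASTN command line as follows. The -query, -db, and -out options are standard BLAST+ arguments, but the file and database names used here (dmel_proteins.fasta, DhydGB1, tblastn_hits.txt) are placeholders, and the output format actually used for this track is not stated in this description. The script only builds and prints the command; it does not run it.

```python
import shlex

# Search parameters exactly as listed in the Methods section.
TBLASTN_PARAMS = {
    "-evalue": "1e-5",
    "-max_intron_length": "100000",
    "-matrix": "BLOSUM80",
    "-gapopen": "13",
    "-gapextend": "2",
    "-soft_masking": "true",
    "-db_soft_mask": "30",
    "-best_hit_overhang": "0.1",
    "-best_hit_score_edge": "0.1",
}


def build_tblastn_command(query_fasta, db_name, out_path):
    """Return the tblastn argument list; all three arguments are placeholders."""
    cmd = ["tblastn", "-query", query_fasta, "-db", db_name, "-out", out_path]
    for flag, value in TBLASTN_PARAMS.items():
        cmd += [flag, value]
    return cmd


# Hypothetical file and database names, for illustration only.
cmd = build_tblastn_command("dmel_proteins.fasta", "DhydGB1", "tblastn_hits.txt")
print(shlex.join(cmd))
```

Keeping the parameters in one dictionary makes it easy to compare a rerun against the values documented for this track.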

GeMoMa used the TBLASTN results to produce an initial set of gene predictions. These predictions were then filtered by the GeMoMa Annotation Filter (GAF) module to produce the final set of gene predictions.

References

Keilwagen J, Wenk M, Erickson JL, Schattat MH, Grau J, Hartung F. Using intron position conservation for homology-based gene prediction. Nucleic Acids Res. 2016 May 19;44(9):e89.