Schema for Augustus Genes (BUSCO) - Augustus Gene Predictions with BUSCO Training Set
  Database: DsimGB2    Primary Table: augustusBUSCO    Row Count: 13,290   Data last updated: 2022-10-20
fieldexampleSQL type info
bin 585smallint(5) unsigned range
name CM002910.g1.t1varchar(255) values
chrom CM002910varchar(255) values
strand -char(1) values
txStart 11595int(10) unsigned range
txEnd 32311int(10) unsigned range
cdsStart 11595int(10) unsigned range
cdsEnd 32311int(10) unsigned range
exonCount 13int(10) unsigned range
exonStarts 11595,13505,14010,14238,159...longblob  
exonEnds 11603,13683,14180,14334,159...longblob  
score 0int(11) range
name2 CM002910.g1varchar(255) values
cdsStartStat unkenum('none', 'unk', 'incmpl', 'cmpl') values
cdsEndStat unkenum('none', 'unk', 'incmpl', 'cmpl') values
exonFrames 1,0,1,1,2,2,1,0,1,0,0,1,0,longblob  

Sample Rows
 
binnamechromstrandtxStarttxEndcdsStartcdsEndexonCountexonStartsexonEndsscorename2cdsStartStatcdsEndStatexonFrames
585CM002910.g1.t1CM002910-115953231111595323111311595,13505,14010,14238,15939,18260,19067,22919,25519,26233,27524,30754,32295,11603,13683,14180,14334,15953,18698,19293,23106,25770,26660,29318,31311,32311,0CM002910.g1unkunk1,0,1,1,2,2,1,0,1,0,0,1,0,
585CM002910.g2.t1CM002910+56097596995609759699356097,58364,59606,56360,58740,59699,0CM002910.g2unkunk0,2,0,
585CM002910.g3.t1CM002910+87202898668720289866587202,87659,88692,89109,89618,87256,87681,89010,89538,89866,0CM002910.g3unkunk0,0,1,1,1,
585CM002910.g4.t1CM002910-90558927319055892731690558,91090,91360,91476,91596,92192,91033,91123,91422,91541,92000,92731,0CM002910.g4unkunk2,2,0,1,2,0,
585CM002910.g5.t1CM002910+94798996359479899635694798,96473,96808,97058,98782,99526,94922,96551,96931,97228,99462,99635,0CM002910.g5unkunk0,1,1,1,0,2,
585CM002910.g6.t1CM002910-1017451045001017451045004101745,103577,103802,104372,102140,103746,104316,104500,0CM002910.g6unkunk1,0,2,0,
585CM002910.g7.t1CM002910-1087301146931087301146937108730,109307,109469,109700,110714,111754,114509,109114,109408,109638,110534,110904,111872,114693,0CM002910.g7unkunk0,1,0,0,2,1,0,
585CM002910.g8.t1CM002910+1182301194301182301194301118230,119430,0CM002910.g8unkunk0,
585CM002910.g9.t1CM002910-1242381249571242381249573124238,124360,124876,124244,124819,124957,0CM002910.g9unkunk0,0,0,
585CM002910.g10.t1CM002910-1250921281121250921281126125092,125366,125739,126882,127020,127295,125309,125580,126495,126965,127234,128112,0CM002910.g10unkunk2,1,1,2,1,0,

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Augustus Genes (BUSCO) (augustusBUSCO) Track Description
 

Description

This track shows the gene models from the Augustus gene predictor using the species-specific parameters estimated from a training set produced by BUSCO.

Methods

The training set used to estimate the species-specific parameters for the Augustus analysis was produced by BUSCO using the arthropoda_odb9 dataset.

Transposons remnants identified by RepeatMasker were provided as hints to the Augustus gene predictor. The following additional parameters were used with the Augustus gene predictor:

  --strand=both
  --noInFrameStop=true
  --gff3=on
  --uniqueGeneId=true
  --protein=off
  --codingseq=off
  --introns=off
  --stop=off
  --cds=on
  --singlestrand=false
  --UTR=off
  --genemodel=partial

References

Stanke M, Waack S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics. 2003 Oct;19 Suppl 2:ii215-25.

Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015 Oct 1;31(19):3210-2.