Schema for Augustus - Augustus Gene Predictions
  Database: DanaCAF1    Primary Table: augustus    Row Count: 12,881   Data last updated: 2022-10-20
fieldexampleSQL type info
bin 585smallint(5) unsigned range
name scaffold_0.g24.t1varchar(255) values
chrom scaffold_0varchar(255) values
strand +char(1) values
txStart 0int(10) unsigned range
txEnd 346int(10) unsigned range
cdsStart 0int(10) unsigned range
cdsEnd 346int(10) unsigned range
exonCount 2int(10) unsigned range
exonStarts 0,157,longblob  
exonEnds 96,346,longblob  
score 0int(11) range
name2 scaffold_0.g24varchar(255) values
cdsStartStat unkenum('none', 'unk', 'incmpl', 'cmpl') values
cdsEndStat unkenum('none', 'unk', 'incmpl', 'cmpl') values
exonFrames 0,0,longblob  

Sample Rows
 
binnamechromstrandtxStarttxEndcdsStartcdsEndexonCountexonStartsexonEndsscorename2cdsStartStatcdsEndStatexonFrames
585scaffold_0.g24.t1scaffold_0+0346034620,157,96,346,0scaffold_0.g24unkunk0,0,
585scaffold_10296.g78.t1scaffold_10296+265124626512462265,970,590,1246,0scaffold_10296.g78unkunk0,1,
585scaffold_10480.g97.t1scaffold_10480-803109280310922803,1083,926,1092,0scaffold_10480.g97unkunk0,0,
585scaffold_10492.g77.t1scaffold_10492+5851858518258,341,191,518,0scaffold_10492.g77unkunk2,0,
585scaffold_10550.g14.t1scaffold_10550+181100218110021181,1002,0scaffold_10550.g14unkunk0,
585scaffold_10575.g6.t1scaffold_10575-128105312810533128,401,726,345,664,1053,0scaffold_10575.g6unkunk2,0,0,
585scaffold_10575.g7.t1scaffold_10575-112649911126499121126,4766,1792,4991,0scaffold_10575.g7unkunk0,0,
585scaffold_11114.g27.t1scaffold_11114+0966096610,966,0scaffold_11114.g27unkunk0,
585scaffold_11116.g63.t1scaffold_11116+013750137510,1375,0scaffold_11116.g63unkunk2,
585scaffold_11119.g18.t1scaffold_11119+0499049910,499,0scaffold_11119.g18unkunk2,

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Augustus (augustus) Track Description
 

Description

This track shows the gene predictions from the Augustus program. The predictions are based on hints provided by the SPALN alignments of D. melanogaster proteins against the D. ananassae genome assembly, as well as splice junction predictions and assembled transcripts derived from D. ananassae RNA-Seq data.

References

Stanke M, Diekhans M, Baertsch R, Haussler D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008 Mar 1;24(5):637-44. doi: 10.1093/bioinformatics/btn013.

Iwata H, Gotoh O. Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features. Nucleic Acids Res. 2012 Nov 1;40(20):e161. doi: 10.1093/nar/gks708.

Haas BJ et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc. 2013 Aug;8(8):1494-512.

Kovaka S, Zimin AV, Pertea GM, Razaghi R, Salzberg SL, Pertea M. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019 Dec 16;20(1):278.