Schema for Geneid Genes - Geneid Gene Predictions
  Database: DyakCAF1    Primary Table: geneid    Row Count: 18,828   Data last updated: 2022-10-20
fieldexampleSQL type info
bin 585smallint(5) unsigned range
name mRNA_geneid-chr2L_1varchar(255) values
chrom chr2Lvarchar(255) values
strand -char(1) values
txStart 4506int(10) unsigned range
txEnd 11258int(10) unsigned range
cdsStart 4506int(10) unsigned range
cdsEnd 11258int(10) unsigned range
exonCount 5int(10) unsigned range
exonStarts 4506,9279,9784,10693,10856,longblob  
exonEnds 4514,9722,10349,10799,11258,longblob  
score 0int(11) range
name2 geneid-chr2L_1varchar(255) values
cdsStartStat unkenum('none', 'unk', 'incmpl', 'cmpl') values
cdsEndStat unkenum('none', 'unk', 'incmpl', 'cmpl') values
exonFrames 1,2,1,0,0,longblob  

Sample Rows
 
binnamechromstrandtxStarttxEndcdsStartcdsEndexonCountexonStartsexonEndsscorename2cdsStartStatcdsEndStatexonFrames
585mRNA_geneid-chr2L_1chr2L-45061125845061125854506,9279,9784,10693,10856,4514,9722,10349,10799,11258,0geneid-chr2L_1unkunk1,2,1,0,0,
585mRNA_geneid-chr2L_2chr2L-166892877216689287721216689,17541,17769,18264,21194,21490,22503,23238,24943,27559,28097,28211,17214,17711,17865,18649,21393,21928,22700,23323,25710,27815,28144,28772,0geneid-chr2L_2unkunk0,1,1,0,2,2,0,2,0,2,0,0,
585mRNA_geneid-chr2L_3chr2L-29805477052980547705429805,32040,32502,47635,31795,32237,33059,47705,0geneid-chr2L_3unkunk2,0,1,0,
585mRNA_geneid-chr2L_4chr2L+56139593895613959389356139,56602,59103,56277,59046,59389,0geneid-chr2L_4unkunk0,0,2,
585mRNA_geneid-chr2L_5chr2L+61801673776180167377561801,62894,64196,64380,67131,62173,63086,64312,64669,67377,0geneid-chr2L_5unkunk0,0,0,2,0,
585mRNA_geneid-chr2L_6chr2L-72463807457246380745272463,80732,73485,80745,0geneid-chr2L_6unkunk1,0,
585mRNA_geneid-chr2L_7chr2L+84208933058420893305884208,84788,88114,90594,90940,91893,92369,93246,84249,85732,88338,90724,90965,92246,92798,93305,0geneid-chr2L_7unkunk0,2,1,0,1,2,1,1,
585mRNA_geneid-chr2L_8chr2L-93839942689383994268193839,94268,0geneid-chr2L_8unkunk0,
585mRNA_geneid-chr2L_9chr2L-95302960169530296016195302,96016,0geneid-chr2L_9unkunk0,
585mRNA_geneid-chr2L_10chr2L+9806310272998063102729698063,99782,99927,100177,101876,102620,98187,99860,100050,100347,102556,102729,0geneid-chr2L_10unkunk0,1,1,1,0,2,

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Geneid Genes (geneid) Track Description
 

Description

This track shows gene predictions from the geneid program developed by Roderic Guigó's Computational Biology of RNA Processing group which is part of the Centre de Regulació Genòmica (CRG) in Barcelona, Catalunya, Spain.

Methods

Geneid is a program to predict genes in anonymous genomic sequences designed with a hierarchical structure. In the first step, splice sites, start and stop codons are predicted and scored along the sequence using Position Weight Arrays (PWAs). Next, exons are built from the sites. Exons are scored as the sum of the scores of the defining sites, plus the the log-likelihood ratio of a Markov Model for coding DNA. Finally, from the set of predicted exons, the gene structure is assembled, maximizing the sum of the scores of the assembled exons.

Credits

Thanks to Computational Biology of RNA Processing for providing these data.

References

Blanco E, Parra G, Guigó R. Using geneid to identify genes. Curr Protoc Bioinformatics. 2007 Jun;Chapter 4:Unit 4.3. PMID: 18428791

Parra G, Blanco E, Guigó R. GeneID in Drosophila. Genome Res. 2000 Apr;10(4):511-5. PMID: 10779490; PMC: PMC310871