Schema for Geneid Genes - Geneid Gene Predictions
  Database: DereCAF1    Primary Table: geneid    Row Count: 16,234   Data last updated: 2022-10-20
fieldexampleSQL type info
bin 585smallint(5) unsigned range
name mRNA_geneid-scaffold_1000_1varchar(255) values
chrom scaffold_1000varchar(255) values
strand +char(1) values
txStart 12int(10) unsigned range
txEnd 18int(10) unsigned range
cdsStart 12int(10) unsigned range
cdsEnd 18int(10) unsigned range
exonCount 1int(10) unsigned range
exonStarts 12,longblob  
exonEnds 18,longblob  
score 0int(11) range
name2 geneid-scaffold_1000_1varchar(255) values
cdsStartStat unkenum('none', 'unk', 'incmpl', 'cmpl') values
cdsEndStat unkenum('none', 'unk', 'incmpl', 'cmpl') values
exonFrames 0,longblob  

Sample Rows
 
binnamechromstrandtxStarttxEndcdsStartcdsEndexonCountexonStartsexonEndsscorename2cdsStartStatcdsEndStatexonFrames
585mRNA_geneid-scaffold_1000_1scaffold_1000+12181218112,18,0geneid-scaffold_1000_1unkunk0,
585mRNA_geneid-scaffold_1007_1scaffold_1007-8108168108161810,816,0geneid-scaffold_1007_1unkunk0,
585mRNA_geneid-scaffold_1009_1scaffold_1009-2372522372521237,252,0geneid-scaffold_1009_1unkunk0,
585mRNA_geneid-scaffold_1010_1scaffold_1010+197519871975198711975,1987,0geneid-scaffold_1010_1unkunk0,
585mRNA_geneid-scaffold_1012_1scaffold_1012+133813411338134111338,1341,0geneid-scaffold_1012_1unkunk0,
585mRNA_geneid-scaffold_102_1scaffold_102-237723832377238312377,2383,0geneid-scaffold_102_1unkunk0,
585mRNA_geneid-scaffold_1023_1scaffold_1023+126512681265126811265,1268,0geneid-scaffold_1023_1unkunk0,
585mRNA_geneid-scaffold_1029_1scaffold_1029-24512451124,51,0geneid-scaffold_1029_1unkunk0,
585mRNA_geneid-scaffold_103_1scaffold_103-173317451733174511733,1745,0geneid-scaffold_103_1unkunk0,
585mRNA_geneid-scaffold_1035_1scaffold_1035+102710571027105711027,1057,0geneid-scaffold_1035_1unkunk0,

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Geneid Genes (geneid) Track Description
 

Description

This track shows gene predictions from the geneid program developed by Roderic Guigó's Computational Biology of RNA Processing group which is part of the Centre de Regulació Genòmica (CRG) in Barcelona, Catalunya, Spain.

Methods

Geneid is a program to predict genes in anonymous genomic sequences designed with a hierarchical structure. In the first step, splice sites, start and stop codons are predicted and scored along the sequence using Position Weight Arrays (PWAs). Next, exons are built from the sites. Exons are scored as the sum of the scores of the defining sites, plus the the log-likelihood ratio of a Markov Model for coding DNA. Finally, from the set of predicted exons, the gene structure is assembled, maximizing the sum of the scores of the assembled exons.

Credits

Thanks to Computational Biology of RNA Processing for providing these data.

References

Blanco E, Parra G, Guigó R. Using geneid to identify genes. Curr Protoc Bioinformatics. 2007 Jun;Chapter 4:Unit 4.3. PMID: 18428791

Parra G, Blanco E, Guigó R. GeneID in Drosophila. Genome Res. 2000 Apr;10(4):511-5. PMID: 10779490; PMC: PMC310871