Schema for genBlastG Genes - genBlastG Gene Predictions
Database: DereCAF1; Primary Table: genblastg; Row Count: 31,975; Data last updated: 2024-08-24
field | example | SQL type | info |
---|---|---|---|
bin | 585 | smallint(5) unsigned | range |
name | gbg_His4:CG33869-PA-R1-5-A1_0 | varchar(255) | values |
chrom | scaffold_1161 | varchar(255) | values |
strand | + | char(1) | values |
txStart | 63 | int(10) unsigned | range |
txEnd | 375 | int(10) unsigned | range |
cdsStart | 63 | int(10) unsigned | range |
cdsEnd | 375 | int(10) unsigned | range |
exonCount | 1 | int(10) unsigned | range |
exonStarts | 63, | longblob | |
exonEnds | 375, | longblob | |
Sample Rows
bin | name | chrom | strand | txStart | txEnd | cdsStart | cdsEnd | exonCount | exonStarts | exonEnds |
---|---|---|---|---|---|---|---|---|---|---|
585 | gbg_His4:CG33869-PA-R1-5-A1_0 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
585 | gbg_His4:CG33909-PA-R1-5-A1_1 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
585 | gbg_His4:CG33877-PA-R1-5-A1_2 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
585 | gbg_His4:CG33905-PA-R1-5-A1_3 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
585 | gbg_His4r-PD-R1-5-A1_4 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
585 | gbg_His4r-PB-R1-5-A1_5 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
585 | gbg_His4:CG33897-PA-R1-5-A1_6 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
585 | gbg_His4:CG33881-PA-R1-5-A1_7 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
585 | gbg_His4:CG33879-PA-R1-5-A1_8 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
585 | gbg_His4:CG33907-PA-R1-5-A1_9 | scaffold_1161 | + | 63 | 375 | 63 | 375 | 1 | 63, | 375, |
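Every sample row carries bin 585. The sketch below shows how such a value can be derived from txStart and txEnd, assuming this table uses the standard UCSC binning scheme (five levels, with the smallest bins spanning 128 kb). The function name `bin_from_range` is chosen for illustration, and the scheme in this form only covers coordinates below 512 Mb.

```python
# Offsets of the first bin at each level, from the finest (128 kb) to the coarsest (512 Mb).
BIN_OFFSETS = [512 + 64 + 8 + 1, 64 + 8 + 1, 8 + 1, 1, 0]
FIRST_SHIFT = 17  # 2**17 = 128 kb, the size of the smallest bin
NEXT_SHIFT = 3    # each coarser level is 8 times larger


def bin_from_range(start: int, end: int) -> int:
    """Return the smallest bin that fully contains the 0-based, end-exclusive range."""
    start_bin = start >> FIRST_SHIFT
    end_bin = (end - 1) >> FIRST_SHIFT
    for offset in BIN_OFFSETS:
        if start_bin == end_bin:
            return offset + start_bin
        start_bin >>= NEXT_SHIFT
        end_bin >>= NEXT_SHIFT
    raise ValueError(f"range {start}-{end} is out of bounds for the standard scheme")


# Coordinates from the sample rows: txStart=63, txEnd=375.
print(bin_from_range(63, 375))
```

Under this assumption, any feature that lies entirely within the first 128 kb of a scaffold lands in bin 585, which matches the sample rows.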
Note: all start coordinates in our database are 0-based, not 1-based.
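As a small illustration of the 0-based convention, the sketch below converts a stored interval to the 1-based, fully closed form used by many other tools. It assumes the usual UCSC table convention that end coordinates are exclusive; the helper names are hypothetical.

```python
from typing import Tuple


def to_one_based(start: int, end: int) -> Tuple[int, int]:
    """Convert a 0-based, end-exclusive interval to a 1-based, inclusive one."""
    # Only the start shifts; the exclusive end equals the inclusive 1-based end.
    return start + 1, end


def span_length(start: int, end: int) -> int:
    """Length in bases of a 0-based, end-exclusive interval."""
    return end - start


# Coordinates from the sample rows: txStart=63, txEnd=375.
print(to_one_based(63, 375))
print(span_length(63, 375))
```

With the sample coordinates, the interval 63 to 375 covers 312 bases, which corresponds to positions 64 through 375 in 1-based numbering.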
genBlastG Genes (genblastg) Track Description
Description
D. melanogaster proteins from FlyBase were aligned against each
scaffold in the D. erecta (DereCAF1) assembly, and gene models were constructed
from the alignments using genBlastG.
Methods
D. melanogaster protein sequences were aligned against the D. erecta (DereCAF1) genome
assembly using WU-TBLASTN with the following parameters:
-B=1000 -V=1000 -hspmax=5000
-hspsepSmax=50000 -hspsepQmax=1000
-altscore='any * -999' -nogaps -hitdist=40
-matrix=BLOSUM80 -Q=12 -R=2 -e 1e-10
-wordmask=seg+xnu -topComboN=1 -links
-notes -warnings -novalidctxok
The WU-TBLASTN results were then analyzed by genBlastG using the following parameters:
-P wublast -r 1 -i 30 -pro -gff -e 1e-10
References
She R, Chu JS, Uyar B, Wang J, Wang K, Chen N.
genBlastG: using BLAST searches to build homologous gene models.
Bioinformatics. 2011 Aug 1;27(15):2141-3.