Schema for LTRharvest - LTR Fragments Identified by LTRharvest
|
|
Database: DobsGB1 Primary Table: ltrdigest_TSD Row Count: 854   Data last updated: 2022-10-20
field | example | SQL type | info |
bin | 589 | smallint(5) unsigned | range |
chrom | BDQP01000001 | varchar(255) | values |
chromStart | 527005 | int(10) unsigned | range |
chromEnd | 534836 | int(10) unsigned | range |
name | repeat_region1_TSD | varchar(255) | values |
score | 1000 | int(10) unsigned | range |
strand | - | char(1) | values |
thickStart | 527005 | int(10) unsigned | range |
thickEnd | 534836 | int(10) unsigned | range |
reserved | 0 | int(10) unsigned | range |
blockCount | 2 | int(10) unsigned | range |
blockSizes | 4,4 | longblob | |
chromStarts | 0,7827 | longblob | |
|
| |
|
|
Sample Rows
|
|
bin | chrom | chromStart | chromEnd | name | score | strand | thickStart | thickEnd | reserved | blockCount | blockSizes | chromStarts |
---|
589 | BDQP01000001 | 527005 | 534836 | repeat_region1_TSD | 1000 | - | 527005 | 534836 | 0 | 2 | 4,4 | 0,7827 |
597 | BDQP01000001 | 1613890 | 1623106 | repeat_region2_TSD | 1000 | + | 1613890 | 1623106 | 0 | 2 | 4,4 | 0,9212 |
592 | BDQP01000002 | 1022792 | 1024817 | repeat_region3_TSD | 1000 | - | 1022792 | 1024817 | 0 | 2 | 4,4 | 0,2021 |
585 | BDQP01000003 | 14336 | 22518 | repeat_region4_TSD | 1000 | - | 14336 | 22518 | 0 | 2 | 5,5 | 0,8177 |
585 | BDQP01000003 | 28103 | 41442 | repeat_region5_TSD | 1000 | + | 28103 | 41442 | 0 | 2 | 4,4 | 0,13335 |
585 | BDQP01000003 | 41592 | 47508 | repeat_region6_TSD | 1000 | - | 41592 | 47508 | 0 | 2 | 4,4 | 0,5912 |
606 | BDQP01000003 | 2786781 | 2793199 | repeat_region7_TSD | 1000 | + | 2786781 | 2793199 | 0 | 2 | 4,4 | 0,6414 |
595 | BDQP01000004 | 1319593 | 1325898 | repeat_region8_TSD | 1000 | - | 1319593 | 1325898 | 0 | 2 | 4,4 | 0,6301 |
602 | BDQP01000004 | 2301561 | 2303587 | repeat_region9_TSD | 1000 | - | 2301561 | 2303587 | 0 | 2 | 4,4 | 0,2022 |
605 | BDQP01000004 | 2688819 | 2696222 | repeat_region10_TSD | 1000 | - | 2688819 | 2696222 | 0 | 2 | 4,4 | 0,7399 |
|
Note: all start coordinates in our database are 0-based, not
1-based. See explanation
here.
| |
|
|
LTRharvest (ltrdigest) Track Description
|
|
Description
LTRharvest
is used to identify putative long terminal repeats (LTR)
retrotransposons in the D. obscura genome using the following parameters:
-seed 76 -minlenltr 116 -maxlenltr 800 -mindistltr 2280 -maxdistltr 8773 -similar 91 -xdrop 7
-mat 2 -mis -2 -ins -3 -del -3 -mintsd 4 -maxtsd 20 -vic 60 -overlaps best
Additional features within the LTR retrotransposons are annotated by
LTRdigest
with the following parameters:
-pptradius 30 -pptlen 8 30 -pptrprob 0.97 -uboxlen 3 30 -pptuprob 0.91 -pbsradius 30
-pbsalilen 11 30 -pbsoffset 0 5 -pbstrnaoffset 0 40 -pbsmaxedist 1 -pbsmatchscore 5
-pbsmismatchscore -10 -pbsinsertionscore -20 -pbsdeletionscore -20 -pdomevalcutoff 1e-6
-maxgaplen 50
The subset of candidates that show significant similarity to Pfam protein domains
within LTR retrotransposons are selected and then clustered using the ltrclustering program
with the following parameters:
-psmall 80 -plarge 30
References
-
Ellinghaus D, Kurtz S, and Willhoeft U.
LTRharvest,
an efficient and flexible software for de novo detection of LTR retrotransposons.
BMC Bioinformatics, 9:18, 2008.
-
Steinbiss S, Willhoeft U, Gremme G, and Kurtz S.
Fine-grained
annotation and classification of de novo predicted LTR retrotransposons.
Nucleic Acids Research, 37(21):7002-7013 (2009).
-
Steinbiss S, Kastens S, and Kurtz S.
LTRsift: a graphical user interface for semi-automatic classification and postprocessing of de novo detected LTR retrotransposons.
Mob DNA. 2012 Nov 7;3(1):18.
| |
|
|
|