Schema for LTRharvest - LTR Fragments Identified by LTRharvest
  Database: DhydGB1    Primary Table: ltrdigest_protein    Row Count: 1,411   Data last updated: 2022-10-20
fieldexampleSQL type info
bin 598smallint(5) unsigned range
chrom NWQH01000001varchar(255) values
chromStart 1768272int(10) unsigned range
chromEnd 1768408int(10) unsigned range
name GAGPOL_ty3gypsy_BMC_Evoluti...varchar(255) values
score 0int(10) unsigned range
strand -char(1) values
thickStart 0int(10) unsigned range

Sample Rows
 
binchromchromStartchromEndnamescorestrandthickStart
598NWQH0100000117682721768408GAGPOL_ty3gypsy_BMC_Evolutionary_Biology_8_276_20080-0
598NWQH0100000117682721768408GAGPOL_ty3gypsy_BMC_Evolutionary_Biology_8_276_20080-0
598NWQH0100000117682721768408GAGPOL_ty3gypsy_BMC_Evolutionary_Biology_8_276_20080-0
598NWQH0100000117684051768493POL_ty3gypsy_Biology_Direct_4_41_20090-0
585NWQH0100000412231398RVT_10-0
585NWQH0100000417461912Exo_endo_phos_20-0
585NWQH0100000418732024Exo_endo_phos_20-0
585NWQH010000131933320006ENV_errantiviridae0-0
585NWQH010000131933320012Gypsy0-0
585NWQH010000132002920939GAGPOL_ty3gypsy_BMC_Evolutionary_Biology_8_276_20080-0

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

LTRharvest (ltrdigest) Track Description
 

Description

LTRharvest is used to identify putative long terminal repeats (LTR) retrotransposons in the D. hydei genome using the following parameters:

-seed 76 -minlenltr 116 -maxlenltr 800 -mindistltr 2280 -maxdistltr 8773 -similar 91 -xdrop 7 -mat 2 -mis -2 -ins -3 -del -3 -mintsd 4 -maxtsd 20 -vic 60 -overlaps best

Additional features within the LTR retrotransposons are annotated by LTRdigest with the following parameters:

-pptradius 30 -pptlen 8 30 -pptrprob 0.97 -uboxlen 3 30 -pptuprob 0.91 -pbsradius 30 -pbsalilen 11 30 -pbsoffset 0 5 -pbstrnaoffset 0 40 -pbsmaxedist 1 -pbsmatchscore 5 -pbsmismatchscore -10 -pbsinsertionscore -20 -pbsdeletionscore -20 -pdomevalcutoff 1e-6 -maxgaplen 50

The subset of candidates that show significant similarity to Pfam protein domains within LTR retrotransposons are selected and then clustered using the ltrclustering program with the following parameters:

-psmall 80 -plarge 30

References