Schema for D. erecta (DereRS2) Net - D. erecta (Oct. 2018 (The University of Chicago/DereRS2)) Alignment Net

Database: DereCAF1
Primary Table: netDereRS2
Row Count: 6,678
Data last updated: 2022-10-20

field	example	SQL type	info
`bin`	585	`smallint(5) unsigned`	range
`level`	1	`int(10) unsigned`	range
`tName`	scaffold_5035	`varchar(255)`	values
`tStart`	0	`int(10) unsigned`	range
`tEnd`	338	`int(10) unsigned`	range
`strand`	-	`char(1)`	values
`qName`	QMER02000005	`varchar(255)`	values
`qStart`	3776325	`int(10) unsigned`	range
`qEnd`	3776663	`int(10) unsigned`	range
`chainId`	293159	`int(10) unsigned`	range
`ali`	338	`int(10) unsigned`	range
`score`	29642	`double`	range
`qOver`	-1	`int(11)`	range
`qFar`	-1	`int(11)`	range
`qDup`	338	`int(11)`	range
`type`	top	`varchar(255)`	values
`tN`	0	`int(11)`	range
`qN`	0	`int(11)`	range
`tR`	0	`int(11)`	range
`qR`	0	`int(11)`	range
`tNewR`	-1	`int(11)`	range
`qNewR`	-1	`int(11)`	range
`tOldR`	-1	`int(11)`	range
`qOldR`	-1	`int(11)`	range
`tTrf`	0	`int(11)`	range
`qTrf`	175	`int(11)`	range
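
The `bin` field is an index that speeds up range queries. The following is a minimal sketch of how a bin number could be derived from a 0-based, half-open interval. It assumes the standard UCSC binning scheme (five levels, with the finest bins covering 128 kb and each coarser level covering eight times more). The function name `bin_from_range` is illustrative and not part of this schema.

```python
# Assumption: standard UCSC binning scheme (5 levels, finest bin = 128 kb).
BIN_OFFSETS = [512 + 64 + 8 + 1, 64 + 8 + 1, 8 + 1, 1, 0]
BIN_FIRST_SHIFT = 17  # 2**17 bases = 128 kb per finest-level bin
BIN_NEXT_SHIFT = 3    # each coarser level covers 8x more sequence


def bin_from_range(start: int, end: int) -> int:
    """Return the smallest bin that fully contains the 0-based, half-open [start, end)."""
    start_bin = start >> BIN_FIRST_SHIFT
    end_bin = (end - 1) >> BIN_FIRST_SHIFT
    for offset in BIN_OFFSETS:
        if start_bin == end_bin:
            return offset + start_bin
        start_bin >>= BIN_NEXT_SHIFT
        end_bin >>= BIN_NEXT_SHIFT
    raise ValueError(f"range {start}-{end} is out of bounds for the standard scheme")


# The example row (tStart=0, tEnd=338) falls in the first finest-level bin.
print(bin_from_range(0, 338))  # 585
```

Under this assumption, a range query only needs to inspect the few bins that can overlap the requested interval instead of scanning every row.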

Sample Rows

bin	level	tName	tStart	tEnd	strand	qName	qStart	qEnd	chainId	ali	score	qOver	qFar	qDup	type	tR	qR	tNewR	qNewR	tOldR	qOldR	tTrf	qTrf
585	1	scaffold_5035	0	338	-	QMER02000005	3776325	3776663	293159	338	29642	-1	-1	338	top	0	0	-1	-1	-1	-1	0	175
585	1	scaffold_264	5	582	-	QMER02000016	3131887	3132465	161143	577	54029	-1	-1	578	top	384	422	-1	-1	-1	-1	0	0
585	1	scaffold_950	0	707	+	QMER02000001	7889887	7890594	127911	707	66749	-1	-1	707	top	0	0	-1	-1	-1	-1	0	0
585	1	scaffold_4958	0	718	+	QMER02000007	2188603	2189321	125284	718	67966	-1	-1	718	top	97	107	-1	-1	-1	-1	0	1
585	1	scaffold_4956	0	728	+	QMER02000007	4146095	4146822	125901	727	67685	-1	-1	727	top	413	408	-1	-1	-1	-1	411	408
585	1	scaffold_5032	353	728	+	QMER02000005	3633005	3633383	347864	375	23784	-1	-1	378	top	0	0	-1	-1	-1	-1	0	0
585	1	scaffold_4852	0	753	-	QMER02000042	52394	53145	124482	751	68326	-1	-1	0	top	309	602	-1	-1	-1	-1	0	0
585	1	scaffold_4932	0	770	+	QMER02000005	2118132	2118902	120328	770	70342	-1	-1	770	top	0	0	-1	-1	-1	-1	770	770
585	1	scaffold_302	0	188	+	QMER02000001	19252011	19252197	485356	186	14586	-1	-1	186	top	68	0	-1	-1	-1	-1	4	0
585	1	scaffold_561	0	852	-	QMER02000028	726103	726955	101766	852	80763	-1	-1	852	top	283	732	-1	-1	-1	-1	0	0
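
The rows above are tab-separated, so they can be loaded with a generic TSV reader. The sketch below is one possible way to do that, not an official loader. It takes the column names from the header line rather than from the schema table, because the Sample Rows listing shows 24 columns (no `tN` or `qN`). The helper name `parse_net_rows` and the choice of which fields stay as text are assumptions based on the SQL types listed in the schema.

```python
import csv
import io

# Header and two rows copied from the Sample Rows listing; written with spaces
# here for readability and converted back to tabs below.
_LINES = [
    "bin level tName tStart tEnd strand qName qStart qEnd chainId ali score "
    "qOver qFar qDup type tR qR tNewR qNewR tOldR qOldR tTrf qTrf",
    "585 1 scaffold_5035 0 338 - QMER02000005 3776325 3776663 293159 338 29642 "
    "-1 -1 338 top 0 0 -1 -1 -1 -1 0 175",
    "585 1 scaffold_264 5 582 - QMER02000016 3131887 3132465 161143 577 54029 "
    "-1 -1 578 top 384 422 -1 -1 -1 -1 0 0",
]
SAMPLE_TSV = "\n".join("\t".join(line.split()) for line in _LINES)

# Fields with char/varchar SQL types stay as strings; `score` is a double.
TEXT_FIELDS = {"tName", "strand", "qName", "type"}


def parse_net_rows(tsv_text):
    """Parse tab-separated net rows (with a header line) into typed dicts."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    rows = []
    for raw in reader:
        row = {}
        for name, value in raw.items():
            if name in TEXT_FIELDS:
                row[name] = value
            elif name == "score":
                row[name] = float(value)
            else:
                row[name] = int(value)
        rows.append(row)
    return rows


for row in parse_net_rows(SAMPLE_TSV):
    print(row["tName"], row["tEnd"] - row["tStart"], row["type"])
```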

Note: all start coordinates in our database are 0-based, not 1-based.
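
To illustrate the coordinate note, the sketch below converts a stored interval to the 1-based form commonly used for display. It assumes the usual UCSC convention that stored intervals are half-open (start inclusive, end exclusive). The function names are illustrative only.

```python
from typing import Tuple


def to_one_based(start: int, end: int) -> Tuple[int, int]:
    """Convert a 0-based, half-open interval to a 1-based, fully closed one."""
    return start + 1, end


def interval_length(start: int, end: int) -> int:
    """Length of a 0-based, half-open interval."""
    return end - start


def format_position(name: str, start: int, end: int) -> str:
    """Render a stored interval as a 1-based position string."""
    one_start, one_end = to_one_based(start, end)
    return f"{name}:{one_start}-{one_end}"


# First sample row: tName=scaffold_5035, tStart=0, tEnd=338
print(format_position("scaffold_5035", 0, 338))  # scaffold_5035:1-338
print(interval_length(0, 338))                   # 338
```

Under this convention only the start shifts by one, and the interval length is simply `tEnd - tStart`, which matches the `ali` value of 338 in the first sample row.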

D. erecta (DereRS2) Net (netDereRS2) Track Description


Description

This track shows the best D. erecta/D. erecta chain for every part of the D. erecta genome. It is useful for finding orthologous regions and for studying genome rearrangement. The D. erecta sequence used in this annotation is from the Oct. 2018 (The University of Chicago/DereRS2) (DereRS2) assembly.

Display Conventions and Configuration

In full display mode, the top-level (level 1) chains are the largest, highest-scoring chains that span this region. In many cases gaps exist in the top-level chain. When possible, these are filled in by other chains that are displayed at level 2. The gaps in level 2 chains may be filled by level 3 chains and so forth.

In the graphical display, the boxes represent ungapped alignments; the lines represent gaps. Click on a box to view detailed information about the chain as a whole; click on a line to display information about the gap. The detailed information is useful in determining the cause of the gap or, for lower level chains, the genomic rearrangement.

Individual items in the display are categorized as one of four types (other than gap):

Top - the best, longest match. Displayed on level 1.
Syn - line-ups on the same chromosome as the gap in the level above it.
Inv - a line-up on the same chromosome as the gap above it, but in the opposite orientation.
NonSyn - a match to a chromosome different from the gap in the level above.

Methods

Chains were derived from blastz alignments, using the methods described on the chain tracks description pages, and sorted with the highest-scoring chains in the genome ranked first. The program chainNet was then used to place the chains one at a time, trimming them as necessary to fit into sections not already covered by a higher-scoring chain. During this process, a natural hierarchy emerged in which a chain that filled a gap in a higher-scoring chain was placed underneath that chain. The program netSyntenic was used to fill in information about the relationship between higher- and lower-level chains, such as whether a lower-level chain was syntenic or inverted relative to the higher-level chain. The program netClass was then used to fill in how much of the gaps and chains contained Ns (sequencing gaps) in one or both species and how much was filled with transposons inserted before and after the two organisms diverged.

Credits

The chainNet, netSyntenic, and netClass programs were developed at the University of California Santa Cruz by Jim Kent.

Blastz was developed at Pennsylvania State University by Minmei Hou, Scott Schwartz, Zheng Zhang, and Webb Miller with advice from Ross Hardison.

Lineage-specific repeats were identified by Arian Smit and his program RepeatMasker.

The browser display and database storage of the nets were made by Robert Baertsch and Jim Kent.

References

Kent, W.J., Baertsch, R., Hinrichs, A., Miller, W., and Haussler, D. Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes. Proc Natl Acad Sci USA 100(20), 11484-11489 (2003).

Schwartz, S., Kent, W.J., Smit, A., Zhang, Z., Baertsch, R., Hardison, R., Haussler, D., and Miller, W. Human-Mouse Alignments with BLASTZ. Genome Res. 13(1), 103-7 (2003).
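
The placement step described under Methods (rank chains by score, then trim each one to the sections not already covered) can be illustrated with a heavily simplified sketch. This is not the chainNet implementation: it works on the target sequence only, ignores the query side and net levels, and represents each chain as a hypothetical tuple of `(chain_id, score, blocks)`, where `blocks` are 0-based, half-open ungapped intervals.

```python
from typing import Dict, List, Tuple

Interval = Tuple[int, int]  # 0-based, half-open (start, end)
Chain = Tuple[str, float, List[Interval]]  # hypothetical (chain_id, score, blocks)


def subtract(piece: Interval, covered: List[Interval]) -> List[Interval]:
    """Return the parts of `piece` not overlapped by any interval in `covered`."""
    remaining = [piece]
    for c_start, c_end in covered:
        next_remaining = []
        for start, end in remaining:
            if c_end <= start or c_start >= end:  # no overlap, keep as-is
                next_remaining.append((start, end))
                continue
            if start < c_start:  # part left of the covered interval
                next_remaining.append((start, c_start))
            if c_end < end:      # part right of the covered interval
                next_remaining.append((c_end, end))
        remaining = next_remaining
    return remaining


def place_chains(chains: List[Chain]) -> Dict[str, List[Interval]]:
    """Place chains from highest to lowest score, trimming to uncovered sequence."""
    covered: List[Interval] = []
    placed: Dict[str, List[Interval]] = {}
    for chain_id, _score, blocks in sorted(chains, key=lambda c: c[1], reverse=True):
        pieces: List[Interval] = []
        for block in blocks:
            pieces.extend(subtract(block, covered))
        if pieces:
            placed[chain_id] = pieces
            covered.extend(pieces)
    return placed


# Chain A has a gap at 40-60; the lower-scoring chain B is trimmed to fill it.
chains = [
    ("B", 500.0, [(30, 70)]),
    ("A", 900.0, [(0, 40), (60, 100)]),
]
print(place_chains(chains))  # {'A': [(0, 40), (60, 100)], 'B': [(40, 60)]}
```

In this toy example, chain B ends up occupying only the gap in chain A, which mirrors how a lower-level chain fills a gap in a higher-scoring chain in the net.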
