Schema for (dm3) D. mel. Chain - D. melanogaster (Apr. 2006) Chained Alignments
  Database: DanaWGS    Primary Table: chainDm3    Row Count: 26,350   Data last updated: 2023-08-05
fieldexampleSQL type info
bin 585smallint(5) unsigned range
score 18763double range
tName scaffold_0varchar(255) values
tSize 1069int(10) unsigned range
tStart 0int(10) unsigned range
tEnd 342int(10) unsigned range
qName chr3Rvarchar(255) values
qSize 27905053int(10) unsigned range
qStrand -char(1) values
qStart 9033474int(10) unsigned range
qEnd 9033813int(10) unsigned range
id 9370int(10) unsigned range

Sample Rows
 
binscoretNametSizetStarttEndqNameqSizeqStrandqStartqEndid
58518763scaffold_010690342chr3R27905053-903347490338139370
5858297scaffold_10005160111691307chr2RHet3288761-1313901131404623343
5859361scaffold_100162071337686chr2LHet368872-23417523451820183
58533154scaffold_100281470680chrUextra29004656+9186659194064591
5859235scaffold_100281470195chrUextra29004656+54030754043320510
58538704scaffold_100281470705chr2R21146708+5899845907873589
58516041scaffold_100281470309chr3RHet2517507-1882413188265111263
58533074scaffold_100281470679chr3L24543557+24429916244306574609
58530890scaffold_100281470537chr3L24543557-277834327788205053
58526425scaffold_100281470702chr2R21146708+237541823761096185

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

(dm3) D. mel. Chain (chainDm3) Track Description
 

Description

This track shows D. melanogaster/D. ananassae genomic alignments using a gap scoring system that allows longer gaps than traditional affine gap scoring systems. It can also tolerate gaps in both D. melanogaster and D. ananassae simultaneously. These "double-sided" gaps can be caused by local inversions and overlapping deletions in both species. The D. melanogaster sequence is from the Apr. 2006 (dm3) assembly.

The chain track displays boxes joined together by either single or double lines. The boxes represent aligning regions. Single lines indicate gaps that are largely due to a deletion in the D. melanogaster assembly or an insertion in the D. ananassae assembly. Double lines represent more complex gaps that involve substantial sequence in both species. This may result from inversions, overlapping deletions, an abundance of local mutation, or an unsequenced gap in one species. In cases where there are multiple chains over a particular portion of the D. ananassae genome, chains with single-lined gaps are often due to processed pseudogenes, while chains with double-lined gaps are more often due to paralogs and unprocessed pseudogenes. In the "pack" and "full" display modes, the individual feature names indicate the chromosome, strand, and location (in thousands) of the match for each matching alignment.

Display Conventions and Configuration

By default, the chains to chromosome-based assemblies are colored based on which chromosome they map to in the aligning organism. To turn off the coloring, check the "off" button next to: Color track based on chromosome.

To display only the chains of one chromosome in the aligning organism, enter the name of that chromosome (e.g. chr4) in box next to: Filter by chromosome.

Methods

Transposons that have been inserted since the D. melanogaster/D. ananassae split were removed, and the resulting abbreviated genomes were aligned with blastz. The transposons were then put back into the alignments. The resulting alignments were converted into axt format and the resulting axts fed into axtChain. AxtChain organizes all the alignments between a single D. melanogaster and a single D. ananassae chromosome into a group and makes a kd-tree out of all the gapless subsections (blocks) of the alignments. Next, maximally scoring chains of these blocks were found by running a dynamic program over the kd-tree. Chains scoring below a threshold were discarded; the remaining chains are displayed here.

Credits

Blastz was developed at Pennsylvania State University by Minmei Hou, Scott Schwartz, Zheng Zhang, and Webb Miller with advice from Ross Hardison.

Lineage-specific repeats were identified by Arian Smit and his program RepeatMasker.

The axtChain program was developed at the University of California at Santa Cruz by Jim Kent with advice from Webb Miller and David Haussler.

The browser display and database storage of the chains were generated by Robert Baertsch and Jim Kent.

References

Chiaromonte, F., Yap, V.B., Miller, W. Scoring pairwise genomic sequence alignments. Pac Symp Biocomput 2002, 115-26 (2002).

Kent, W.J., Baertsch, R., Hinrichs, A., Miller, W., and Haussler, D. Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes. Proc Natl Acad Sci USA 100(20), 11484-11489 (2003).

Schwartz, S., Kent, W.J., Smit, A., Zhang, Z., Baertsch, R., Hardison, R., Haussler, D., and Miller, W. Human-Mouse Alignments with BLASTZ. Genome Res. 13(1), 103-7 (2003).