Schema for S2 30-state (R5) - modENCODE 30-state Model for S2 cells (R5)
  Database: dm6    Primary Table: S2_30state    Row Count: 73,964
fieldexampleSQL type info
bin 585smallint(5) unsigned range
chrom chr2Lvarchar(255) values
chromStart 5199int(10) unsigned range
chromEnd 5799int(10) unsigned range
name 20_1varchar(255) values
score 1000int(10) unsigned range
strand +char(1) values
thickStart 5199int(10) unsigned range
thickEnd 5799int(10) unsigned range
itemRgb 16744272int(10) unsigned range

Sample Rows
 
binchromchromStartchromEndnamescorestrandthickStartthickEnditemRgb
585chr2L5199579920_11000+51995799255,127,80
585chr2L5799679930_21000+57996799211,211,211
585chr2L6799939925_31000+67999399128,128,128
585chr2L9399101996_41000+939910199255,0,0
585chr2L10199115998_51000+1019911599128,0,128
585chr2L11599157997_61000+1159915799128,0,128
585chr2L15799165998_71000+1579916599128,0,128
585chr2L16599175994_81000+1659917599255,0,0
585chr2L17599181993_91000+1759918199255,0,0
585chr2L18199183995_101000+1819918399255,0,0

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

S2 30-state (R5) (S2_30state) Track Description
 

Description

Chromatin is composed of DNA and a variety of modified histones and non-histone proteins, which have an impact on cell differentiation, gene regulation and other key cellular processes. Here we present a genome-wide chromatin landscape for Drosophila melanogaster based on eighteen histone modifications, summarized by nine prevalent combinatorial patterns. Integrative analysis with other data (non-histone chromatin proteins, DNase I hypersensitivity, GRO-Seq reads produced by engaged polymerase, short/long RNA products) reveals discrete characteristics of chromosomes, genes, regulatory elements and other functional domains. We find that active genes display distinct chromatin signatures that are correlated with disparate gene lengths, exon patterns, regulatory functions and genomic contexts. We also demonstrate a diversity of signatures among Polycomb targets that include a subset with paused polymerase. This systematic profiling and integrative analysis of chromatin signatures provides insights into how genomic elements are regulated, and will serve as a resource for future experimental investigations of genome structure and function.

Methods

Multiple histone modifications act in concert to determine genome functions, producing combinatorial chromatin "states". We used unsupervised, multivariate HMM machine learning to segment the genome based on the combinatorial patterns of 18 histone modifications in S2 and BG3 cells (52). This produced a nine state intensity-based model (continuous model states c1 to c9) reflecting major chromatin states, as well as a 30 state model that identifies combinatorial patterns at a finer resolution (discrete model states d1 to d30).

30-state9-stateDescriptionColor
1
2
3
4
5
6
1Promoter and TSS
7
8
9
10
2Transcription elongation
11
12
13
14
15
16
17
3Regulatory regions (enhancers)
18
19
20
21
4Active introns
22
23
24
5Active genes on the Male X
256Polycomb-mediated repression
26
27
7Pericentromeric heterochromatin
28
29
8Heterochromatin-like embedded in euchromatin
309Transcriptionally silent, intergenic

Credits

Kharchenko PV., et al. Comprehensive analysis of the chromatin landscape in Drosophila melanogaster. Nature Volume: 471, Pages: 480-485 (24 March 2011)