Description

This track shows multiple alignments of 11 primate assemblies on this target/reference assembly (Ring-tailed lemur - 2021-11-04 - Vertebrate Genomes Project).

This is a composite track with both a multiz calculated multiple alignment and the same set of assemblies used to calculate the 11-way multiple alignment with the cactus alignment procedure.

The multiz multiple alignment is generated using multiz and other tools in the UCSC/Penn State Bioinformatics comparative genomics alignment pipeline.

Gap Annotation

The Display chains between alignments configuration option enables display of gaps between alignment blocks in the pairwise alignments in a manner similar to the Chain track display. Missing sequence in any assembly is highlighted in the track display by regions of yellow when zoomed out and by Ns when displayed at base level. The following conventions are used:

Genomic Breaks

Discontinuities in the genomic context (chromosome, scaffold or region) of the aligned DNA in the aligning species are shown as follows:

Base Level

When zoomed-in to the base-level display, the track shows the base composition of each alignment. The numbers and symbols on the Gaps line indicate the lengths of gaps in the Ring-tailed lemur - 2021-11-04 - Vertebrate Genomes Project sequence at those alignment positions relative to the longest non-Ring-tailed lemur sequence. If there is sufficient space in the display, the size of the gap is shown. If the space is insufficient and the gap size is a multiple of 3, a "*" is displayed; other gap sizes are indicated by "+".

count alignment
percent
assembly and
browser link
maf file
type
common name/assembly date
assembly submitter
01 n/a GCF_020740605.2_mLemCat1.pri reference ring-tailed lemur (mLemCat1 2021)/2021-11-04
02 47.884 GCF_027406575.1 maf net slow loris/2022-12-28/VGP
03 43.366 GCA_028858775.2 maf net chimpanzee/2024-01-08/NHGRI/NIH
04 43.364 GCA_029289425.2 maf net pygmy chimpanzee/2024-01-08/NHGRI/NIH
05 43.348 hg38 maf net Human/hg38/Dec. 2013 (GRCh38/hg38)/GRCh38 Genome Reference Consortium Human Reference 38 (GCA_000001405.15)
06 43.347 hs1 maf net Human/hs1/Jan. 2022 (T2T CHM13v2.0/hs1)/Telomere to telomere (T2T) assembly of haploid CHM13 + chrY (GCA_009914755.4)
07 43.286 GCA_028885655.2 maf net Sumatran orangutan/2024-01-05/NHGRI/NIH
08 43.272 GCA_028885625.2 maf net Bornean orangutan/2024-01-08/NHGRI/NIH
09 43.269 GCA_029281585.2 maf net western lowland gorilla/2024-01-08/NHGRI/NIH
10 41.960 GCA_028878055.2 maf net siamang/2024-01-05/NHGRI/NIH
11 32.460 GCF_011100555.1 maf net white-tufted-ear marmoset/2021-04-28/VGP

Alignments identity

showing percent identity, how much of the target is matched by the query
chainssyntenicreciprocal
best
common
name
assembly
47.88446.71546.538slow lorisGCF_027406575.1
43.36642.71542.561chimpanzeeGCA_028858775.2
43.36442.71142.560pygmy chimpanzeeGCA_029289425.2
43.34842.67242.532Humanhg38
43.34742.69542.544Humanhs1
43.28642.62642.483Sumatran orangutanGCA_028885655.2
43.27242.61542.472Bornean orangutanGCA_028885625.2
43.26942.61642.466western lowland gorillaGCA_029281585.2
41.96041.31941.183siamangGCA_028878055.2
32.46031.76631.869white-tufted-ear marmosetGCF_011100555.1

Display Conventions and Configuration

In full and pack display modes, conservation scores are displayed as a wiggle track (histogram) in which the height reflects the size of the score. The conservation wiggles can be configured in a variety of ways to highlight different aspects of the displayed information. Click the Graph configuration help link for an explanation of the configuration options.

Methods

Pairwise alignments of each species to the Ring-tailed lemur//hive/data/genomes/asmHubs/refseqBuild/GCF/020/740/605/GCF_020740605.2_mLemCat1.pri/html/GCF_020740605.2_mLemCat1.pri.names.tab/GCF_020740605.2/2021-11-04 genome are displayed below the conservation histogram as a grayscale density plot (in pack mode) or as a wiggle (in full mode) that indicates alignment quality. In dense display mode, conservation is shown in grayscale using darker values to indicate higher levels of overall conservation as scored by phastCons.

Checkboxes on the track configuration page allow selection of the species to include in the pairwise display. Note that excluding species from the pairwise display does not alter the the conservation score display.

To view detailed information about the alignments at a specific position, zoom the display in to 30,000 or fewer bases, then click on the alignment.

From the cactus alignment, a target specific maf file was extracted from the cactus hal file. This maf file is used to construct the track.

Credits

This track was created using the following programs:

References

Harris RS. Improved pairwise alignment of genomic DNA. Ph.D. Thesis. Pennsylvania State University, USA. 2007.

PhyloP:

Cooper GM, Stone EA, Asimenos G, NISC Comparative Sequencing Program., Green ED, Batzoglou S, Sidow A. Distribution and intensity of constraint in mammalian genomic sequence. Genome Res. 2005 Jul;15(7):901-13. PMID: 15965027; PMC: PMC1172034; DOI: 10.1101/gr.3577405

Pollard KS, Hubisz MJ, Rosenbloom KR, Siepel A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 2010 Jan;20(1):110-21. PMID: 19858363; PMC: PMC2798823

Siepel A, Haussler D. Phylogenetic Hidden Markov Models. In: Nielsen R, editor. Statistical Methods in Molecular Evolution. New York: Springer; 2005. pp. 325-351. DOI: 10.1007/0-387-27733-1_12

Siepel A, Pollard KS, and Haussler D. New methods for detecting lineage-specific selection. In Proceedings of the 10th International Conference on Research in Computational Molecular Biology (RECOMB 2006), pp. 190-205. DOI: 10.1007/11732990_17

Armstrong J, Hickey G, Diekhans M, Fiddes IT, Novak AM, Deran A, Fang Q, Xie D, Feng S, Stiller J et al. Progressive Cactus is a multiple-genome aligner for the thousand-genome era. Nature. 2020 Nov;587(7833):246-251. DOI: 10.1038/s41586-020-2871-y; PMID: 33177663; PMC: PMC7673649