This directory contains the Build 36 "essentially finished" mouse genome (UCSC mm8, February 2006) from the Mouse Genome Sequencing Consortium. This assembly was produced at NCBI. Files included in this directory: - chr*.fa.gz: compressed FASTA sequence of each chromosome. Each chromosome is in a separate file in a gzip Fasta format. Repeats -- which are shown in lower case -- are annotated by RepeatMasker run at the sensitive setting and Tandem Repeats Finder (repeats of period 12 or less). md5sum.txt - MD5 checksum of these files to verify correct transmission The main assembly is contained in the chrN.fa.gz files, where N is the name of the chromosome. The chrN_random.fa.gz files contain clones that are not yet finished or cannot be placed with certainty at a specific place on the chromosome. -------------------------------------------------------------------- If you plan to download a large file or multiple files from this directory, we recommend that you use ftp rather than downloading the files via our website. To do so, ftp to hgdownload.cse.ucsc.edu, then go to the directory goldenPath/mm8/chromosomes. To download multiple files, use the "mget" command: mget ... - or - mget -a (to download all the files in the directory) The rsync command to download the entire directory: rsync -avzP rsync://hgdownload.cse.ucsc.edu/goldenPath/mm8/chromosomes/ . For a single file, e.g. chrM.fa.gz rsync -avzP \ rsync://hgdownload.cse.ucsc.edu/goldenPath/mm8/chromosomes/chrM.fa.gz . Or with wget, all files: wget --timestamping \ 'ftp://hgdownload.cse.ucsc.edu/goldenPath/mm8/chromosomes/*' With wget, a single file: wget --timestamping \ 'ftp://hgdownload.cse.ucsc.edu/goldenPath/mm8/chromosomes/chrM.fa.gz' \ -O chrM.fa.gz To unpack the fa.gz files: gunzip .fa.gz All the files in this directory are freely available for public use. This file last updated: 2006-02-16 - 16 February 2006