
Murasaki is a scalable and fast, language theory-based homology detection tool across multiple large genomes. it enable whole-genome scale multiple genome global alignments. supports unlimited length gapped-seed patterns and unique tf-idf based filtering.
murasaki is an anchor alignment software, which is
* exteremely fast (17 cpu hours for whole human x mouse genome (with
40 nodes: 52 wall minutes))
* scalable (arbitrarily parallelizable across multiple nodes using mpi.
even a single node with 16gb of ram can handle over 1gbp of sequence.)
* unlimited pattern length
* repeat tolerant
* intelligent noise reduction
this package contains all files that are common to the single core murasaki package and the multi core murasaki-mpi package.