Sourmash
note
Quickly searches, compares, and analyzes genomic and metagenomic data sets.
The module can summarise data from the following sourmash output files (descriptions from command line help output):
- sourmash compare- create a similarity matrix comparing many samples.
 
- sourmash gather- search a metagenome signature against databases.
 
Additional information on sourmash and its outputs is available on the sourmash documentation website.
sourmash gather is modelled after the Kraken module, and builds a bar graph that
shows the coverage of top-5 genomes covered most by all samples. The number of top
genomes can be customized in the config file:
sourmash:
  gather:
    top_n: 5
File search patterns
sourmash/compare:
  fn: '*.labels.txt'
sourmash/gather:
  contents: intersect_bp,f_orig_query,f_match,f_unique_to_query,f_unique_weighted,
  num_lines: 1