Version history

Added

  • #169 Have KAIJU_MKFMI module also export relevant taxdump files for downstream processes (by @jfy133)
  • #170 Publish sometimes generated unmapped.txt file for Kraken2 databases (❤️ to @softstam for reporting, fix @jfy133)
  • #178 Add additional validation checks for required MetaCache inputs (by @jfy133)
  • #179 Add new parameter --save_uncompressed_fastas to only optionally save decompressed input files (fix @jfy133)

Fixed

  • #158 Prevent sylph failing due to too long commands when many input genomes (by @softstam, @jfy133)
  • #160 Prevent sourmash failing due to too long commands when many input genomes (by @softstam, @jfy133)
  • #161 Force METACACHE_BUILD module to always use one CPU, as not multi-threaded, removing warning (by @jfy133)
  • #162 Fix links to FAQ in parameter docs (by @jfy133)
  • #163 Fix code block title in auxiliary files section of FAQ (by @jfy133)
  • #165 Force use of KrakenUniq --jellyfish-bin to ensure more regular execution (by @jfy133)
  • #173 Fix generated downstream samplesheet’s Bracken directory name being flipped (by @jfy133)
  • #175 Fix MetaCache receiving wrong taxonomy file (was seq2map, should have been accession2taxid) (by @sofstam, @jfy133)
  • #182 KMCP emits correct taxonomy files for downstream use (by @sofstam, @jfy133)
  • #183 Fix KrakenUniq using incorrectly non-renamed seqid2map taxonomy file, resulting in no taxonomy info during classification (by @jfy133)
  • #184 Stop generation of concatenated FASTA file of input files if not needed by selected tools (by @jfy133)

Dependencies

ToolOld VersionNew Version

Added

  • #117 - Updated to nf-core/tools template 3.5.1 (by @jfy133)
  • #143 - Documented how to resovle KrakenUniq unbound variable jellyfish issue (❤️ to @flass for suggesting, added by @jfy133)
  • #140 - Added MetaCache database building support (❤️ to @ChillarAnand for suggestion, added by @alxndrdiaz and @jfy133)
  • #144 - Added tutorial on how to convert an NCBI assembly_summary.txt file to input samplesheet (❤️ to @dialvarezs for improvements, by @jfy133)

Fixed

Dependencies

ToolOld VersionNew Version
nf-core3.4.13.5.1
MetaCache2.5.0

Added

  • #108 - Document workaround for building databases from single FASTA files, e.g. from NCBI RefSeq (by @pcantalupo)
  • #111 - Added sourmash reference building both for genomes and proteomes (by @Midnighter).
  • #117 - Updated to nf-core/tools template 3.4.1 (by @jfy133)
  • #124 - Added sylph reference building (by @jfy133 and @sofstam).

Fixed

  • #110 - Corrected the documented structures of the grouped output from the PREPROCESSING subworkflow (by @Midnighter).
  • #121 - Fix Kraken2 build failing with local files due to symlink-in-symlink mounting error with containers (❤️ to @ellmagu for reporting, fix by @jfy133)
  • #122 - Update DIAMOND to support more recent versions of NCBI taxonomy (by @jfy133)
  • #123 - Fix a MALT build validation check incorrectly assigned to —build_krakenuniq (by @jfy133)
  • #133 - Fix Kaiju compatible renamed FASTA files being always published even if —kaiju_keepintermediates false (by @jfy133)

Dependencies

ToolOld VersionNew Version
sourmash4.9.4
sylph0.9.0
kraken22.1.52.1.6
tar1.34
nf-core3.3.23.4.1
DIAMOND2.1.122.1.16
MultiQC1.311.32
Nextflow24.04.225.04.2

Deprecated

  • #118 - Deprecated automated building of tar.gz archives of all databases (by @jfy133)
  • #126 - Remove guidance about needed double and single quotes when giving to --<tool>_build_options as actually not necessary (by @jfy133)

Added

Initial release of nf-core/createtaxdb, created with the nf-core template.

Adds database building support for the following tools:

Additional optimisation when running with very large number of genomes by @BioWilko.

ToolOld VersionNew Version
Bracken3.1
Centrifuge1.0.4.2
DIAMOND2.1.12
find4.6.0
ganon2.1.0
kaiju1.10.0
KMCP0.9.4
Kraken22.1.5
KrakenUniq1.0.4
MALT0.6.2
pigz2.8
seqkit2.9.0
tar1.34
MultiQC1.29
p7zip16.02

Fixed

Dependencies

Deprecated