The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
- Add
MultiQC-SAVmodule for Illumina Run QC reports - Migrate readgroup parsing from
preprocessing.nfto a local subworkflow - Update pipeline to use topic channels only, deprecation
versionschannels - Drop
bcl_demultiplexsubworkflow in favour ofbcl-convertmodule - Update the output handling to use the new workflow output definitions.
- Bump all modules to latest versions.
- The workflow now outputs data in a subdirectory per
library, including a library specific MultiQC report - Drop support for unaligned cram outputs in favor of untrimmed fastq outputs, which are more widely supported and can be used for a wider range of downstream analyses.
- Add support for untrimmed fastq outputs for unsupported genomes or when aligner is set to
false. - Drop support for global
alignerparameter. The aligner must now be specified per sample in the sample sheet or sample info. - Drop support for global
markdupandumi_awareparameters. Marking duplicates must now be specified per sample in the sample sheet or sample info. - Drop support for global
run_coverageanddisable_picard_metricsparameters. Running coverage analysis must now be specified per sample in the sample sheet or sample info. - Drop support for global
skip_trimming,trim_front,trim_tail,adapter_R1andadapter_R2parameters. Trimming must now be specified per sample in the sample sheet or sample info. - Drop support for global
roiparameter. Regions of interest must now be specified per sample in the sample sheet or sample info. - Simplify fastq sharding and make it user configurable via the
split_fastqparameter. - Added splice junctions and junctions outputs for RNA-seq alignments using STAR.
- Update bash options for nextflow version 25.04.2
- Update
bcl-convertmodule to 4.3.13 - Update
bowtie2module to 2.5.4 - Update
multiqcmodule to 1.29
- Panel coverage will now run all genelists' coverage analysis in one process per sample
- Fix readgroup sample names for
snp_prefixed samples - Edit
WGSprofile to includesamtools/sormadupand enable UMI aware duplicate marking
- Add flag to
snapto allow for more expansion of compressed data - Parse readgroup samplenames for
snp_prefixed samples - Replace GRCh39 by mm10 as default genome for
Mus musculus
- Update modules
- Fix bugs where samples with unequal number of files were not processed correctly
- Update to nf-core template v3.1.1
- Fix all tests
- Drop unused params
- Set aligner to
starfor RNA-seq - Finetune resources
- Fix some bugs for different input tags
- Update modules
- Fix syntax according to new rules
- Drop usage of
check_maxfunction in favour ofresourceLimits - Bump nf-schema to v2.1.0
- Add configuration profiles for several datatypes
- Fix usage of
--run_coverageparameter - Update modules
- Move repo to nf-cmgg organisation
- Overhaul reference data handling. All data is now sourced from the
igenomesconfig file and dynamically loaded based on theorganismmetadata field. - Drop support for bam/cram inputs
- Update to nf-core template v2.13.1
- Drop
fgbio/fastqtosamin favour ofsamtools/import - Add
samtools/coveragefor coverage analysis - Add testing with nf-test
- Overhaul panel coverage analysis
- Add support for
STARaligner for RNA-seq - Replace CSV sampleinfo definition with validated JSON/YAML
- Update modules
- Add fix for sample replicates with different split sizes
- Add option to use samtools for duplicate marking
- Updated modules
- Updated to nf-core template v2.8
- Fix issue where unaligned samples were not merged and fixed naming
- Add minimal support for sample "tags" in sample sheet
- Add panel coverage analysis
- Limit panel coverage analyses to samples with WES/WGS tags
- Make BAM_ARCHIVE accept channels with cram files
- Drop ELPREP support
- Add support for cram and index outputs in SAMTOOLS_SORMADUP
- Add option to disable marking duplicates
- Update to nf-core template v2.9
- Start using nf-validation for input parsing
- Drop support for generating aligner indices
- Support for bamprocessing with ELPREP
- Improved sample grouping
- Meta value clean up
- bump modules
- fix multiqc report to include all results
- fix duplicate marking with bamsormadup
- drop post alignment sorting, doesn't make sense as we do sort/markdup afterwards
- Add option to use elprep for alignment postprocessing
- Add
unaligneddenominator to unaligned samples
- Updated modules
- Bug with
fgbio/fastqtosamwhere having multiple files per read requires a read structure
Initial release of CenterForMedicalGeneticsGhent/nf-cmgg-preprocessing, created with the nf-core template.