Build pipeline: viash-hub.demultiplex.main-p5t6v
Source commit: f591b7849f
Source message: Output demultiplexer log files (#41)
6.7 KiB
demultiplex v0.3.10
Minor changes
- Moved the test resources to their new location (PR #37).
demultiplex v0.3.9
Bug fixes
-
Fix defaults for output arguments in nextflow schema's.
-
Fix an issue where an integer being passed to a argument with
type: doubleresulted in an error (PR #44).
Minor changes
-
Bump viash to 0.9.4, which adds support for nextflow versions starting major version 25.01 (PR #43 and #44).
-
Output demultiplexer logs and metrics (PR #41).
demultiplex v0.3.8
Bug fixes
- Provide a proper error when a FASTQ file is empty after demultiplexing (PR #40).
demultiplex v0.3.7
Minor updates
- Ignore lines starting with '#' when parsing run information CSV (PR #39).
demultiplex v0.3.6
Minor updates
- Allow letter case variants for headers when looking for sample information in run information CSV (PR #38).
demultiplex v0.3.5
Breaking changes
- The
demultiplexworkflow now outputs a list of directories for theoutput_falcoargument (one for each barcode) instead of one directory for the complete run. The output from therunnerworkflow remained unchanged (PR #33).
Minor updates
- In case Illumina data is detected in the input folder, check for the presence of the 'copyComplete.txt' file.
This check can be disabled using
--skip_copycomplete_check(PR #34).
demultiplex v0.3.4
Minor updates
- Resource labels are now automatically included during build (PR #32).
demultiplex v0.3.3
Breaking change
-
The
runnerdefines the output differently now:- The last part of the
--inputpath is expected to be the run ID and this run ID is used to create the output directory. - If the input is
file.tar.gzinstead of a directory, thefilepart is used as the run ID.
- The last part of the
-
The output structure is then as follows:
$publish_dir/<run_id>/<date_time_stamp>_demultiplex_<version>/For instance:
$publish_dir └── 200624_A00834_0183_BHMTFYDRXX └── 20241217_051404_demultiplex_v1.2 ├── run_information.csv ├── fastq │ ├── Sample1_S1_L001_R1_001.fastq.gz │ ├── Sample23_S3_L001_R1_001.fastq.gz │ ├── SampleA_S2_L001_R1_001.fastq.gz │ ├── Undetermined_S0_L001_R1_001.fastq.gz │ └── sampletest_S4_L001_R1_001.fastq.gz └── qc ├── fastqc │ ├── Sample1_S1_L001_R1_001.fastq.gz_fastqc_data.txt │ ├── Sample1_S1_L001_R1_001.fastq.gz_fastqc_report.html │ ├── Sample1_S1_L001_R1_001.fastq.gz_summary.txt │ ├── Sample23_S3_L001_R1_001.fastq.gz_fastqc_data.txt │ ├── Sample23_S3_L001_R1_001.fastq.gz_fastqc_report.html │ ├── Sample23_S3_L001_R1_001.fastq.gz_summary.txt │ ├── SampleA_S2_L001_R1_001.fastq.gz_fastqc_data.txt │ ├── SampleA_S2_L001_R1_001.fastq.gz_fastqc_report.html │ ├── SampleA_S2_L001_R1_001.fastq.gz_summary.txt │ ├── Undetermined_S0_L001_R1_001.fastq.gz_fastqc_data.txt │ ├── Undetermined_S0_L001_R1_001.fastq.gz_fastqc_report.html │ ├── Undetermined_S0_L001_R1_001.fastq.gz_summary.txt │ ├── sampletest_S4_L001_R1_001.fastq.gz_fastqc_data.txt │ ├── sampletest_S4_L001_R1_001.fastq.gz_fastqc_report.html │ └── sampletest_S4_L001_R1_001.fastq.gz_summary.txt └── multiqc_report.html -
This logic can be avoided by providing the flag
--plain_output.
Minor updates
- Added
output_run_informationargument that copies the run information file to the output (PR #31).
demultiplex v0.3.2
Bug fixes
- Ignore empty CSV entries when parsing sample information (PR #29).
demultiplex v0.3.1
Minor updates
- Add
--run_informationand--demultiplexerarguments torunnerworkflow (PR #27).
Bug fixes
-
Fix detection of sample IDs from Illumina V2 sample sheets (PR #28).
-
Provide a clear error message when
--run_informationis provided but not--demultiplexer(PR #27).
demultiplex v0.3.0
Major updates
The outflow of the workflow has been refactored to be more flexible (PR #19). This is done by creating a wrapper workflow runner that wraps the native demultiplex workflow. The runner workflow is responsible for setting the output directory based on the input arguments:
3 arguments exist for specifying the relative location of the 3 outputs of the workflow:
fastq_output: The directory where the demultiplexed fastq files are stored.falco_output: the directory for thefastqc/falcoreports.multiqc_output: The filename for themultiqcreport.
The target location path is determined by the following logic:
- If no
idis provided, the output directory is set to$publish_dir. - If an
idis explicitly set using Seqera Cloud or by adding--id <>, the output directory is set to$publish_dir/<id>.
The workflow has two optional flags to be used in combination with --id:
--add_date_time: rather than publishing the results under$publish_dir, this adds an additional layer$publish_dir/<date-time-stamp>/. This is useful when you want to keep track of multiple runs of the workflow (example:240322_143020).--add_workflow_id: adding this flag will add_demultiplex_<version>to the output directory (example:demultiplex_v0.2.0). When starting the workflow from a non-release, the version will be set toversion_unkonwn.
The default structure in the output directory is:
- Two sub-directories:
fastqqcfor the reports:multiqc_report.htmlfastqc/directory containing the different fastqc (falco) reports.
The $publish_dir variable corresponds to the argument provided with --publish-dir. The date-time-stamp is generated by the workflow based on when it was launched and is thus guaranteed to be unique.
demultiplex v0.2.0
Breaking changes
demultiplexworkflow: renamedsample_sheetargument torun_information(PR #24)
New features
- Add support for
bases2fastqdemultiplexer (PR #24)
Minor updates
- Add resource labels to workflows (PR #21).
demultiplex v0.1.1
Minor updates
-
Bump viash to 0.9.0 (PR #14).
-
demultiplexworkflow: usev0.2.0release instead ofmainbranch forbioboxdependencies (PR #11). -
Renamed
biobaserepository tobiobox(PR #13 and PR #15).
demultiplex v0.1.0
Initial release