CHANGELOG.md

# demultiplex v0.3.6

## Minor updates

* Allow letter case variants for headers when looking for sample information in run information CSV (PR #38).

# demultiplex v0.3.5

## Breaking changes

* The `demultiplex` workflow now outputs a list of directories
  for the `output_falco` argument (one for each barcode) instead of one directory
  for the complete run. The output from the `runner` workflow remained
  unchanged (PR #33).

## Minor updates

* In case Illumina data is detected in the input folder, check for the presence of the 'copyComplete.txt' file.
  This check can be disabled using `--skip_copycomplete_check` (PR #34).

# demultiplex v0.3.4

## Minor updates

* Resource labels are now automatically included during build (PR #32).

# demultiplex v0.3.3

## Breaking change

- The `runner` defines the output differently now:

  - The last part of the `--input` path is expected to be the run ID and this run ID is used to create the output directory.
  - If the input is `file.tar.gz` instead of a directory, the `file` part is used as the run ID.

- The output structure is then as follows:

    ```
    $publish_dir/<run_id>/<date_time_stamp>_demultiplex_<version>/
    ```

    For instance:

    ```
    $publish_dir
    └── 200624_A00834_0183_BHMTFYDRXX
        └── 20241217_051404_demultiplex_v1.2
            ├── run_information.csv
            ├── fastq
            │   ├── Sample1_S1_L001_R1_001.fastq.gz
            │   ├── Sample23_S3_L001_R1_001.fastq.gz
            │   ├── SampleA_S2_L001_R1_001.fastq.gz
            │   ├── Undetermined_S0_L001_R1_001.fastq.gz
            │   └── sampletest_S4_L001_R1_001.fastq.gz
            └── qc
                ├── fastqc
                │   ├── Sample1_S1_L001_R1_001.fastq.gz_fastqc_data.txt
                │   ├── Sample1_S1_L001_R1_001.fastq.gz_fastqc_report.html
                │   ├── Sample1_S1_L001_R1_001.fastq.gz_summary.txt
                │   ├── Sample23_S3_L001_R1_001.fastq.gz_fastqc_data.txt
                │   ├── Sample23_S3_L001_R1_001.fastq.gz_fastqc_report.html
                │   ├── Sample23_S3_L001_R1_001.fastq.gz_summary.txt
                │   ├── SampleA_S2_L001_R1_001.fastq.gz_fastqc_data.txt
                │   ├── SampleA_S2_L001_R1_001.fastq.gz_fastqc_report.html
                │   ├── SampleA_S2_L001_R1_001.fastq.gz_summary.txt
                │   ├── Undetermined_S0_L001_R1_001.fastq.gz_fastqc_data.txt
                │   ├── Undetermined_S0_L001_R1_001.fastq.gz_fastqc_report.html
                │   ├── Undetermined_S0_L001_R1_001.fastq.gz_summary.txt
                │   ├── sampletest_S4_L001_R1_001.fastq.gz_fastqc_data.txt
                │   ├── sampletest_S4_L001_R1_001.fastq.gz_fastqc_report.html
                │   └── sampletest_S4_L001_R1_001.fastq.gz_summary.txt
                └── multiqc_report.html

    ```

- This logic can be avoided by providing the flag `--plain_output`.

# Minor updates

* Added `output_run_information` argument that copies the run information file to the output (PR #31).

# demultiplex v0.3.2

# Bug fixes

* Ignore empty CSV entries when parsing sample information (PR #29).

# demultiplex v0.3.1

# Minor updates

* Add `--run_information` and `--demultiplexer` arguments to `runner` workflow (PR #27).

# Bug fixes

* Fix detection of sample IDs from Illumina V2 sample sheets (PR #28).

* Provide a clear error message when `--run_information` is provided but not `--demultiplexer` (PR #27).

# demultiplex v0.3.0

## Major updates

The outflow of the workflow has been refactored to be more flexible (PR #19). This is done by creating a wrapper workflow `runner` that wraps the native `demultiplex` workflow. The `runner` workflow is responsible for setting the output directory based on the input arguments:

3 arguments exist for specifying the relative location of the 3 _outputs_ of the workflow:

- `fastq_output`: The directory where the demultiplexed fastq files are stored.
- `falco_output`: the directory for the `fastqc`/`falco` reports.
- `multiqc_output`: The filename for the `multiqc` report.

The target location path is determined by the following logic:

- If no `id` is provided, the output directory is set to `$publish_dir`.
- If an `id` is explicitly set using Seqera Cloud or by adding `--id <>`, the output directory is set to `$publish_dir/<id>`.

The workflow has two optional flags to be used in combination with `--id`:

- `--add_date_time`: rather than publishing the results under `$publish_dir`, this adds an additional layer `$publish_dir/<date-time-stamp>/`. This is useful when you want to keep track of multiple runs of the workflow (example: `240322_143020`).
- `--add_workflow_id`: adding this flag will add `_demultiplex_<version>` to the output directory (example: `demultiplex_v0.2.0`). When starting the workflow from a non-release, the version will be set to `version_unkonwn`.

The default structure in the output directory is:

- Two sub-directories:
  - `fastq`
  - `qc` for the reports:
    - `multiqc_report.html`
    - `fastqc/` directory containing the different fastqc (falco) reports.

The `$publish_dir` variable corresponds to the argument provided with `--publish-dir`. The `date-time-stamp` is generated by the workflow based on when it was launched and is thus guaranteed to be unique.

# demultiplex v0.2.0

## Breaking changes

* `demultiplex` workflow: renamed `sample_sheet` argument to `run_information` (PR #24)

## New features

* Add support for `bases2fastq` demultiplexer (PR #24)

## Minor updates

* Add resource labels to workflows (PR #21).

# demultiplex v0.1.1

## Minor updates

* Bump viash to 0.9.0 (PR #14).

* `demultiplex` workflow: use `v0.2.0` release instead of `main` branch for `biobox` dependencies (PR #11).

* Renamed `biobase` repository to `biobox` (PR #13 and PR #15).

# demultiplex v0.1.0

Initial release
Build branch main with version main (55699d2) Build pipeline: viash-hub.demultiplex.main-z5mvc Source commit: https://github.com/viash-hub/demultiplex/commit/55699d24c0ff8d88e572ccc1a5ebc40217bb9524 Source message: Allow letter case variations in run information CSV (#38) 2025-03-20 19:54:44 +00:00			`# demultiplex v0.3.6`

			`## Minor updates`

			`* Allow letter case variants for headers when looking for sample information in run information CSV (PR #38).`

Build branch main with version main (dd1f934) Build pipeline: viash-hub.demultiplex.main-v6krs Source commit: https://github.com/viash-hub/demultiplex/commit/dd1f93487f4e908999504e1fcdf97f6c59f743d9 Source message: Add check for the presence of a 'CopyComplete.txt' file (#34) 2025-01-14 12:10:26 +00:00			`# demultiplex v0.3.5`

Build branch main with version main (795abd6) Build pipeline: viash-hub.demultiplex.main-4xxbp Source commit: https://github.com/viash-hub/demultiplex/commit/795abd68688f4f31b0587bc8e4a7de49b6c00825 Source message: Run Falco in parallel for each well (#33) 2025-03-04 06:00:00 +00:00			`## Breaking changes`

			* The `demultiplex` workflow now outputs a list of directories
			for the `output_falco` argument (one for each barcode) instead of one directory
			for the complete run. The output from the `runner` workflow remained
			`unchanged (PR #33).`

Build branch main with version main (dd1f934) Build pipeline: viash-hub.demultiplex.main-v6krs Source commit: https://github.com/viash-hub/demultiplex/commit/dd1f93487f4e908999504e1fcdf97f6c59f743d9 Source message: Add check for the presence of a 'CopyComplete.txt' file (#34) 2025-01-14 12:10:26 +00:00			`## Minor updates`

			`* In case Illumina data is detected in the input folder, check for the presence of the 'copyComplete.txt' file.`
			This check can be disabled using `--skip_copycomplete_check` (PR #34).

Build branch main with version main (d7d3b3e) Build pipeline: viash-hub.demultiplex.main-6k4dk Source commit: https://github.com/viash-hub/demultiplex/commit/d7d3b3e1de64f07a8b161b68a60098103ff691fb Source message: Automatically include resources labels during build (#32) 2024-12-20 11:30:10 +00:00			`# demultiplex v0.3.4`

			`## Minor updates`

			`* Resource labels are now automatically included during build (PR #32).`

Build branch main with version main (8d3c288) Build pipeline: viash-hub.demultiplex.main-mgtgd Source commit: https://github.com/viash-hub/demultiplex/commit/8d3c2888d6df6688763d303dba03ce53d339fb0f Source message: Improved output logic (#30) * Improved output logic * Strip suffix only + update description * Apply suggestions from code review Co-authored-by: Dries Schaumont <5946712+DriesSchaumont@users.noreply.github.com> --------- Co-authored-by: Dries Schaumont <5946712+DriesSchaumont@users.noreply.github.com> 2024-12-18 15:15:49 +00:00			`# demultiplex v0.3.3`

			`## Breaking change`

			- The `runner` defines the output differently now:

			- The last part of the `--input` path is expected to be the run ID and this run ID is used to create the output directory.
			- If the input is `file.tar.gz` instead of a directory, the `file` part is used as the run ID.

			`- The output structure is then as follows:`

			```
			`$publish_dir/<run_id>/<date_time_stamp>_demultiplex_<version>/`
			```

			`For instance:`

			```
			`$publish_dir`
			`└── 200624_A00834_0183_BHMTFYDRXX`
			`└── 20241217_051404_demultiplex_v1.2`
Build branch main with version main (798e361) Build pipeline: viash-hub.demultiplex.main-plzkv Source commit: https://github.com/viash-hub/demultiplex/commit/798e361afeea8dbb84dc23ef38e4fbc3883463e7 Source message: Add run information to output (#31) 2024-12-19 15:54:37 +00:00			`├── run_information.csv`
Build branch main with version main (8d3c288) Build pipeline: viash-hub.demultiplex.main-mgtgd Source commit: https://github.com/viash-hub/demultiplex/commit/8d3c2888d6df6688763d303dba03ce53d339fb0f Source message: Improved output logic (#30) * Improved output logic * Strip suffix only + update description * Apply suggestions from code review Co-authored-by: Dries Schaumont <5946712+DriesSchaumont@users.noreply.github.com> --------- Co-authored-by: Dries Schaumont <5946712+DriesSchaumont@users.noreply.github.com> 2024-12-18 15:15:49 +00:00			`├── fastq`
			`│ ├── Sample1_S1_L001_R1_001.fastq.gz`
			`│ ├── Sample23_S3_L001_R1_001.fastq.gz`
			`│ ├── SampleA_S2_L001_R1_001.fastq.gz`
			`│ ├── Undetermined_S0_L001_R1_001.fastq.gz`
			`│ └── sampletest_S4_L001_R1_001.fastq.gz`
			`└── qc`
			`├── fastqc`
			`│ ├── Sample1_S1_L001_R1_001.fastq.gz_fastqc_data.txt`
			`│ ├── Sample1_S1_L001_R1_001.fastq.gz_fastqc_report.html`
			`│ ├── Sample1_S1_L001_R1_001.fastq.gz_summary.txt`
			`│ ├── Sample23_S3_L001_R1_001.fastq.gz_fastqc_data.txt`
			`│ ├── Sample23_S3_L001_R1_001.fastq.gz_fastqc_report.html`
			`│ ├── Sample23_S3_L001_R1_001.fastq.gz_summary.txt`
			`│ ├── SampleA_S2_L001_R1_001.fastq.gz_fastqc_data.txt`
			`│ ├── SampleA_S2_L001_R1_001.fastq.gz_fastqc_report.html`
			`│ ├── SampleA_S2_L001_R1_001.fastq.gz_summary.txt`
			`│ ├── Undetermined_S0_L001_R1_001.fastq.gz_fastqc_data.txt`
			`│ ├── Undetermined_S0_L001_R1_001.fastq.gz_fastqc_report.html`
			`│ ├── Undetermined_S0_L001_R1_001.fastq.gz_summary.txt`
			`│ ├── sampletest_S4_L001_R1_001.fastq.gz_fastqc_data.txt`
			`│ ├── sampletest_S4_L001_R1_001.fastq.gz_fastqc_report.html`
			`│ └── sampletest_S4_L001_R1_001.fastq.gz_summary.txt`
			`└── multiqc_report.html`

			```

			- This logic can be avoided by providing the flag `--plain_output`.

Build branch main with version main (798e361) Build pipeline: viash-hub.demultiplex.main-plzkv Source commit: https://github.com/viash-hub/demultiplex/commit/798e361afeea8dbb84dc23ef38e4fbc3883463e7 Source message: Add run information to output (#31) 2024-12-19 15:54:37 +00:00			`# Minor updates`

			* Added `output_run_information` argument that copies the run information file to the output (PR #31).
Build branch main with version main (8d3c288) Build pipeline: viash-hub.demultiplex.main-mgtgd Source commit: https://github.com/viash-hub/demultiplex/commit/8d3c2888d6df6688763d303dba03ce53d339fb0f Source message: Improved output logic (#30) * Improved output logic * Strip suffix only + update description * Apply suggestions from code review Co-authored-by: Dries Schaumont <5946712+DriesSchaumont@users.noreply.github.com> --------- Co-authored-by: Dries Schaumont <5946712+DriesSchaumont@users.noreply.github.com> 2024-12-18 15:15:49 +00:00
Build branch main with version main (45accaa) Build pipeline: viash-hub.demultiplex.main-ghkns Source commit: https://github.com/viash-hub/demultiplex/commit/45accaa50e52c6b29c78259995293789f345c80d Source message: Avoid empty csv entries when parsing sample information (#29) 2024-12-11 18:06:57 +00:00			`# demultiplex v0.3.2`

			`# Bug fixes`

			`* Ignore empty CSV entries when parsing sample information (PR #29).`

Build branch main with version main (d3a9c9b) Build pipeline: viash-hub.demultiplex.main-gcqps Source commit: https://github.com/viash-hub/demultiplex/commit/d3a9c9b3be9790bf89258b14c9a8c83af945ad47 Source message: Fix detection of sample names from Illumina V2 sample sheets (#28) 2024-12-11 14:22:44 +00:00			`# demultiplex v0.3.1`

Build branch main with version main (5c096fc) Build pipeline: viash-hub.demultiplex.main-svj8q Source commit: https://github.com/viash-hub/demultiplex/commit/5c096fce4015435019d81e2cc524a478f4034adc Source message: Fixes for v0.3.0 (#27) Co-authored-by: DriesSchaumont <5946712+DriesSchaumont@users.noreply.github.com> 2024-12-11 15:44:12 +00:00			`# Minor updates`

			* Add `--run_information` and `--demultiplexer` arguments to `runner` workflow (PR #27).

Build branch main with version main (d3a9c9b) Build pipeline: viash-hub.demultiplex.main-gcqps Source commit: https://github.com/viash-hub/demultiplex/commit/d3a9c9b3be9790bf89258b14c9a8c83af945ad47 Source message: Fix detection of sample names from Illumina V2 sample sheets (#28) 2024-12-11 14:22:44 +00:00			`# Bug fixes`

			`* Fix detection of sample IDs from Illumina V2 sample sheets (PR #28).`

Build branch main with version main (5c096fc) Build pipeline: viash-hub.demultiplex.main-svj8q Source commit: https://github.com/viash-hub/demultiplex/commit/5c096fce4015435019d81e2cc524a478f4034adc Source message: Fixes for v0.3.0 (#27) Co-authored-by: DriesSchaumont <5946712+DriesSchaumont@users.noreply.github.com> 2024-12-11 15:44:12 +00:00			* Provide a clear error message when `--run_information` is provided but not `--demultiplexer` (PR #27).

Build branch main with version main (b7e30f3) Build pipeline: viash-hub.demultiplex.main-b6qtk Source commit: https://github.com/viash-hub/demultiplex/commit/b7e30f394e7a4ae7a51961ade824e24ffe718e1f Source message: Revert 64a371e, update CHANGELOG for 0.3.0 (#26) 2024-12-11 09:30:29 +00:00			`# demultiplex v0.3.0`
Build branch main with version main (6e6be28) Build pipeline: viash-hub.demultiplex.main-wmjf8 Source commit: https://github.com/viash-hub/demultiplex/commit/6e6be28b85ab619214ae05a017a33498c0dc8890 Source message: Prepare CHANGELOG for release 0.2.0 2024-12-05 10:37:33 +00:00
Build branch main with version main (64a371e) Build pipeline: viash-hub.demultiplex.main-lcwkk Source commit: https://github.com/viash-hub/demultiplex/commit/64a371e168472c987d9ec61b0373b0e28762dfcd Source message: Update CHANGELOG for 0.2.0 (#25) 2024-12-11 09:03:18 +00:00			`## Major updates`

			The outflow of the workflow has been refactored to be more flexible (PR #19). This is done by creating a wrapper workflow `runner` that wraps the native `demultiplex` workflow. The `runner` workflow is responsible for setting the output directory based on the input arguments:

			`3 arguments exist for specifying the relative location of the 3 _outputs_ of the workflow:`

			- `fastq_output`: The directory where the demultiplexed fastq files are stored.
			- `falco_output`: the directory for the `fastqc`/`falco` reports.
			- `multiqc_output`: The filename for the `multiqc` report.

			`The target location path is determined by the following logic:`

			- If no `id` is provided, the output directory is set to `$publish_dir`.
			- If an `id` is explicitly set using Seqera Cloud or by adding `--id <>`, the output directory is set to `$publish_dir/<id>`.

			The workflow has two optional flags to be used in combination with `--id`:

			- `--add_date_time`: rather than publishing the results under `$publish_dir`, this adds an additional layer `$publish_dir/<date-time-stamp>/`. This is useful when you want to keep track of multiple runs of the workflow (example: `240322_143020`).
			- `--add_workflow_id`: adding this flag will add `_demultiplex_<version>` to the output directory (example: `demultiplex_v0.2.0`). When starting the workflow from a non-release, the version will be set to `version_unkonwn`.

			`The default structure in the output directory is:`

			`- Two sub-directories:`
			- `fastq`
			- `qc` for the reports:
			- `multiqc_report.html`
			- `fastqc/` directory containing the different fastqc (falco) reports.

			The `$publish_dir` variable corresponds to the argument provided with `--publish-dir`. The `date-time-stamp` is generated by the workflow based on when it was launched and is thus guaranteed to be unique.

Build branch main with version main (b7e30f3) Build pipeline: viash-hub.demultiplex.main-b6qtk Source commit: https://github.com/viash-hub/demultiplex/commit/b7e30f394e7a4ae7a51961ade824e24ffe718e1f Source message: Revert 64a371e, update CHANGELOG for 0.3.0 (#26) 2024-12-11 09:30:29 +00:00			`# demultiplex v0.2.0`

			`## Breaking changes`

			* `demultiplex` workflow: renamed `sample_sheet` argument to `run_information` (PR #24)

Build branch main with version main (6e6be28) Build pipeline: viash-hub.demultiplex.main-wmjf8 Source commit: https://github.com/viash-hub/demultiplex/commit/6e6be28b85ab619214ae05a017a33498c0dc8890 Source message: Prepare CHANGELOG for release 0.2.0 2024-12-05 10:37:33 +00:00			`## New features`

			* Add support for `bases2fastq` demultiplexer (PR #24)
Build branch main with version main (5cb1323) Build pipeline: viash-hub.demultiplex.main-6d5fm Source commit: https://github.com/viash-hub/demultiplex/commit/5cb13230bf682321226addce896a3015e8864913 Source message: Add resource labels to workflows. (#22) * Add resource labels to workflows. * Add CHANGELOG entry 2024-11-06 17:52:30 +00:00
			`## Minor updates`

			`* Add resource labels to workflows (PR #21).`

Build branch main with version main (aca8016) Build pipeline: viash-hub.demultiplex.main-gl2l5 Source commit: https://github.com/viash-hub/demultiplex/commit/aca8016d742c72e4badc2fc91a40b8f7f1290010 Source message: Trigger CI 2024-09-17 14:38:40 +00:00			`# demultiplex v0.1.1`
Build branch main with version main (ed860be) Build pipeline: vsh-ci-template-hp9jh Source commit: https://github.com/viash-hub/demultiplex/commit/ed860bed30c98b981270f104cffbdb9b7f1ce141 Source message: BUG: Wrong pointer to biobox dependency (#15) * Fix pointer to biobox * Add PR number 2024-06-24 12:48:53 +00:00
Build branch main with version main (399e469) Build pipeline: viash-hub.demultiplex.main-f55jt Source commit: https://github.com/viash-hub/demultiplex/commit/399e46901d6ce882a7430842b0d74774736439e2 Source message: Add test_resources to .gitignore 2024-09-13 09:46:12 +00:00			`## Minor updates`
Build branch main with version main (ed860be) Build pipeline: vsh-ci-template-hp9jh Source commit: https://github.com/viash-hub/demultiplex/commit/ed860bed30c98b981270f104cffbdb9b7f1ce141 Source message: BUG: Wrong pointer to biobox dependency (#15) * Fix pointer to biobox * Add PR number 2024-06-24 12:48:53 +00:00
Build branch main with version main (399e469) Build pipeline: viash-hub.demultiplex.main-f55jt Source commit: https://github.com/viash-hub/demultiplex/commit/399e46901d6ce882a7430842b0d74774736439e2 Source message: Add test_resources to .gitignore 2024-09-13 09:46:12 +00:00			`* Bump viash to 0.9.0 (PR #14).`
Build branch main with version main (ed860be) Build pipeline: vsh-ci-template-hp9jh Source commit: https://github.com/viash-hub/demultiplex/commit/ed860bed30c98b981270f104cffbdb9b7f1ce141 Source message: BUG: Wrong pointer to biobox dependency (#15) * Fix pointer to biobox * Add PR number 2024-06-24 12:48:53 +00:00
Build branch main with version main (399e469) Build pipeline: viash-hub.demultiplex.main-f55jt Source commit: https://github.com/viash-hub/demultiplex/commit/399e46901d6ce882a7430842b0d74774736439e2 Source message: Add test_resources to .gitignore 2024-09-13 09:46:12 +00:00			* `demultiplex` workflow: use `v0.2.0` release instead of `main` branch for `biobox` dependencies (PR #11).
Build branch main with version main (ed860be) Build pipeline: vsh-ci-template-hp9jh Source commit: https://github.com/viash-hub/demultiplex/commit/ed860bed30c98b981270f104cffbdb9b7f1ce141 Source message: BUG: Wrong pointer to biobox dependency (#15) * Fix pointer to biobox * Add PR number 2024-06-24 12:48:53 +00:00
Build branch main with version main (399e469) Build pipeline: viash-hub.demultiplex.main-f55jt Source commit: https://github.com/viash-hub/demultiplex/commit/399e46901d6ce882a7430842b0d74774736439e2 Source message: Add test_resources to .gitignore 2024-09-13 09:46:12 +00:00			* Renamed `biobase` repository to `biobox` (PR #13 and PR #15).
Build branch main with version main (ed860be) Build pipeline: vsh-ci-template-hp9jh Source commit: https://github.com/viash-hub/demultiplex/commit/ed860bed30c98b981270f104cffbdb9b7f1ce141 Source message: BUG: Wrong pointer to biobox dependency (#15) * Fix pointer to biobox * Add PR number 2024-06-24 12:48:53 +00:00
Build branch main with version main (aca8016) Build pipeline: viash-hub.demultiplex.main-gl2l5 Source commit: https://github.com/viash-hub/demultiplex/commit/aca8016d742c72e4badc2fc91a40b8f7f1290010 Source message: Trigger CI 2024-09-17 14:38:40 +00:00			`# demultiplex v0.1.0`
Build branch main with version main (ed860be) Build pipeline: vsh-ci-template-hp9jh Source commit: https://github.com/viash-hub/demultiplex/commit/ed860bed30c98b981270f104cffbdb9b7f1ce141 Source message: BUG: Wrong pointer to biobox dependency (#15) * Fix pointer to biobox * Add PR number 2024-06-24 12:48:53 +00:00
Build branch main with version main (399e469) Build pipeline: viash-hub.demultiplex.main-f55jt Source commit: https://github.com/viash-hub/demultiplex/commit/399e46901d6ce882a7430842b0d74774736439e2 Source message: Add test_resources to .gitignore 2024-09-13 09:46:12 +00:00			`Initial release`