Files

CI df41c7466c Build branch main with version main (8dff52a)

Build pipeline: viash-hub.htrnaseq.main-j2rpn

Source commit: 8dff52a885

Source message: Add `umi_length` parameter to runner. (#46)

2025-02-28 14:10:10 +00:00

5.8 KiB

Raw Blame History

htrnaseq v0.5.5

New functionality

Add umi_length parameter to the runner workflow (PR #46)

htrnaseq v0.5.4

Fix missing barcodes in the output from generate_pool_statistics, which caused an assertion error in create_pdata. In order to resolve the issue generate_well_statistics now outputs results for all chromosomes/scaffolds presented by the genome annotation, even when no reads were mapped to the regions in question. generate_pool_statistics will now remove regions from the output that have not at least one counts across all barcodes (PR #44).

htrnaseq v0.5.3

Bug fixes

Fix create_eset component failing to create when one of the input samples has no counts (PR #43).

htrnaseq v0.5.2

Bug fixes

create_fdata: remove duplicate entries from feature data (PR #41).

htrnaseq v0.5.1

Bug fixes

generate_well_statistics: fix ValueError when an empty .bam file is provided as input (PR #40).
create_pdata: avoid false positive ValueError for non-overlapping barcodes when input data contains empty (NA) values (PR #40).

htrnaseq v0.5.0

New functionality

Added ignore parameter was added to the runner workflow in order to pass over certain input files from the input directory (PR #39).

htrnaseq v0.4.0

Breaking changes

An effort has been made to align the inputs for the htrnaseq and the mapping and demultiplexing of the wells, in order simplify running these steps as seperate steps (PR #37).

Changes to the parallel_map component:
- The barcode argument has been renamed to barcodesFasta and the provided value for this argument must now be single FASTA file instead of a list of barcodes.
- The filenames for the provided FASTQ files must now conform to the format {name}_R(1|2).fasta, where {name} is the well identifiers. The well identifiers correspond to the headers of the FASTA file containing the barcodes (up untill the first whitespace). Forward and reverse FASTQ files must still be provided in pairs, meaning that the order of files provided to input_r1 and input_r2 remains important.
- The requirement for equal number of barcodes and FASTQ pairs to be provided has been dropped. Instead, the barcodes provided with barcodesFasta are matched to the input FASTQ files by comparing the header of the FASTA records to the file names of the provided FASTQ input files. Each barcode must match exactly one FASTQ input pair (forward and reverse reads), but FASTQ files that were not matched to any barcode are not processed. Basically, the barcodes fasta can now act as a filter for the FASTQ files to be mapped.
The utils/groupWells workflow has been removed.
parallel_map_wf has been removed as its functionality is now incomporated into the parallel_map component.
The pool, well_id, barcode, lane, pair_end and n_wells output arguments have been dropped from the well_demultiplexing workflow. This workflow now only outputs a list of demultiplexed FASTQ files.
A well_metadata workflow has been implemented that extracts the metadata that is no longer output by the well_demultiplexing workflow from the demultiplexed files and the barcodes FASTA.

New functionality

Multiple input directories can not be provided. The input reads from these from these directories will be joined per barcode before mapping. This is useful when data has been generated using multiple sequencing runs in order to increase sequencing depth (PR #38).

htrnaseq v0.3.0

New functionality

Added umi_length argument (PR #27).
Added runner workflow (PR #26, see below)

`runner` workflow

Removed wellBarcodesLength from parallel_map workflow (PR #27).

Major changes

A runner workflows has been added, providing two additional features:

Start from an input directory containing fastq files rather than a list of input fastq pairs.
Improve the output of the workflow

Input directory

It is now possible to specify a single --input <basedir> directory as input and the runner will extract the fastq file pairs. An error will be raised if the filename processing leads to errors.

Output

The runner provides a complete different approach to output. A couple of things are important here:

Output is split up in 2 parts:
1. The well-demultiplexed fastq files (--fastq_publish_dir)
2. All the other results of the workflow (--results_publish_dir)
The well-demultiplexed fastq file are stored under --fastq_publish_dir according to the following format:
```
$fastq_publish_dir/$id/<date-time>_htrnaseq_<version>/$plate_$lane/<well_id>_R1/2_001.fastq
```

The other results are stored under --results_publish_dir according to the following format:

$results_publish_dir/$project_id/$experiment_id/<date-time>_htrnaseq_<version>/

This is an example listing of this directory:

esets
fData
nrReadsNrGenesPerChrom
pData
report.html
star_output
starLogs

This output structure can be circumvented by using the --output_dir option, which will store all output in a single directory.

Using the htrnaseq workflow directory rather than the runner interface
Using the argument --plain_output with the runner. fastq files and other results will still be published in their respective directories, but not in a directory hierarchy as described above.

Minor changes

Use v0.2.0 version of cutadapt instead of main (PR #23).
Use v0.3.0 version of cutadapt
Bump viash to 0.9.1 (PR #31).
create_eset: Update base container image, R version and all dependencies to newer versions (PR #28).

5.8 KiB

Raw Blame History

htrnaseq v0.5.5

New functionality

htrnaseq v0.5.4

htrnaseq v0.5.3

Bug fixes

htrnaseq v0.5.2

Bug fixes

htrnaseq v0.5.1

Bug fixes

htrnaseq v0.5.0

New functionality

htrnaseq v0.4.0

Breaking changes

New functionality

htrnaseq v0.3.0

New functionality

`runner` workflow

Major changes

Input directory

Output

Minor changes

htrnaseq v0.2.0

New functionality

htrnaseq v0.1.0

5.8 KiB Raw Blame History

htrnaseq v0.5.5

New functionality

htrnaseq v0.5.4

htrnaseq v0.5.3

Bug fixes

htrnaseq v0.5.2

Bug fixes

htrnaseq v0.5.1

Bug fixes

htrnaseq v0.5.0

New functionality

htrnaseq v0.4.0

Breaking changes

New functionality

htrnaseq v0.3.0

New functionality

runner workflow

Major changes

Input directory

Output

Minor changes

htrnaseq v0.2.0

New functionality

htrnaseq v0.1.0

5.8 KiB

Raw Blame History

`runner` workflow