v2.0 Release Candidate #119

chris-cheshire · 2022-05-05T12:45:49Z

Major Changes

[#53] - Complete redesign of the samplesheet input system. Controls are no longer hard-coded to `igg`, and the process of assigning controls to samples has been simplified.
[#93], [#73] - Additional sample normalisation options have been added. In addition to normalising using detected spike-in DNA, there are now several options for normalising against read depth instead, and normalisation can also be skipped entirely.
[#62] - Added MACS2 as an optional peak caller. The peak caller can now be selected using the `peakcaller` parameter. Both peak callers can be run using `--peakcaller SEACR,MACS2`. The first item in the list is the primary caller, and its peaks are used downstream. The secondary caller is still run, and its output is written to the results folder.
[#101] - v1.1 ran consensus peak calling both at the group level and across all samples, which caused performance issues for larger sample sets. There is now a new `consensus_peak_mode` parameter that defaults to `group`. Consensus peaks will only be computed across all samples if this is changed to `all`.
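
To illustrate the primary/secondary behaviour described for `--peakcaller`, here is a minimal Python sketch. It is not the pipeline's implementation; the function name `split_peakcallers` is hypothetical, and the sketch only assumes that the value is a comma-separated list whose first item is the primary caller.

```python
def split_peakcallers(value):
    """Split a comma-separated peak caller list into (primary, secondaries).

    Hypothetical helper for illustration only: the first item is treated as
    the primary caller, any remaining items as secondary callers.
    """
    callers = [item.strip() for item in value.split(",") if item.strip()]
    if not callers:
        raise ValueError("at least one peak caller must be given")
    return callers[0], callers[1:]


primary, secondary = split_peakcallers("SEACR,MACS2")
print(primary)    # SEACR
print(secondary)  # ['MACS2']
```

Swapping the order to `MACS2,SEACR` would make MACS2 the primary caller under the same rule.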

Enhancements

Updated pipeline template to nf-core/tools 2.3.2.
Upgraded pipeline to support the new nf-core module configuration system.
More robust CI testing. Over 213 tests are now run before any code is merged into the main code base.
More control over which parts of the pipeline run. Explicit skipping has been implemented for every section of the pipeline.
Added options for scaling control data before it is used to call peaks. This is especially useful when using read depth normalisation as this can sometimes result in few peaks being called due to high background levels.
Added support for Bowtie2 large indexes.
IGV auto-session builder now supports gff and fna file extensions.
Bowtie2 alignment now runs in `--end-to-end` mode only if trimming is skipped. If trimming is activated, it runs in `--local` mode.
[#88] - Many processes have been optimized for resource utilization. In particular, single-threaded processes now request only 1 core rather than 2.
[#63] - Custom containers for Python reporting have been condensed into a single container and added to BioConda.
[#76] - Standardized Python versions across reporting modules.
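
The Bowtie2 mode rule above can be sketched as a small Python function. This is only an illustration of the stated rule, not code from the pipeline; the function name and the `skip_trimming` argument are hypothetical, while `--end-to-end` and `--local` are the standard Bowtie2 alignment mode flags.

```python
def bowtie2_mode_flag(skip_trimming):
    """Pick the Bowtie2 alignment mode flag from the trimming setting.

    Hypothetical helper: end-to-end alignment when trimming is skipped,
    local alignment when trimming is run.
    """
    return "--end-to-end" if skip_trimming else "--local"


print(bowtie2_mode_flag(skip_trimming=True))   # --end-to-end
print(bowtie2_mode_flag(skip_trimming=False))  # --local
```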

Fixes

[#120] - DeepTools compute matrix/heatmaps now only runs if there are peaks detected.
[#99] - Large upset plots were causing process crashes. Upset plots will now fail gracefully if the number of samples in the consensus group is more than 10.
[#95] - Fixed FRIP calculation performance issues and crashes.
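
As a rough sketch of what "failing gracefully" for large upset plots could look like, the Python snippet below skips the plot with a warning instead of raising an error. The names `MAX_UPSET_SAMPLES` and `plan_upset_plot` are hypothetical; only the limit of 10 samples comes from the entry for #99, and the actual reporting code may handle this differently.

```python
import logging

logger = logging.getLogger("upset_sketch")

# Limit taken from the changelog entry for #99; the constant name is made up.
MAX_UPSET_SAMPLES = 10


def plan_upset_plot(sample_names):
    """Return the samples to plot, or None if the plot should be skipped.

    Hypothetical helper: groups with more than MAX_UPSET_SAMPLES samples are
    skipped with a warning rather than crashing the process.
    """
    if len(sample_names) > MAX_UPSET_SAMPLES:
        logger.warning(
            "Skipping upset plot: %d samples exceeds the limit of %d",
            len(sample_names),
            MAX_UPSET_SAMPLES,
        )
        return None
    return list(sample_names)
```

A caller would check for `None` and continue with the rest of the report.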

Software dependencies

Note: since the pipeline now uses Nextflow DSL2, each process is run with its own Biocontainer. As a result, the pipeline may occasionally use different versions of the same tool in different processes. The overall software dependency changes compared to the last release are listed below for reference.

| Dependency | Old version | New version |
| ---------- | ----------- | ----------- |
| `samtools` | 1.14        | 1.15.1      |
| `bowtie2`  | 2.4.2       | 2.4.4       |
| `picard`   | 2.25.7      | 2.27.1      |

NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
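
The three NB rules can be read as a simple classification over the old and new version columns. The Python sketch below is illustrative only: the function name `classify_dependency` is hypothetical, and it assumes the last rule means "an old version is listed but no new one".

```python
def classify_dependency(old_version, new_version):
    """Classify a dependency table row as updated, added, or removed.

    Hypothetical helper: an empty string or None means the version cell
    is blank in the table.
    """
    if old_version and new_version:
        return "updated"
    if new_version:
        return "added"
    if old_version:
        return "removed"
    raise ValueError("a row needs at least one version")


# Rows taken from the dependency table for this release.
rows = [("samtools", "1.14", "1.15.1"),
        ("bowtie2", "2.4.2", "2.4.4"),
        ("picard", "2.25.7", "2.27.1")]
for name, old, new in rows:
    print(name, classify_dependency(old, new))
```

All three rows in this release have both versions listed, so each is classified as updated.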

PR checklist

Release updates

modules/local/linux/awk_script.nf

Replace gawk containers with ubuntu

FriederikeHanssen

Love it, just some teeny tiny things from my side mostly about version and test data.

.github/workflows/ci.yml

CHANGELOG.md

README.md

conf/modules.config

conf/test.config

modules/local/modules/generate_reports/main.nf

modules/local/modules/plot_consensus_peaks/main.nf

modules/local/samtools_custom_view.nf

subworkflows/nf-core/mark_duplicates_picard.nf

tests/_template.yml

Release Updates

v2.0 updates

Tamara Hodgetts and others added 30 commits December 2, 2021 16:32

testing

fde40cd

testing

dff0322

testing

c5d9625

testing

62621f0

change curly brackets

7be8d53

Removed comment

d51a9f4

removed comments

660795e

removed comment

a3d0ec5

Removed macs_fdr parameter

18d0d76

Corrected bed file naming

299fcdb

Testing

05b0863

testing

bb4ec4a

Checking channel content

aa4acb7

Checking channel content

fd82555

Checking file size

14e0179

Viewing channel content

46549c3

Viewing channel content

2c763bd

Viewing channel content

eb80a17

Re-arranged code

75f97f0

commented out code

5cdf694

removed comments

b01ba19

Included latest version of Nextflow in testing

87a700a

Template update for nf-core/tools version 2.2

95c78a8

Merge branch 'dev' into peakcallers

890ddda

Whitespace fix

ed90be7

Added run narrow peak param

cb56b02

Added handling for bt2l files in bowtie2

0e2da5b

Merge branch 'dev' into peakcallers

85d425f

Whitespace trim

fde9d74

Commenting

a0fa1f4

chris-cheshire added 5 commits May 25, 2022 10:12

Corrected markdown

e118510

Commenting and doc updates

a4ccca9

Macs2 will now use the igenomes genome parameter

2f7e499

Commenting update

8246b09

Merge pull request #127 from luslab/dev

5165e83

Release updates

chris-cheshire requested a review from alexthiery May 26, 2022 10:37

dshinzie reviewed May 26, 2022

modules/local/linux/awk_script.nf (outdated)

drpatelh and others added 2 commits May 27, 2022 10:08

Replace gawk containers with ubuntu

c48d547

Merge pull request #128 from drpatelh/ubuntu

89a1af5

Replace gawk containers with ubuntu

FriederikeHanssen requested changes May 27, 2022

chris-cheshire added 12 commits May 29, 2022 11:21

Merge remote-tracking branch 'nf-core/dev' into dev

ceb8988

Updated to old zenodo

8af48a4

removed static reports folder

892b5ac

Modules config now uses "${params.publish_dir_mode}" instead of copy

30707ac

Updated version output

eb69e86

Updated multiqc version

ce16be3

Updated versions

cb1b89a

Updated samtools version

e58873c

Merge pull request #129 from luslab/dev

b5c19c1

Release Updates

Added versioning to cut and sort

f71699a

Added more versioning into pipeline

1e27946

Merge pull request #130 from luslab/dev

dbaad8c

v2.0 updates

chris-cheshire requested a review from FriederikeHanssen June 8, 2022 11:19

FriederikeHanssen approved these changes Jun 8, 2022

chris-cheshire added 5 commits June 8, 2022 12:29

Comment remove

f3005ef

updated all the test data paths

9c8abf9

bumped version to 2.0

90b5200

Updated modules

ca92a9b

Merge pull request #131 from luslab/dev

f02083e

v2.0 updates

chris-cheshire merged commit 971984a into master Jun 8, 2022
