87 commits
a5af10e
adding mskcc modules
Apr 13, 2023
9edde93
adding test yaml for mskccprepnucleo
Apr 13, 2023
f07a6de
name change
Apr 20, 2023
043bec3
removing redundant channel
Apr 20, 2023
a830c37
re-doing first access subworkflow
Apr 27, 2023
6503b00
clean up old workflow, after re-name
May 2, 2023
db72d2c
try fork syncing
May 9, 2023
00fa7d7
Merge branch 'develop' into feature/extractumi
buehlere May 9, 2023
fc28f6e
experimenting with auto sync
May 9, 2023
d669e02
Merge branch 'feature/extractumi' of https://github.com/mskcc-omics-w…
May 9, 2023
7c9816d
testing gitactions
May 9, 2023
228c772
gitactions test
May 9, 2023
2d70a15
Merge pull request #4 from nf-core/master
buehlere May 9, 2023
d612e60
gitactions experiment
May 9, 2023
fde2b5d
Merge branch 'feature/extractumi' of https://github.com/mskcc-omics-w…
May 9, 2023
a78e7ea
trying to fix gitactions
May 9, 2023
06b27e0
Create sync-fork.yml
May 9, 2023
f2c759d
Update sync-fork.yml
May 9, 2023
7ff8837
Update sync-fork.yml
May 9, 2023
03ede3c
Update sync-fork.yml
May 9, 2023
37ad4bd
Merge branch 'master' of https://github.com/mskcc-omics-workflows/mod…
May 9, 2023
879199a
Update sync-fork.yml
May 9, 2023
c21e35b
Update sync-fork.yml
May 9, 2023
cd09579
Update sync-fork.yml
May 9, 2023
31331e4
Update sync-fork.yml
May 9, 2023
cf56e9b
Merge branch 'master' into feature/extractumi
May 9, 2023
9a1f635
remove output folders
May 9, 2023
be48046
Update sync-fork.yml
May 10, 2023
caa71e9
Revert "Update sync-fork.yml"
May 10, 2023
e0c8587
Create sync-action.yml
May 10, 2023
0877f99
Merge pull request #10 from nf-core/master
buehlere May 10, 2023
6b5a9c2
Merge pull request #11 from nf-core/master
buehlere May 12, 2023
d65303a
Merge pull request #12 from nf-core/master
buehlere May 13, 2023
ff7519d
Merge pull request #13 from nf-core/master
buehlere May 15, 2023
ff6690a
Merge pull request #14 from nf-core/master
buehlere May 16, 2023
37ceaf0
Merge pull request #15 from nf-core/master
buehlere May 17, 2023
441d2d9
remove this for separate PR
May 17, 2023
95ef62c
Merge branch 'master' into feature/extractumi
May 17, 2023
eaef95a
clean up for extractumi
May 17, 2023
4c663a3
Merge branch 'master' into develop
May 17, 2023
7215646
Update sync-action.yml
May 17, 2023
ab7ba98
Delete sync-fork.yml
May 17, 2023
13d45de
Merge pull request #16 from nf-core/master
buehlere May 17, 2023
12be703
Merge branch 'develop' into feature/extractumi
May 17, 2023
35baf68
Update sync-action.yml
May 17, 2023
f2f4e04
Update sync-action.yml
May 17, 2023
5912f29
Revert "Merge pull request #16 from nf-core/master"
May 17, 2023
668223d
Merge pull request #17 from nf-core/master
buehlere May 18, 2023
d3d728d
Update sync-action.yml
May 18, 2023
1b166e5
Merge pull request #20 from nf-core/master
buehlere May 18, 2023
4b5b4ba
Merge branch 'develop' into feature/extractumi
May 18, 2023
e49802d
cleanup
May 18, 2023
98ecd46
Merge pull request #23 from nf-core/master
buehlere May 22, 2023
e3e34fa
Merge pull request #25 from nf-core/master
buehlere May 23, 2023
a49348f
Merge pull request #27 from nf-core/master
buehlere May 24, 2023
bf8e125
Merge pull request #29 from nf-core/master
buehlere May 25, 2023
fa5bb4d
Trigger github actions when PR is made against develop
anoronh4 May 25, 2023
bdb938f
split github workflows into different files for running on develop PRs
anoronh4 May 25, 2023
4a27c56
Merge pull request #32 from nf-core/master
buehlere May 26, 2023
78a953f
Disable sentieon testing in CI pytests for PRs against develop
anoronh4 May 26, 2023
3c06e80
changed names of all github action workflows that trigger in PRs agai…
anoronh4 May 26, 2023
2c128f2
Merge pull request #34 from nf-core/master
buehlere May 27, 2023
2d80a5a
Merge pull request #36 from nf-core/master
buehlere May 30, 2023
c7e7c4e
Merge pull request #38 from nf-core/master
buehlere May 31, 2023
8485cc7
Merge pull request #40 from nf-core/master
buehlere Jun 1, 2023
365d9eb
Merge pull request #42 from nf-core/master
buehlere Jun 2, 2023
320b705
Merge pull request #44 from nf-core/master
buehlere Jun 3, 2023
0b85cd9
Merge pull request #46 from nf-core/master
buehlere Jun 5, 2023
84aa571
Merge pull request #48 from nf-core/master
buehlere Jun 6, 2023
ea666e8
Merge pull request #50 from nf-core/master
buehlere Jun 8, 2023
c1b05ee
Merge pull request #30 from mskcc-omics-workflows/enhancement/run_wor…
buehlere Jun 8, 2023
b82a132
Merge branch 'develop' into feature/extractumi
Jun 8, 2023
e32f380
prettier formatting
Jun 8, 2023
c588273
fix formatting for linters
Jun 8, 2023
601c863
removing tools-test-dataset
Jun 8, 2023
a3ae978
re-doing md5 sum
Jun 8, 2023
10d52c3
Update test.yml
Jun 8, 2023
a804e7c
redoing md5sums
Jun 8, 2023
ad64d4c
Update main.nf
Jun 9, 2023
e8724f1
Merge branch 'develop' into feature/extractumi
Aug 8, 2023
73fedb5
Delete sync-action.yml
Aug 8, 2023
1902b7f
Merge branch 'develop' into feature/extractumi
Aug 8, 2023
0b5ff75
remove poorly synced files
Aug 8, 2023
053d201
more poorly synced cleanup
Aug 8, 2023
75a0cfe
name change
Aug 9, 2023
ac38f09
removing old name workflow
Aug 9, 2023
d867082
Update pytest_modules.yml
Aug 9, 2023
27 changes: 27 additions & 0 deletions install_data.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,27 @@
#!/usr/bin/env bash


# Test data is hosted on Google Drive at:
# https://drive.google.com/file/d/1GtT8jsBGwRoQC-5wHh06r8RFkiFBuirp/view?usp=sharing

fileid=1GtT8jsBGwRoQC-5wHh06r8RFkiFBuirp

filename=test_nucleo.tar.gz
foldername=test_nucleo

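# Skip the download if the test data is already present
[[ -f "$filename" ]] && exit 0
[[ -d "$foldername" ]] && exit 0

# First request: save the cookies Google Drive returns for this file
curl -c ./cookie -s -k -L "https://drive.google.com/uc?export=download&id=${fileid}" > /dev/null

# Second request: pass the confirmation token read from the cookie file
confirm=$(awk '/download/ {print $NF}' ./cookie)
curl -k -Lb ./cookie "https://drive.google.com/uc?export=download&confirm=${confirm}&id=${fileid}" -o "${filename}"

# Suppress GNU tar warnings about unknown extended header keywords in tar.gz files created on macOS
if [[ "$OSTYPE" == "linux-gnu"* ]]; then
    tar --warning=no-unknown-keyword -xzvf "$filename"
else
    # macOS (darwin) and any other platform: plain extraction, so the
    # archive is never deleted below without having been unpacked
    tar -xzvf "$filename"
fi

rm "$filename"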
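The OS check in install_data.sh can be pulled into a small function so it can be exercised without downloading anything. This is a minimal sketch, not part of the PR: the helper name `tar_extract_args` is hypothetical, and the fallback branch for platforms other than Linux is an assumption about the desired behaviour.

```shell
# Hypothetical helper (not in the PR): print the tar arguments to use
# for a given $OSTYPE value.
tar_extract_args() {
    case "$1" in
        linux-gnu*)
            # GNU tar: silence warnings about unknown extended header
            # keywords found in archives created on macOS.
            echo "--warning=no-unknown-keyword -xzvf"
            ;;
        *)
            # macOS (darwin*) and anything else: plain extraction flags.
            echo "-xzvf"
            ;;
    esac
}

# Usage (word splitting of the helper output is intentional):
#   tar $(tar_extract_args "$OSTYPE") "$filename"
```

Keeping the decision in one function means an unrecognised `OSTYPE` still extracts the archive before it is removed.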
@@ -0,0 +1,68 @@
// TODO nf-core: If in doubt look at other nf-core/subworkflows to see how we are doing things! :)
// https://github.com/nf-core/modules/tree/master/subworkflows
// You can also ask for help via your pull request or on the #subworkflows channel on the nf-core Slack workspace:
// https://nf-co.re/join
// TODO nf-core: A subworkflow SHOULD import at least two modules

include { FGBIO_FASTQTOBAM } from '../../../modules/nf-core/fgbio/fastqtobam/main'
include { PICARD_MERGESAMFILES } from '../../../modules/nf-core/picard/mergesamfiles/main'
include { GATK4_SAMTOFASTQ } from '../../../modules/nf-core/gatk4/samtofastq/main'
include { FASTP } from '../../../modules/nf-core/fastp/main'

workflow FASTQ_EXTRACTUMI_FGBIO_PICARD_GATK4_FASTP {

    take:
    // TODO nf-core: edit input (take) channels
    ch_fastq // channel: [ val(meta), [ fastq ] ]

    main:

    ch_versions = Channel.empty()

    // FGBIO_FASTQTOBAM: convert each FASTQ input to an unmerged BAM.
    // Each item emitted by ch_fastq is handled as a separate task, so the
    // conversions can run in parallel:
    // https://www.nextflow.io/docs/latest/faq.html?highlight=parallel
    FGBIO_FASTQTOBAM (
        ch_fastq
    )
    // Gather all per-sample BAMs into a single [ meta, [ bams ] ] item
    FGBIO_FASTQTOBAM.out.bam
        .map { meta, bam -> [ bam ] }
        .collect()
        .map { bams -> [ [id: 'unmerged_bams'], bams ] }
        .set { unmerged_bams }
    ch_versions = ch_versions.mix(FGBIO_FASTQTOBAM.out.versions) // record tool versions

    // PICARD_MERGESAMFILES: merge the BAM files into one
    PICARD_MERGESAMFILES (
        unmerged_bams
    ).bam
        .map { meta, bam -> [ [id: 'merged_bam'], bam ] }
        .set { merged_bam }
    ch_versions = ch_versions.mix(PICARD_MERGESAMFILES.out.versions)

    // GATK4_SAMTOFASTQ: get FASTQ files from the merged BAM
    GATK4_SAMTOFASTQ (
        merged_bam
    ).fastq
        .map { meta, fastq -> [ [id: 'merged_fastq'], fastq ] }
        .set { merged_fastq }
    ch_versions = ch_versions.mix(GATK4_SAMTOFASTQ.out.versions)

    // FASTP: run fastp on the merged FASTQ files
    FASTP (
        merged_fastq, [], false, false
    )
    ch_versions = ch_versions.mix(FASTP.out.versions)

    // final emit
    emit:
    // TODO nf-core: edit emitted channels
    bam      = FASTP.out.reads // channel: [ val(meta), [ reads ] ]; FASTQ reads from fastp, despite the name
    versions = ch_versions     // channel: [ versions.yml ]

}
@@ -0,0 +1,48 @@
name: "fastq_extractumi_fgbio_picard_gatk4_fastp"
description: Convert FASTQ files to BAM with fgbio, merge the BAMs with Picard, convert the merged BAM back to FASTQ with GATK4, and run fastp on the resulting reads
keywords:
  - fastq
  - bam
  - umi
  - merge
modules:
  - fgbio/fastqtobam
  - picard/mergesamfiles
  - gatk4/samtofastq
  - fastp
input:
  - meta:
      type: map
      description: |
        Groovy Map containing sample information
        e.g. [ id:'test', single_end:false ]
  - fastq:
      type: file
      description: FASTQ file(s) for one sample
      pattern: "*.fastq.gz"
output:
  - meta:
      type: map
      description: |
        Groovy Map containing sample information
        e.g. [ id:'merged_fastq' ]
  - bam:
      type: file
      description: Reads output by fastp (FASTP.out.reads), emitted under the name `bam`
      pattern: "*.fastq.gz"
  - versions:
      type: file
      description: File containing software versions
      pattern: "versions.yml"
authors:
  - "@buehlere"
4 changes: 4 additions & 0 deletions tests/config/pytest_modules.yml
@@ -3736,6 +3736,10 @@ subworkflows/fastq_download_prefetch_fasterqdump_sratools:
- subworkflows/nf-core/fastq_download_prefetch_fasterqdump_sratools/**
- tests/subworkflows/nf-core/fastq_download_prefetch_fasterqdump_sratools/**

subworkflows/fastq_extractumi_fgbio_picard_gatk4_fastp:
- subworkflows/nf-core/fastq_extractumi_fgbio_picard_gatk4_fastp/**
- tests/subworkflows/nf-core/fastq_extractumi_fgbio_picard_gatk4_fastp/**

subworkflows/fastq_fastqc_umitools_fastp:
- subworkflows/nf-core/fastq_fastqc_umitools_fastp/**
- tests/subworkflows/nf-core/fastq_fastqc_umitools_fastp/**
@@ -0,0 +1,26 @@
#!/usr/bin/env nextflow

nextflow.enable.dsl = 2

include { FASTQ_EXTRACTUMI_FGBIO_PICARD_GATK4_FASTP } from '../../../../subworkflows/nf-core/fastq_extractumi_fgbio_picard_gatk4_fastp/main.nf'


workflow test_fastq_extractumi_fgbio_picard_gatk4_fastp {
    // Download the test data by running install_data.sh
    // (path is relative to the current working directory)
    def bashScriptFile = new File('install_data.sh')

    def processBuilder = new ProcessBuilder('bash', bashScriptFile.toString())
    processBuilder.redirectOutput(ProcessBuilder.Redirect.INHERIT)
    processBuilder.redirectError(ProcessBuilder.Redirect.INHERIT)

    def process = processBuilder.start()
    // Stop early if the download script fails
    if (process.waitFor() != 0) {
        error "install_data.sh failed"
    }

    // Each list item becomes one channel element, so the samples can be processed in parallel:
    // https://www.nextflow.io/docs/latest/faq.html?highlight=parallel
    fastq = [
        [[id:'gene1', single_end:false], [file('test_nucleo/fastq/seracare_0-5_R1_001ad.fastq.gz'), file('test_nucleo/fastq/seracare_0-5_R2_001ad.fastq.gz')]],
        [[id:'gene2', single_end:false], [file('test_nucleo/fastq/seracare_0-5_R1_001ae.fastq.gz'), file('test_nucleo/fastq/seracare_0-5_R2_001ae.fastq.gz')]]
    ]
    ch_fastq = Channel.fromList(fastq)
    FASTQ_EXTRACTUMI_FGBIO_PICARD_GATK4_FASTP ( ch_fastq )
}
@@ -0,0 +1,5 @@
process {

    publishDir = { "${params.outdir}/${task.process.tokenize(':')[-1].tokenize('_')[0].toLowerCase()}" }

}
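The `publishDir` closure above derives the output subfolder from the process name: it takes the segment after the last `:`, keeps the part before the first `_`, and lowercases it. The sketch below approximates that expression in shell to show which folder each module in this subworkflow would publish to. The function name `publish_subdir` is hypothetical, and the fully qualified process name used in the example is an assumed illustration of what `task.process` might look like.

```shell
# Shell approximation of the Groovy expression
#   task.process.tokenize(':')[-1].tokenize('_')[0].toLowerCase()
# publish_subdir is a hypothetical name used only for this illustration.
publish_subdir() {
    local process_name="$1"
    local last_segment="${process_name##*:}"   # text after the last ':'
    local tool="${last_segment%%_*}"           # text before the first '_'
    printf '%s\n' "$tool" | tr '[:upper:]' '[:lower:]'
}

# Assumed example of a fully qualified process name:
publish_subdir "FASTQ_EXTRACTUMI_FGBIO_PICARD_GATK4_FASTP:FGBIO_FASTQTOBAM"   # prints "fgbio"
```

Under this rule the four modules would publish to `fgbio`, `picard`, `gatk4`, and `fastp` subfolders of `params.outdir`.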
@@ -0,0 +1,27 @@
- name: fastq_extractumi_fgbio_picard_gatk4_fastp test_fastq_extractumi_fgbio_picard_gatk4_fastp
  command: nextflow run ./tests/subworkflows/nf-core/fastq_extractumi_fgbio_picard_gatk4_fastp -entry test_fastq_extractumi_fgbio_picard_gatk4_fastp -c ./tests/config/nextflow.config
  tags:
    - fastp
    - fgbio
    - fgbio/fastqtobam
    - gatk4
    - gatk4/samtofastq
    - picard
    - picard/mergesamfiles
    - subworkflows
    - subworkflows/fastq_extractumi_fgbio_picard_gatk4_fastp
  files:
    - path: output/bwa/bwa/chr14_chr16.amb
      md5sum: 00fb74627e074db6238dcd9bc08dc48a
    - path: output/bwa/bwa/chr14_chr16.ann
      md5sum: d8825e2fcb3cd372cd61ededfe283025
    - path: output/bwa/bwa/chr14_chr16.bwt
      md5sum: 45637ec2c011d0f73cac6c470c5b5d2b
    - path: output/bwa/bwa/chr14_chr16.pac
      md5sum: 46f856371d59e859295497c967478d31
    - path: output/bwa/bwa/chr14_chr16.sa
      md5sum: 466dbbbce2fb9528e760477ccdc2ea5b
    - path: output/bwa/gene.bam
      md5sum: d7c5943b79704d8ed7f432786738f25d
    - path: output/picard/aligned_bam.bam
      md5sum: 89acecb9fcb99f9182a417215489ea50
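The `files` entries above pair an output path with its MD5 checksum. A minimal sketch of how such entries could be regenerated from the files a test run actually produces, assuming the outputs are byte-for-byte reproducible between runs. Both helper names (`md5_of`, `yml_entry`) are hypothetical and not part of the PR.

```shell
# Hypothetical helper: print the MD5 checksum of a file, using whichever
# tool is available (md5sum on Linux, md5 on macOS).
md5_of() {
    if command -v md5sum >/dev/null 2>&1; then
        md5sum "$1" | awk '{print $1}'
    else
        md5 -q "$1"
    fi
}

# Hypothetical helper: print one test.yml "files" entry for an output file.
yml_entry() {
    printf '    - path: %s\n      md5sum: %s\n' "$1" "$(md5_of "$1")"
}

# Usage, after a test run has written its results under output/:
#   find output -type f | sort | while read -r f; do yml_entry "$f"; done
```

Generating the entries from the real output directory keeps the listed paths in line with what the subworkflow publishes.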