-
Notifications
You must be signed in to change notification settings - Fork 29
Description
Hi Team,
I was trying generate the pgs scrore based on the external published GWAS sumstats data. Below is the format of input which I am using to run the genopred pipeline: This gwas sumstats has only 2955 number of SNPs.
CHR BP SNP A2 A1 N P BETA SE
6 32444703 chr6:32444703:G:A G A 19048 5.57E-14 -0.235722334 0.028910379
6 32444762 chr6:32444762:A:G A G 19048 5.57E-14 -0.235722334 0.028910379
6 32444794 chr6:32444794:A:T A T 19048 2.28E-12 -0.274436846 0.036744737
and is failing.
I have also check the overlap rsID with hammap_snplist and only 36 SNPs are overlapping.
I am not sure why it is breaking. Below are the error log for your review:
-
reading list of SNPs for allele merge from /tmp/RtmpW4Ovwt/hm3_no_mhc.snplist Read 1215912 SNPs for allele merge. Reading sumstats from /tmp/RtmpW4Ovwt/sumstats.gz into memory 5000000 SNPs at a time. . done Read 36 SNPs from --sumstats file. Removed 23 SNPs not in --merge-alleles. Removed 0 SNPs with missing values. Removed 0 SNPs with INFO <= 0.9. Removed 0 SNPs with MAF <= 0.01. Removed 0 SNPs with out-of-bounds p-values. Removed 0 variants that were not SNPs or were strand-ambiguous. 13 SNPs remain. Removed 0 SNPs with duplicated rs numbers (13 SNPs remain). Removed 0 SNPs with N < 12698.6666667 (13 SNPs remain). ERROR converting summary statistics: Traceback (most recent call last): File "/home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/munge_sumstats.py", line 701, in munge_sumstats check_median(dat.SIGNED_SUMSTAT, signed_sumstat_null, 0.1, sign_cname)
-
Error in rule ldsc_i: jobid: 8 input: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ld_scores, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/ldsc.py, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/hm3_snplist/w_hm3.snplist output: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis.hsq_obs log: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log (check log file(s) for error details) conda-env: /tools/GenoPred/pipeline/.snakemake/conda/a5a8a0f260443effe6ae6b9313150c49_ shell: Rscript ../Scripts/ldsc/ldsc.R --ref_plink_chr /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ref/ref.chr --sumstats /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz --munge_sumstats /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/munge_sumstats.py --ldsc /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/ldsc.py --ld_scores /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ld_scores/UKBB.EUR.rsid --hm3_snplist /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/hm3_snplist/w_hm3.snplist --hm3_no_mhc T --sample_prev 0.5 --pop_prev 0.1 --output /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis --test NA > /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log 2>&1 (one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!) [Wed Feb 18 22:29:52 2026] Finished job 1. 2 of 6 steps (33%) done [Wed Feb 18 22:55:04 2026] Finished job 14. 3 of 6 steps (50%) done Shutting down, this might take some time. Exiting because a job execution failed. Look above for error message Complete log: .snakemake/log/2026-02-18T222940.351253.snakemake.log
-
snakemake log:
output: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis.hsq_obs
log: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log
jobid: 8
benchmark: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/benchmarks/ldsc-GiantCellArthritis.txt
reason: Missing output files: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis.hsq_obs; Input files updated by another job: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz
wildcards: gwas=GiantCellArthritis
resources: tmpdir=/tmp
Activating conda environment: .snakemake/conda/a5a8a0f260443effe6ae6b9313150c49_
[Wed Feb 18 22:29:48 2026]
Error in rule ldsc_i:
jobid: 8
input: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ld_scores, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/ldsc.py, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/hm3_snplist/w_hm3.snplist
output: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis.hsq_obs
log: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log (check log file(s) for error details)
conda-env: /tools/GenoPred/pipeline/.snakemake/conda/a5a8a0f260443effe6ae6b9313150c49_
shell:
Rscript ../Scripts/ldsc/ldsc.R --ref_plink_chr /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ref/ref.chr --sumstats /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz --munge_sumstats /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/munge_sumstats.py --ldsc /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/ldsc.py --ld_scores /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ld_scores/UKBB.EUR.rsid --hm3_snplist /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/hm3_snplist/w_hm3.snplist --hm3_no_mhc T --sample_prev 0.5 --pop_prev 0.1 --output /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis --test NA > /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log 2>&1
(one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
Any help is highly appreciable
3.