Skip to content

Genopred PGS rule issue #166

@shrivastavabhishek

Description

@shrivastavabhishek

Hi Team,

I was trying generate the pgs scrore based on the external published GWAS sumstats data. Below is the format of input which I am using to run the genopred pipeline: This gwas sumstats has only 2955 number of SNPs.

CHR BP SNP A2 A1 N P BETA SE
6 32444703 chr6:32444703:G:A G A 19048 5.57E-14 -0.235722334 0.028910379
6 32444762 chr6:32444762:A:G A G 19048 5.57E-14 -0.235722334 0.028910379
6 32444794 chr6:32444794:A:T A T 19048 2.28E-12 -0.274436846 0.036744737

and is failing.

I have also check the overlap rsID with hammap_snplist and only 36 SNPs are overlapping.

I am not sure why it is breaking. Below are the error log for your review:

  1. reading list of SNPs for allele merge from /tmp/RtmpW4Ovwt/hm3_no_mhc.snplist Read 1215912 SNPs for allele merge. Reading sumstats from /tmp/RtmpW4Ovwt/sumstats.gz into memory 5000000 SNPs at a time. . done Read 36 SNPs from --sumstats file. Removed 23 SNPs not in --merge-alleles. Removed 0 SNPs with missing values. Removed 0 SNPs with INFO <= 0.9. Removed 0 SNPs with MAF <= 0.01. Removed 0 SNPs with out-of-bounds p-values. Removed 0 variants that were not SNPs or were strand-ambiguous. 13 SNPs remain. Removed 0 SNPs with duplicated rs numbers (13 SNPs remain). Removed 0 SNPs with N < 12698.6666667 (13 SNPs remain). ERROR converting summary statistics: Traceback (most recent call last): File "/home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/munge_sumstats.py", line 701, in munge_sumstats check_median(dat.SIGNED_SUMSTAT, signed_sumstat_null, 0.1, sign_cname)

  2. Error in rule ldsc_i: jobid: 8 input: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ld_scores, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/ldsc.py, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/hm3_snplist/w_hm3.snplist output: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis.hsq_obs log: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log (check log file(s) for error details) conda-env: /tools/GenoPred/pipeline/.snakemake/conda/a5a8a0f260443effe6ae6b9313150c49_ shell: Rscript ../Scripts/ldsc/ldsc.R --ref_plink_chr /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ref/ref.chr --sumstats /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz --munge_sumstats /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/munge_sumstats.py --ldsc /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/ldsc.py --ld_scores /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ld_scores/UKBB.EUR.rsid --hm3_snplist /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/hm3_snplist/w_hm3.snplist --hm3_no_mhc T --sample_prev 0.5 --pop_prev 0.1 --output /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis --test NA > /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log 2>&1 (one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!) [Wed Feb 18 22:29:52 2026] Finished job 1. 2 of 6 steps (33%) done [Wed Feb 18 22:55:04 2026] Finished job 14. 3 of 6 steps (50%) done Shutting down, this might take some time. Exiting because a job execution failed. Look above for error message Complete log: .snakemake/log/2026-02-18T222940.351253.snakemake.log

  3. snakemake log:
    output: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis.hsq_obs
    log: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log
    jobid: 8
    benchmark: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/benchmarks/ldsc-GiantCellArthritis.txt
    reason: Missing output files: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis.hsq_obs; Input files updated by another job: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz
    wildcards: gwas=GiantCellArthritis
    resources: tmpdir=/tmp

Activating conda environment: .snakemake/conda/a5a8a0f260443effe6ae6b9313150c49_
[Wed Feb 18 22:29:48 2026]
Error in rule ldsc_i:
jobid: 8
input: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ld_scores, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/ldsc.py, /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/hm3_snplist/w_hm3.snplist
output: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis.hsq_obs
log: /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log (check log file(s) for error details)
conda-env: /tools/GenoPred/pipeline/.snakemake/conda/a5a8a0f260443effe6ae6b9313150c49_
shell:
Rscript ../Scripts/ldsc/ldsc.R --ref_plink_chr /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ref/ref.chr --sumstats /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/gwas_sumstat/GiantCellArthritis/GiantCellArthritis-cleaned.gz --munge_sumstats /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/munge_sumstats.py --ldsc /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/software/ldsc/ldsc.py --ld_scores /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/ld_scores/UKBB.EUR.rsid --hm3_snplist /home/ext_shrivastav_abhishekkumar_may/CTD_PRS/resources/data/hm3_snplist/w_hm3.snplist --hm3_no_mhc T --sample_prev 0.5 --pop_prev 0.1 --output /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/ldsc/GiantCellArthritis/GiantCellArthritis --test NA > /home/ext_shrivastav_abhishekkumar_may/GCA_PRS/GenoPredOut/reference/logs/ldsc-GiantCellArthritis.log 2>&1
(one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
Any help is highly appreciable
3.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions