feat: Add section validation warnings for recommended manuscript sections by paxcalpt · Pull Request #275 · HenriquesLab/rxiv-maker

paxcalpt · 2025-12-24T12:03:14Z

Summary

This PR adds a new that checks manuscripts for standard sections commonly required by journals and issues warnings (not errors) when sections are missing.

Changes

New Validator

File:
Checks for 6 recommended sections:
- Data Availability
- Code Availability
- Author Contributions
- Acknowledgements
- Funding
- Competing Interests

Integration

Added to validation workflow in Version: ImageMagick 7.1.2-11 Q16-HDRI aarch64 23470 https://imagemagick.org
Copyright: (C) 1999 ImageMagick Studio LLC
License: https://imagemagick.org/script/license.php
Features: Cipher DPC HDRI Modules OpenMP
Delegates (built-in): bzlib cairo fontconfig freetype heic jng jp2 jpeg jxl lcms lqr ltdl lzma openexr png raw rsvg tiff uhdr webp xml zip zlib zstd
Compiler: clang (17.0.0)
Usage: import [options ...] [ file ]

Image Settings:
-adjoin join images into a single multi-image file
-border include window border in the output image
-channel type apply option to select image channels
-colorspace type alternate image colorspace
-comment string annotate image with comment
-compress type type of pixel compression when writing the image
-define format:option
define one or more image format options
-density geometry horizontal and vertical density of the image
-depth value image depth
-descend obtain image by descending window hierarchy
-display server X server to contact
-dispose method layer disposal method
-dither method apply error diffusion to image
-delay value display the next image after pausing
-encipher filename convert plain pixels to cipher pixels
-endian type endianness (MSB or LSB) of the image
-encoding type text encoding type
-filter type use this filter when resizing an image
-format "string" output formatted image characteristics
-frame include window manager frame
-gravity direction which direction to gravitate towards
-identify identify the format and characteristics of the image
-interlace type None, Line, Plane, or Partition
-interpolate method pixel color interpolation method
-label string assign a label to an image
-limit type value Area, Disk, Map, or Memory resource limit
-monitor monitor progress
-page geometry size and location of an image canvas
-pause seconds seconds delay between snapshots
-pointsize value font point size
-quality value JPEG/MIFF/PNG compression level
-quiet suppress all warning messages
-regard-warnings pay attention to warning messages
-repage geometry size and location of an image canvas
-respect-parentheses settings remain in effect until parenthesis boundary
-sampling-factor geometry
horizontal and vertical sampling factor
-scene value image scene number
-screen select image from root window
-seed value seed a new sequence of pseudo-random numbers
-set property value set an image property
-silent operate silently, i.e. don't ring any bells
-snaps value number of screen snapshots
-support factor resize support: > 1.0 is blurry, < 1.0 is sharp
-synchronize synchronize image to storage device
-taint declare the image as modified
-transparent-color color
transparent color
-treedepth value color tree depth
-verbose print detailed information about the image
-virtual-pixel method
Constant, Edge, Mirror, or Tile
-window id select window with this id or name
root selects whole screen

Image Operators:
-annotate geometry text
annotate the image with text
-colors value preferred number of colors in the image
-crop geometry preferred size and location of the cropped image
-encipher filename convert plain pixels to cipher pixels
-extent geometry set the image size
-geometry geometry preferred size or location of the image
-help print program options
-monochrome transform image to black and white
-negate replace every pixel with its complementary color
-quantize colorspace reduce colors in this colorspace
-resize geometry resize the image
-rotate degrees apply Paeth rotation to the image
-strip strip image of all profiles and comments
-thumbnail geometry create a thumbnail of the image
-transparent color make this color transparent within the image
-trim trim image edges
-type type image type

Miscellaneous Options:
-debug events display copious debugging information
-help print program options
-list type print a list of supported option arguments
-log format format of debugging information
-version print version information

By default, 'file' is written in the MIFF image format. To
specify a particular image format, precede the filename with an image
format name and a colon (i.e. ps:image) or specify the image type as
the filename suffix (i.e. image.ps). Specify 'file' as '-' for
standard input or output.

Runs as first validator ("Structure" validation)
Flexible pattern matching supports multiple section name variations

Behavior

Warning Level: Issues warnings, not errors
- Doesn't block PDF generation
- Helps authors ensure manuscript completeness
Smart Matching: Recognizes variations like:
- "Acknowledgements" or "Acknowledgments"
- "Competing Interests" or "Conflicts of Interest"
- "Data and Code Availability" (combined section)

Testing

Tested on both manuscripts:

✅ rxiv-maker manuscript: All sections present, no warnings
✅ maicrobe manuscript: Missing Acknowledgements, warning issued correctly

Example output:

⚠️ WARNING (1):

  1. Recommended section 'Acknowledgements' not found
     📄 /path/to/MANUSCRIPT/01_MAIN.md
     💡 Suggestion: Add an '## Acknowledgements' section to thank contributors

Benefits

Helps authors catch missing sections before journal submission
Aligns with common journal requirements
Non-blocking (warnings only) allows flexibility
Clear suggestions guide authors to add missing sections

…ions - New SectionValidator checks for standard sections required by most journals - Warns (doesn't error) if sections are missing: • Data Availability • Code Availability • Author Contributions • Acknowledgements • Funding • Competing Interests - Helps authors ensure manuscripts are complete before submission - Integrated into 'rxiv validate' command as first validator in chain - Flexible pattern matching supports multiple section naming variations

github-actions · 2025-12-24T12:06:42Z

Code Review: Section Validation Warnings

Thank you for this contribution! This is a valuable feature that aligns well with journal requirements.

Strengths

Well-designed validator pattern following BaseValidator architecture
Appropriate use of ValidationLevel.WARNING instead of ERROR
Flexible pattern matching for section name variations
Clear, actionable suggestion messages
Good metadata tracking
Clean code structure

CRITICAL: Missing Test Coverage (Severity: High)

The PR adds a new validator but includes no unit tests. According to CLAUDE.md, the project aims for 80% test coverage.

Required: tests/unit/test_section_validator.py should include:

Test with all sections present
Test with missing sections
Test pattern matching variations
Test combined sections
Test error handling for missing/unreadable files
Test metadata tracking
Case-insensitive matching verification

See tests/unit/test_doi_validator.py for validator test patterns.

Enhancement: Consider Using ErrorCode Enum (Severity: Medium)

Other validators use ErrorCode from core.error_codes. Your code uses the deprecated _create_error() method. Consider defining section-specific error codes for consistency (e.g., MISSING_SECTION).

See figure_validator.py:77 and citation_validator.py:74 for examples.

Pattern Matching: Edge Cases (Severity: Low)

Question: Is it intentional to only match h2 headers and not h3? This might be worth documenting.

Your regex handles multiple spaces and case variations correctly.

Security Review: PASS

Uses safe file reading via _read_file_safely()
No user input in regex
No shell commands or file mutations
Appropriate error handling

Before Merging (Required)

Add comprehensive unit tests
Run: nox -s test
Run: nox -s lint
Run: mypy src/rxiv_maker/validators/section_validator.py

Overall Assessment: Conditional Approval

The implementation is solid and well-designed, but missing test coverage is a blocker. Add tests, then re-request review.

Great work on the feature design! The validator pattern is well-executed and will be genuinely useful for authors.

Copilot

Pull request overview

This PR adds a new SectionValidator that checks manuscripts for six standard sections commonly required by journals (Data Availability, Code Availability, Author Contributions, Acknowledgements, Funding, and Competing Interests). The validator issues non-blocking warnings when recommended sections are missing, helping authors ensure manuscript completeness before journal submission.

Key Changes:

New section validation system with flexible pattern matching supporting multiple section name variations
Integration as the first validator in the validation workflow ("Structure" validation)
Smart handling of combined sections (e.g., "Data and Code Availability")

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 5 comments.

File	Description
`src/rxiv_maker/validators/section_validator.py`	New validator class that checks for six recommended manuscript sections using regex patterns and generates warnings for missing sections
`src/rxiv_maker/engines/operations/validate.py`	Integrates SectionValidator into the validation workflow as the first validator, with proper imports in both relative and absolute import blocks

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-24T12:06:53Z