@KOKOSde KOKOSde commented Jan 31, 2026

Summary

Update the low-precision training docs to clarify MS-AMP compatibility issues (older CUDA versions, MSCCL/NCCL conflicts) and to emphasize TransformersEngine/torchao for new FP8 projects.

Changes

  • Add practical notes about container/CUDA constraints and NCCL symbol conflicts.
  • Mark MS-AMP as "legacy" in intro text.

Test plan

  • N/A (documentation-only change)

Ref: #3639

Add practical notes about CUDA/container and MSCCL/NCCL issues, and emphasize TE/torchao for new FP8 projects.

Ref: huggingface#3639
Co-authored-by: Cursor <cursoragent@cursor.com>