Contributor

@sywangyi sywangyi commented Dec 5, 2025

What does this PR do?

Enables row-wise FP8 quantization for XPU. Please help review.

Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
Contributor

github-actions bot commented Dec 5, 2025

[For maintainers] Suggested jobs to run (before merge)

run-slow: fbgemm_fp8

Contributor

@MekkCyber MekkCyber left a comment

Thanks @sywangyi for your work! I think it would be better to contribute this kernel to kernels-community and use it from there, since we are trying to move away from keeping kernels inside transformers.
