[2025秋季][T1-1-7] GreenHandHand #776
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
描述
Infinicore 赛题 T1-1-7,算子 logsumexp, lp_pool1d, lp_pool2d, lp_pool3d, max 的 cpu 实现,gpu 部分实现使用 ninetoothed,在 ntops 的 pr 中。
值得陈述的技术细节
技术细节
ceil_mode=True时行为存在不一致,这里采用了与 pytorch 结果一致的实现方式。具体见 lp_poolnd 的 kernel 代码。一些问题
运行截图
cpu
nvidia
摩尔线程
摩尔线程平台自带的 max 算子在不连续张量上的实现存在问题,且不支持 global max,因此这里跳过了一些测试。
沐曦
天数
HONOR_CODE
HONOR_CODE.md
REFERENCE
pytorch 官方文档、triton 官方文档。