-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature](mluOpLogcumsumexp):add new op #1027
base: master
Are you sure you want to change the base?
Conversation
|
||
及格: 算子 hw time 是竞品 v100 的 15 倍 | ||
|
||
- (此标准用于竞品实现是单算子实现, 若竞品使用算子拼接的方式实现, 需单独说明) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
收到
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
建议修改为上述贴的链接中的表格形式,表格后文字补充370S4 在上述规模下时间开销需保持在 *** 几倍范围内
而不是
“良好: 算子 hw time 是竞品 v100 的 8 倍 ”, 什么是V100 8倍, 是超过还是低于?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
该算子的典型规模有10个,两种数据类型一共有20行。这个表格放在测试报告里是否会更好?
test/mlu_op_gtest/pb_gtest/src/zoo/logcumsumexp/logcumsumexp.cpp
Outdated
Show resolved
Hide resolved
test/mlu_op_gtest/pb_gtest/src/zoo/logcumsumexp/test_case/case0.prototxt
Outdated
Show resolved
Hide resolved
test/mlu_op_gtest/pb_gtest/src/zoo/logcumsumexp/logcumsumexp.cpp
Outdated
Show resolved
Hide resolved
error_func: DIFF1 | ||
error_func: DIFF2 | ||
error_threshold: 0.003 | ||
error_threshold: 0.003 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
补充
- proto
- mlu_op.h
- docs/bangc-docs/user_guide/9_operators/index.rst 算子说明
- 完善测试报告 1. cpu case pass + generator 生成网络规模 + 其他随机规模的功能测试
- 完善测试报告 2. 性能测试结果
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
收到
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
还缺少
- op.h 接口注释
- docs/bangc-docs/user_guide/9_operators/index.rst 算子说明
- 测试报告
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
已补充
除了任务书的规模,有没有自己另外加一些规模生成GPU测例测试? 把运行成功的log也贴一下吧 |
* @par Data Layout | ||
* - ::MLUOP_LAYOUT_ARRAY | ||
* | ||
* @par Scale Limitation |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
dim参数有限制吧
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这个限制来自任务要求吗?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
是算子自身的限制, dim要在[ -input->dims, input->dims -1 ]范围内
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
已修改
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The parameter "dim" -> The value of \p dim
代码已完成,要求测例的自测均通过,等待评审