天元 MegEngine v1.2.0 Released

Chai · 2021-01-18 11:04

Because PyPI limits the size of a project's wheel packages, Windows users need to use the following pip command to download and install from the 天元 MegEngine website:

pip3 install megengine -f https://megengine.org.cn/whl/mge.html

The list of the corresponding .whl file URLs is at: https://megengine.org.cn/whl/mge.html

Users on other systems can download and install normally without adding the -f https://megengine.org.cn/whl/mge.html parameter.
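The platform rule above can be sketched in a few lines of Python for use in a setup script. This is only an illustration: the helper name build_install_command is hypothetical, and it invokes pip as "python -m pip" instead of pip3, which is an assumption about the environment and not part of the official instructions.

```python
import platform
import sys

# Wheel listing hosted on the MegEngine site (URL taken from the post above).
MEGENGINE_WHEEL_INDEX = "https://megengine.org.cn/whl/mge.html"


def build_install_command(system=None):
    """Return the pip argument list for installing MegEngine on `system`.

    `system` follows platform.system() naming ("Windows", "Linux", "Darwin").
    This helper is hypothetical and only mirrors the rule stated in the post.
    """
    if system is None:
        system = platform.system()
    cmd = [sys.executable, "-m", "pip", "install", "megengine"]
    if system == "Windows":
        # Windows wheels are fetched from the MegEngine site via -f.
        cmd += ["-f", MEGENGINE_WHEEL_INDEX]
    return cmd


if __name__ == "__main__":
    # Print the command for the current platform without running it.
    print(" ".join(build_install_command()))
```

The resulting list can be passed to subprocess.run if the script should perform the installation itself.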

Bug fixes

Fix errors reported by ASAN
Fix cross-compute-node copy on Cambricon
Fix out of memory error caused by profiling
Fix a memory leak on Cambricon
Fix out of memory error during distributed training due to the incorrect setting of CUDA environment variables
Fix tensor split
Reduce the memory usage of ARM testcase
Reduce the memory usage of Fastrun
Fix the issue where the batch size specified when dumping an Atlas model exceeds the model's maximum batch size
Fix the problem that MLIR cannot handle different shapes correctly
Fix a dangling pointer when MLIR executes CUDA
Fix the weight pre-processing to handle ConvBias without bias correctly
Fix broken logs caused by a second crash while printing the error stack

Optimizations

Optimize common video detection networks by fusing pre-processing
Optimize performance by fusing DimShuffle and Reformat with Convolution
Fuse WarpPerspective with DimShuffle
Improve performance by rewriting tensor, differentiation, and trace in C++
Refactor the differentiation rules of some operators to reduce memory usage
Optimize QAT and TQT quantization training in terms of both performance and memory usage
Adjust the CUDA chanwise Convolution algorithm selection strategy
Optimize the performance of NCHW32 pooling operator
Optimize the performance of CallbackCaller operator
Optimize CUDA IO communication

Chai · 2021-01-19 02:09

Everyone is welcome to try it out.

feynman1999 · 2021-01-19 10:03

Enthusiastic support!!