rocm-hip on amdgpu入门示例
run
"python build.py [hip/asm] [v2/v3] [llvm/hcc]"
under each project folder, the exacutable will generated under ./out path.
1. smem读写
2. flat读写
3. mubuf读写
4. lds读写
5. group间条件跳转
6. thread间条件执行
7. packed float16指令
8. dpp指令
9. permute指令
10. mfma指令