CLRadeonExtender 0.1.7

@matszpk matszpk released this 21 Feb 15:34
· 1184 commits to master since this release

CLRadeonExtender 0.1.7:

  • update AmdCL2ABI chapter
  • fixed kernel argument sizes in GalliumCompute binary format
  • add new GPU devices gfx902-gfx905
  • update device tables for Amd Crimson drivers
  • small fixes in DynLibrary interface
  • add relocations to GalliumCompute binary format (for scratch buffer symbols)
  • make getXXXDisasmInputFromBinaryXX a public interface
  • speed up evaluation of simple expressions without symbols
  • add '.for' and '.while' pseudo-ops ('for' and 'while' loops)
  • fixed some grammar/typos in CLRX documentation
  • add GPU device names from ROCm-OpenCL
  • handle new ROCm binary format with YAML metadata (assembler and disassembler)
  • add a few pseudo-ops for ROCm handling
  • add new pseudo-ops to set parameters in ROCm YAML metadata
  • fixes in GalliumCompute binary generator (for conformance with standards)
  • add '.reqd_work_group_size' pseudo-op (equivalent of '.cws')
  • add support for work_group_size_hint and vec_type_hint in Amd OpenCL 2.0 binary format
  • some small bug fixes in ROCm disassembler
  • updates in README.md and INSTALL files
  • small sanitizations in DisasmAmd, DisasmAmdCL2 (argument type checking)
  • change behaviour of '.cws' (.reqd_work_group_size) when setting default values
  • add calculation of section differences in expressions (for ROCm handling)
  • fixed invalid reads (potential segfault) after undefining a symbol
  • fixed a long-standing bug: when an expression is evaluated later, resolve the symbol
    using its new value (or do not resolve it at all if the symbol is undefined) instead
    of using the old unresolved symbol value
  • add GOT table handling in ROCm binary format
  • add new option '--newROCmBinFormat'
  • add untested support for ROCm in CLHelper and VectorAdd sample
  • add support for multiple OpenCL platforms in CLHelper and samples
  • allow setting call_convention to 0xffffffff in AMDHSA config
  • handle special cases with relative values while evaluating binary/logical operators
  • small fixes in CLRX documentation and Unix manuals
  • continue developing the unfinished AsmRegAlloc
  • add the missing 'read_write' access qualifier for images in AMD OpenCL 2.0