Found 1 platform(s): Plaform Profile: FULL_PROFILE Plaform Version: OpenCL 1.1 AMD-APP-SDK-v2.4 (595.10) Plaform Name: AMD Accelerated Parallel Processing Plaform Vendor: Advanced Micro Devices, Inc. PS: Yes, i tried putting explicit "_private", "private" or omitting the qualifier. Has anybody ever gotten a real value from getWorkGroupInfo or knows another way to avoid register-spilling without hardcoding hand-counted values? Don't think my getWorkGroupInfo calls are wrong cause CL_KERNEL_COMPILE_WORK_GROUP_SIZE ie returns good values.
PRIVATE KERNEL R4 SOFTWARE
I am not sure if this is a bug or my stupidity, because same software on NVIDIA-GPUs has same problems. getWorkGroupInfo seems to work (returned values seem to be rounded up towards next register-size-multiple). However using getWorkGroupInfo after i compiled my kernel never works. To do so i need to know how many memory i can and need to use. In order to avoid register-spilling i try to find the optimum work-group-size. The amount it uses depends and is not known until compilation (array-size set by macros). I have a kernel that uses a LOT of private memory. CL_KERNEL_PRIVATE_MEM_SIZE returns zero, always