OpenCL GPGPU simplification

Hi all,
If anyone’s interested in taking a look and giving some feedback, I wrote an extension to the OpenCL API to reduce overhead and simplify kernel calling. The project’s on GitLab; you can find it under noam_abadi/cl_simple.

I’m not a professional programmer, but I use parallel computing for simulations and I’m quite happy with this, so I wanted to get other people’s opinions on what’s missing, what could be removed, and what’s just plain bad. I don’t know if this is the right place to post it, but it seemed like a good start (sorry if I’m out of place here).

Kind regards and thank you,