Dear all,
I heard that "GPU can create threads using hardware, one instruction creating one thread".
Does Phi have similar support? For example, when I use OpenMP, can I use some features of Phi to reduce the overhead of creating threads?
Thanks very much.