What does __kmp_wait_yield_4 imply in hot spot analysis of vTune

Hi,

I am rewriting part of LibSVM code for vectorization in order to accelerate it on Xeon Phi. Unfortunately, the performance of my rewriting code is the same as, if not slightly worse than, the original performance on Xeon Phi. To figure out the reason, I profiled two programs using vTune hot spot analysis. According to the results, the target code segment takes significantly less time in the rewriting version, however, most of its time goes to __kmp_wait_yield_4, which only occupies a tiny amount of time in the original version.

Does anyone know what it means? I tried to Google it but got very little information.

BTW, my vectorization code gets about 20% improvement in total if running in the host.

Thanks in advance!

What does __kmp_wait_yield_4 imply in hot spot analysis of vTune

Trending Articles

FINAL LESSON

PURPLE RANGE LIVE AT GAL AMUNA 2013

Download: Dismanto Ft Rich Bizzy – Bwete (Prod by: Dismanto)

Moondru Mudichu 27-05-2016 – Polimer tv Serial

Tigers to Lions: San Beda names Kungfu Reyes as Lady Red Spikers head coach...

Pasulong o Paurong? (Col. 2:1-7)

FIFA 15 PPSSPP Android Download

Huzurabad Municipality into 30 wards

NOTES ZA GENERAL CHEMISTRY ZA NGAIZA

Download EFF Song –“Azania”, led by Mbuyiseni Dlozi

Principal’s past includes domestic violence case

Adolescence Paragraph

Nalgonda District Police Office Mobile Numbers List in Telangana State

Daru and Sharab Status for Sharabi Friends in Hindi, Punjabi

XAMJYSS VPN APP | Powered by XAMJYSSVPN | Sun TU CTC FLIP | GTM FB IG |

मुख मैथुन से उठाएं सेक्स का भरपूर मज़ा, जानें क्या है इसका सही तरीकामुख मैथुन...

[Download MP3] Iyzeal Feat. Okpo Records –“Ekaette Ibak”

Lirik Lagu Rohani Glory Haleluya - Yochen Amos

Lady Gaga – MAYHEM (2025) [FLAC 24bit/44,1kHz]

Arrow Flash 2 Sinhala Teledrama – Last Episode 33 – 24th April 2016