How to
for example, I have#pragma offload nocopy(a) { a = malloc(sizeof(double)*ny*nx); }And now I want to initialize its first k lines from the data from HostI can do something like:inout =...
View Articlemalloc issue on more than one MIC
I have a source code: #include<offload.h> #include<stdio.h> int main() { int ndev = _Offload_number_of_devices(); printf("Ndev = %d\n", ndev); void *u; for (int d = 0; d < ndev; d++) {...
View ArticleMost efficient way for atomic updates on Xeon Phi
I have found out that __kmpc_atomic_float4_add was used in the assembly code of the following two lines:#pragma omp atomic array[i] += 1.0;Performance of this code is not good on Intel Xeon Phi when...
View Articlecilk runtime error
I am using MYO model for offloading my code to MIC but it I am getting some sort of runtime errors .Is this errors are code related or some system issues.CARD--ERROR:1 _myoiPageFaultHandler:...
View Articleoffload error: process on the device 0 was terminated by signal 11 (SIGSEGV)
I need some help,here is my code,I want to know why I can't do like this: //this is the headfile#include <stdio.h> #include <stdlib.h> class ihavefun{ public: void callfun();...
View ArticleSearching for applications
I am searching for applications, where Xeon Phi is just faster than Nvidia Cards at the moment i am trying to run the benchmark parallutions based on spares matrices to get some data. I still figured...
View Articlemicctrl crash
micctrl crashes if I change OSimage in micX.conf on MPSS 3.2. I traced it and found a problem. I checked the MPSS-3.2.1 source code and looks this problem still there. See below for a fix.- kaz---...
View ArticleXeon-Phi asynchronous offload from host openMP parallel region
I am using intel's offload pragmas in host openMP code. The code looks as follows int s1 = f(a,b,c); #prama offload singnal(s1) in (...) out(x:len) { for (int i = 0; i < len; ++i)...
View ArticleDebugging Intel® Xeon Phi™ Applications on Linux* Host
ContentsIntroductionDebug Solution for Intel® MICHow to get it?Why use the provided GNU* GDB from Intel?Why is Intel providing a Command Line and Eclipse* IDE Integration?Deprecation...
View ArticleDebugging Intel® Xeon Phi™ Applications on Windows* Host
ContentsIntroductionDebug Solution for Intel® MICHow to get it?Debug Solution as IntegrationComponents RequiredConfigure & TestPrerequisite for DebuggingDebugging Applications with Offload...
View ArticleMPSS 3.2 on 3.13.0-24 kernel
Hi,I am trying to install MPSS 3.2 on kernel 3.13.0-24, but when I start the services, I am getting error"Error getting SCIF driver version mic0: booting (mode: linux image:...
View ArticleCompiling for Offload, libirng.so needed, crash on first Offload
Dear Support Team, dear all,I'm trying to compile and run an application for the Xeon PHI. However when I link my application with a build script it prints the following warning:ld: warning:...
View ArticlePhi asynchronous offload from host openMP parallel region -II
This question, in a way, is continuation of question asked before (https://software.intel.com/en-us/forums/topic/509845). In my code, I need to use offload in slightly more obfuscated fashion. Main...
View ArticleRuntime error with Offload
HOST--ERROR:myoiOSSetPageAccess: mprotect failed! Please increase the maximum of memory map areas i.e. echo 256000 > /proc/sys/vm/max_map_countoffload error: process on the device 0...
View ArticleIssues using shared-memory based COIBUFFER
I am trying to reduce latency for exchanging data between host and target via PINNED COIBuffers. Based on the description provided in the header file, PINNED buffers use shared memory so I am assuming...
View ArticleIs the Window's coi_host.dll buildable?
We have issues with our program which uses offload mode on Windows. We think our issues are possibly related with something with coi_host.dll. We have callstacks that mention coi_host.dll but the...
View ArticleReport a bug found in micvec.h
While using the shuffle operations in micvec.h (provided by Intel Composer 13.1), I found the template function of shuffle() cannot be correctly compiled. The contents are as below:template...
View Article_mm512_i32logather_pd
Hi, I am trying to use gather function in the follow code on MIC. When I ran it, it said "Segmentation fault". Can someone tell me how to fix it? Thanks, #include <stdlib.h> #include...
View ArticleCatastrophic error while using _mm512_extload_epi32
Dear experts,I'm having some troubles while using the _mm512_extload_epi32 instrinsic. I want to load 16 signed char elements and convert them to int32 vector. The instruction is:___m512i v =...
View ArticleOptimizing indirect memory read and write access on MIC
Extended addition (described below ) is one of the most performance critical kernel in our code (that implements important functions from sparse linear algebra). AddEx(double* A, int LDA, double *B,...
View Article