Quantcast
Channel: Intel® Many Integrated Core Architecture
Viewing all articles
Browse latest Browse all 1347

Optmization Techniques for the Intel® MIC Architecture: Part 2 of 3

$
0
0

Abstract

This is part 2 of a 3-part educational series of publications introducing select topics on optimization of applications for Intel’s multi-core and manycore architectures (Intel® Xeon®  processors and Intel® XeonPhi™ coprocessors).

In this paper we discuss data parallelism. Our focus is automatic vectorization and exposing vectorization opportunities to the compiler. For a practical illustration, we construct and optimize a micro-kernel for particle binning particles.

Similar workloads occur applications in Monte Carlo simulations, particle physics software, and statistical analysis.

The optimization technique discussed in this paper leads to code vectorization, which results in an order of magnitude performance improvement on an Intel Xeon processor. Performance on Xeon Phi coprocessor compared to that on a high-end Intel Xeon is 1.4x greater in single precision and 1.6x greater in double precision.

Download the full articleDownloadDownload

  • Sviluppatori
  • Professori
  • Studenti
  • Linux*
  • C/C++
  • Modernizzazione codici
  • Architettura Intel® Many Integrated Core
  • Vettorizzazione
  • URL

  • Viewing all articles
    Browse latest Browse all 1347

    Trending Articles



    <script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>