We are improving our search experience. To check which content you have full access to, or for advanced search, go back to the old search.

Search

Please fill in this field.
Filters applied:

Search Results

Showing 1-20 of 3,038 results
  1. High-performance simulations of turbulent boundary layer flow using Intel Xeon Phi many-core processors

    Direct numerical simulations (DNS) of turbulent flows have increasing importance because they not only provide fundamental understanding of turbulent...

    Ji-Hoon Kang, **yul Hwang, ... Hoon Ryu in The Journal of Supercomputing
    Article 11 February 2021
  2. Performance benchmarking of deep learning framework on Intel Xeon Phi

    With the success of deep learning (DL) methods in diverse application domains, several deep learning software frameworks have been proposed to...

    Chao-Tung Yang, Jung-Chun Liu, ... Chan-Fu Kuo in The Journal of Supercomputing
    Article 17 June 2020
  3. Comparison of HPC Architectures for Computing All-Pairs Shortest Paths. Intel Xeon Phi KNL vs NVIDIA Pascal

    Today, one of the main challenges for high-performance computing systems is to improve their performance by kee** energy consumption at acceptable...
    Manuel Costanzo, Enzo Rucci, ... Marcelo Naiouf in Computer Science – CACIC 2020
    Conference paper 2021
  4. A server-side accelerator framework for multi-core CPUs and Intel Xeon Phi co-processor systems

    Processing-intensive web server requests can lead to low Quality of Service (QoS), such as longer mean response time and lower throughput, which...

    Guohua You, Xue**g Wang in Cluster Computing
    Article 01 January 2020
  5. Revisiting the performance optimization of QR factorization on Intel KNL and SKL multiprocessors

    This study focused on the optimization of double-precision general matrix–matrix multiplication (DGEMM) routine to improve the QR factorization...

    Muhammad Rizwan, Enoch Jung, ... Jaeyoung Choi in The Journal of Supercomputing
    Article 13 March 2024
  6. Improving blocked matrix-matrix multiplication routine by utilizing AVX-512 instructions on intel knights landing and xeon scalable processors

    In high-performance computing, the general matrix-matrix multiplication (xGEMM) routine is the core of the Level 3 BLAS kernel for effective...

    Yoosang Park, Raehyun Kim, ... Jaeyoung Choi in Cluster Computing
    Article 12 April 2021
  7. Performance Evaluation of Pseudospectral Ultrasound Simulations on a Cluster of Xeon Phi Accelerators

    The rapid development of novel procedures in medical ultrasonics, including treatment planning in therapeutic ultrasound and image reconstruction in...
    Filip Vaverka, Bradley E. Treeby, Jiri Jaros in High Performance Computing in Science and Engineering
    Conference paper 2021
  8. Implementation of Parallel 3-D Real FFT with 2-D Decomposition on Intel Xeon Phi Clusters

    In this paper, we propose an implementation of a parallel 3-D real fast Fourier transform (FFT) with 2-D decomposition on Intel Xeon Phi clusters....
    Conference paper 2020
  9. Accelerating time series motif discovery in the Intel Xeon Phi KNL processor

    Time series analysis is an important research topic of great interest in many fields. Recently, the Matrix Profile method, and particularly one of...

    Ivan Fernandez, Alejandro Villegas, ... Oscar Plata in The Journal of Supercomputing
    Article 10 June 2019
  10. Enhanced OpenMP Algorithm to Compute All-Pairs Shortest Path on X86 Architectures

    Graphs have become a key tool when modeling and solving problems in different areas. The Floyd-Warshall (FW) algorithm computes the shortest path...
    Sergio Calderón, Enzo Rucci, Franco Chichizola in Computer Science – CACIC 2023
    Conference paper 2024
  11. Performance Analysis of a Parallel Denoising Algorithm on Intel Xeon Computer System

    This paper presents an experimental performance study of a parallel implementation of the Poissonian image restoration algorithm. Hybrid...
    Conference paper 2020
  12. Black-Scholes Option Pricing on Intel CPUs and GPUs: Implementation on SYCL and Optimization Techniques

    The Black-Scholes option pricing problem is one of the widely used financial benchmarks. We explore the possibility of develo** a high-performance...
    Elena Panova, Valentin Volokitin, ... Iosif Meyerov in Supercomputing
    Conference paper 2022
  13. Optimization of heterogeneous systems with AI planning heuristics and machine learning: a performance and energy aware approach

    Heterogeneous computing systems provide high performance and energy efficiency. However, to optimally utilize such systems, solutions that distribute...

    Suejb Memeti, Sabri Pllana in Computing
    Article Open access 19 October 2021
  14. Performance Analysis of Deep Learning Inference in Convolutional Neural Networks on Intel Cascade Lake CPUs

    The paper aims to compare the performance of deep convolutional network inference. Experiments are carried out on a high-end server with two Intel...
    Evgenii P. Vasiliev, Valentina D. Kustikova, ... Iosif B. Meyerov in Mathematical Modeling and Supercomputer Technologies
    Conference paper 2021
  15. A Novel Algorithm for Bi-objective Performance-Energy Optimization of Applications with Continuous Performance and Linear Energy Profiles on Heterogeneous HPC Platforms

    Performance and energy are the two most important objectives for optimization on heterogeneous HPC platforms. This work studies a mathematical...
    Hamidreza Khaleghzadeh, Ravi Reddy Manumachu, Alexey Lastovetsky in Euro-Par 2021: Parallel Processing Workshops
    Conference paper 2022
  16. Numerical Modeling of Hydrodynamic Turbulence with Self-gravity on Intel Xeon Phi KNL

    In this paper, we present the results of numerical simulations of hydrodynamic turbulence with self-gravity, employing the latest Intel Xeon Phi...
    Igor Kulikov, Igor Chernykh, ... Alexander Tutukov in Parallel Computational Technologies
    Conference paper 2019
  17. A fully-customized dataflow engine for 3D earthquake simulation with a complex topography

    With HPC (high performance computing) evolving into the exascale era, improvements in computing performance and power efficiency have become...

    Bingwei Chen, Haohuan Fu, ... Guangwen Yang in Science China Information Sciences
    Article 29 November 2021
  18. Fast solution of electromagnetic scattering problems using Xeon Phi coprocessors

    Electromagnetic scattering problems can be solved by discretizing and transforming integral equations into matrix equations using the method of...

    J. L. Campon, L. Landesa in The Journal of Supercomputing
    Article 02 January 2019
  19. A System-Wide Communication to Couple Multiple MPI Programs for Heterogeneous Computing

    This paper proposes a system-wide communication library to couple multiple MPI programs for heterogeneous coupling computing called...
    Shinji Sumimoto, Takashi Arakawa, ... Kengo Nakajima in Parallel and Distributed Computing, Applications and Technologies
    Conference paper 2023
  20. Performance and Scalability Analysis of AI-Accelerated CFD Simulations Across Various Computing Platforms

    In this paper, we perform an extensive benchmarking and analysis of the performance and scalability of our software tool called CFD suite, which...
    Krzysztof Rojek, Roman Wyrzykowski in Euro-Par 2022: Parallel Processing Workshops
    Conference paper 2023
Did you find what you were looking for? Share feedback.