401-402-403-404
20191119T153000
20191119T170000
Linear Algebra Algorithms
Paper

AutoFFT: A Template-Based FFT Codes Auto-Generation Framework for ARM and X86 CPUs

Li, Jia, Zhang, Chen, Yuan...

The discrete Fourier transform (DFT) is widely used in scientific and engineering computation. This paper proposes a template-based code generation framework named AutoFFT that can automatically generate high-performance fast Fourier transform (FFT) codes. AutoFFT employs the Cooley-Tukey FFT algori...
el Matrix Multiplication

Kwasniewski, Kabic, Besta, Solca, VandeVondele...

We propose COSMA: a parallel matrix-matrix multiplication algorithm that is near communication-optimal for all combinations of matrix dimensions, processor counts, and memory sizes. The key idea behind COSMA is to derive an optimal (up to a factor of 0.03% for 10MB of fast memory) sequential schedul...

---------------------
SLATE: Design of a Modern Distributed and Accelerated Linear Algebra Library

Gates, Kurzak, Charara, YarKhan, Dongarra

The SLATE (Software for Linear Algebra Targeting Exascale) library is being developed to provide fundamental dense linear algebra capabilities for current and upcoming distributed high-performance systems, both accelerated CPU-GPU-based and CPU-based. SLATE will provide coverage of existing ScaLAPAC...


Tag: Tech Program Reg Pass, Algorithms, I/O, Linear Algebra, Parallel Programming Languages, Libraries, and Models, Performance, Task-based programming

Registration Category: Tech Program Reg Pass, Algorithms, I/O, Linear Algebra, Parallel Programming Languages, Libraries, and Models, Performance, Task-based programming
