research

Publications, talks, and posters by categories in reverse chronological order.

Publications

2025

  1. SC ’25
    RAPTOR: Practical Numerical Profiling of Scientific Applications
    Faveo Hoerold, Ivan R. Ivanov, Akash Dhruv, William S. Moses, Anshu Dubey, Mohamed Wahib, and Jens Domke
    In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , , 2025
  2. LLVM-HPC @ SC ’25
    Dynamic Thread Coarsening for CPU and GPU OpenMP Code
    Ivan R. Ivanov, Jens Domke, Toshio Endo, and Johannes Doerfert
    In Proceedings of the SC ’25 Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis , , 2025

2024

  1. arXiv
    Tadashi: Enabling AI-Based Automated Code Generation With Guaranteed Correctness
    Emil Vatai, Aleksandr Drozd, Ivan R. Ivanov, Yinghao Ren, and Mohamed Wahib
    2024
  2. IWOMP ’24
    Automatic Parallelization and OpenMP Offloading of Fortran Array Notation
    Ivan R. Ivanov, Jens Domke, Toshio Endo, and Johannes Doerfert
    In Advancing OpenMP for Future Accelerators , 2024
  3. EuroMPI ’24
    SPMD IR: Unifying SPMD and Multi-value IR Showcased for Static Verification of Collectives
    Semih Burak, Ivan R. Ivanov, Jens Domke, and Matthias Müller
    In Recent Advances in the Message Passing Interface , 2024
  4. arXiv
    Input-Gen: Guided Generation of Stateful Inputs for Testing, Tuning, and Training
    Ivan R. Ivanov, Joachim Meyer, Aiden Grossman, William S. Moses, and Johannes Doerfert
    Jun 2024
  5. CGO ’24
    Retargeting and Respecializing GPU Workloads for Performance Portability
    I. R. Ivanov, O. Zinenko, J. Domke, T. Endo, and W. S. Moses
    In 2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) , Mar 2024

2023

  1. PPoPP ’23
    High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs
    William S. Moses, Ivan R. Ivanov, Jens Domke, Toshio Endo, Johannes Doerfert, and Oleksandr Zinenko
    In Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming , Montreal, QC, Canada, Mar 2023

2019

  1. SC ’19
    HyperX Topology: First at-Scale Implementation and Comparison to the Fat-Tree
    Jens Domke, Satoshi Matsuoka, Ivan R. Ivanov, Yuki Tsushima, Tomoya Yuki, Akihiro Nomura, Shin’ichi Miura, Nie McDonald, Dennis L. Floyd, and Nicolas Dubé
    In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , Denver, Colorado, Mar 2019

Talks

2025

  1. RAPTOR: Practical Numerical Profiling of Scientific Applications
    Faveo Hoerold, Ivan R. Ivanov, Akash Dhruv, William S. Moses, Anshu Dubey, Mohamed Wahib, and Jens Domke
    In SC ’25 , Nov 2025
  2. Dynamic Thread Coarsening for CPU and GPU OpenMP Code
    Ivan R. Ivanov, Jens Domke, Toshio Endo, and Johannes Doerfert
    In LLVM-HPC @ SC ’25 , Nov 2025
  3. Automatic Minimal and Relocatable Proxy App Generation
    Ivan R. Ivanov, Jens Domke, Toshio Endo, and Johannes Doerfert
    In Student Research Competition at CGO 2025 , Mar 2025
  4. Polyhedral Rescheduling of GPU Kernels To Exploit Async Memory Movement
    Ivan R. Ivanov, William Moses, Emil Vatai, Toshio Endo, Jens Domke, and Alex Zinenko
    In Ninth LLVM Performance Workshop at CGO 2025 , Mar 2025

2024

  1. Automatic Parallelization and OpenMP Offloading of Fortran Array Notation
    Ivan R. Ivanov, Jens Domke, Toshio Endo, and Johannes Doerfert
    In 20th International Workshop on OpenMP , Sep 2024
  2. Input-Gen: Guided Generation of Stateful Inputs for Testing, Tuning, and Training
    Ivan R. Ivanov
    In Monthly LLVM ML Guided Compiler Optimizations Meeting , Aug 2024
  3. Retargeting and Respecializing GPU Workloads for Performance Portability
    Ivan R. Ivanov
    In R-CCS Cafe , Jun 2024
  4. Automatic Retuning of Floating-Point Precision
    Ivan R. Ivanov, and W. S. Moses
    In 2024 Euro LLVM Developers’ Meeting , Apr 2024
  5. Automatic Proxy App Generation through Input Capture and Generation
    Ivan R. Ivanov, Aiden Grossman, Ludger Paehler, William S. Moses, and Johannes Doerfert
    In 2024 Euro LLVM Developers’ Meeting , Apr 2024
  6. Retargeting and Respecializing GPU Workloads for Performance Portability
    Ivan R. Ivanov, O. Zinenko, J. Domke, T. Endo, and W. S. Moses
    In CGO ’24 , Mar 2024
  7. Automatic Parallelization and OpenMP Offloading of Fortran
    Ivan R. Ivanov, J. Domke, T. Endo, and J. Doerfert
    In CGO ’24 LLVM Performance Workshop , Mar 2024

2023

  1. Optimization of CUDA GPU Kernels and Translation to AMDGPU in Polygeist/MLIR
    Ivan R. Ivanov, O. Zinenko, J. Domke, T. Endo, and W. S. Moses
    In 2023 LLVM Developers’ Meeting. Student Talk , Oct 2023
  2. GPU Kernel Compilation in Polygeist/MLIR
    Ivan R. Ivanov, O. Zinenko, J. Domke, T. Endo, J. Doerfert, and W. S. Moses
    In 2023 LLVM Developers’ Meeting GPU Offloading Workshop. Lightning Talk , Oct 2023

2022

  1. High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs in Polygeist/MLIR
    W. S. Moses, Ivan R. Ivanov, J. Domke, T. Endo, J. Doerfert, and O. Zinenko
    In 2022 LLVM Developers’ Meeting. Lightning Talk , Nov 2022
  2. Automatic translation of CUDA code into high performance CPU code using LLVM IR transformations
    Ivan R. Ivanov, J. Domke, and T. Endo
    In The 4th R-CCS Internation Symposium. Lightning Talk , Feb 2022

2021

  1. Improved failover for HPC interconnects through localised routing restoration
    Ivan R. Ivanov, J. Domke, A. Nomura, and T. Endo
    In The 3rd R-CCS Internation Symposium. Lightning Talk , Feb 2021

Posters

2025

  1. Automatic Minimal and Relocatable Proxy App Generation
    Ivan R. Ivanov, Jens Domke, Toshio Endo, and Johannes Doerfert
    In Student Research Competition at CGO 2025 , Mar 2025

2024

  1. Dynamic Thread Coarsening for OpenMP Offloading.
    Ivan R. Ivanov, J. Domke, T. Endo, and J. Doerfert.
    In CGO ’24 Student Research Competition. , Mar 2024
  2. Unifying SPMD and Multi-Value IR - Use Case: Static Verification of Collective Communication.
    S. Burak, Ivan R. Ivanov, J. Domke, and M. Mueller.
    In CGO ’24 Student Research Competition. , Mar 2024

2023

  1. Performance Portability of C/C++ CUDA Code via High-Level Intermediate Representation
    Ivan R. Ivanov, O. Zinenko, J. Domke, T. Endo, and W. S. Moses.
    In 2023 RIKEN Summer School , Sep 2023
  2. BITFLEX - An HPC User-Driven Automatic Toolchain for Precision Manipulation and Approximate Computing.
    Ryan Barton, Mohamed Wahib, Jens Domke, Ivan R. Ivanov, Toshio Endo, and Satoshi Matsuoka.
    In ISC High Performance 2023 , May 2023
  3. Parallel Optimizations and Transformations of GPU Kernels Using a High-Level representation in MLIR/Polygeist.
    Ivan R. Ivanov, William S. Moses, Jens Domke, and Toshio Endo.
    In CGO ’23 Student Research Competition , Feb 2023

2022

  1. High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs in Polygeist/MLIR.
    W. S. Moses, Ivan R. Ivanov, J. Domke, T. Endo, J. Doerfert, and O. Zinenko.
    In 2022 LLVM Developers’ Meeting. , Nov 2022

Awards

SC25 Best Reproducibility Advancement Award

CGO 2025 ACM Student Research Competition 3rd place

Theses

Mar 2023 - Master’s Thesis

Optimizations and Transformations of Parallel Code via High Level Intermediate Representation

Mar 2021 - Bachelor’s Thesis

Improved failover for HPC interconnects through localised routing restoration

Conference Service

CGO 2026 Workshop & Tutorial Chair

EuroPar 2025 Program Comittee Member.

SC 24 Reproducibility Committee Member.

LLVM-GPU: First International Workshop on LLVM for GPUs at EuroPar 24 Program Committee Member

CGO25 Artifact Evaluation Comittee Member.

CGO24 LLVM Performance Workshop Moderation.

2024 Euro LLVM Session Moderation.

Review for LLMxHPC 2024 at Cluster ‘24

Review for IPDPS25.

Review for HPC Asia 2025.