Posts by Category

GSoC

GSoC Week 5: The Hardware-Aware Mindset for High-Performance C++

5 minute read

Published:

Refactoring SU2 code taught me that nested std::vector<std::vector> scatters data across memory, leading to cache misses, while a flat std::vector creates a predictable layout that lets hardware prefetchers work efficiently. This simple change delivered notable performance gains and streamlined GPU interfacing and serialization.

GSoC Week 3: Profiling SU2 with Tracy

6 minute read

Published:

In this blog post, I explore integrating Tracy Profiler with SU2 for my Google Summer of Code project. I detail the setup of Tracy’s client and server, instrumenting functions, and leveraging its low-overhead, real-time visualization to optimize performance in SU2 simulations.

GSoC Week 2: Investigating Performance Bottlenecks in SU2

2 minute read

Published:

This post explores how I used gprof, valgrind, and kcachegrind to identify and analyze performance bottlenecks in SU2, focusing on the neural network module. The approach is generalizable to other large C++ projects.