Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
While GPU utilization allows one to speed up computations to the orders of magnitude, memory management remains the bottleneck making it often a challenge to achieve the desired performance. Hence, different memory optimizations are leveraged to make memory being used more effectively. We propose an approach automating memory management utilizing partial evaluation, a program transformation technique that enables data accesses to be pre-computed, optimized, and embedded into the code, saving memory transactions. An empirical evaluation of our approach shows that the transformed program could be up to 8 times as efficient as the original one in the case of CUDA C naïve string pattern matching algorithm implementation.
Original language | English |
---|---|
Title of host publication | PPoPP 2020 - Proceedings of the 2020 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming |
Publisher | Association for Computing Machinery |
Pages | 431-432 |
Number of pages | 2 |
ISBN (Electronic) | 9781450368186 |
ISBN (Print) | 9781450368186 |
DOIs | |
State | Published - 19 Feb 2020 |
Event | 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2020 - San Diego, United States Duration: 22 Feb 2020 → 26 Feb 2020 |
Name | Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP |
---|
Conference | 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2020 |
---|---|
Country/Territory | United States |
City | San Diego |
Period | 22/02/20 → 26/02/20 |
ID: 53079211