March 3, 2026Open Access

Enhancing GPU-HBM Data Transfer Efficiency Using Markov Chains and Neural Network-Driven Predictive Caching with Quantization and Pruning

Puntos clave

Data transfer efficiency improves with the combined use of markov chains and neural networks, reducing bottlenecks.
Using markov chains helps model data access transitions, leading to better cache management and predictive caching.
Neural network optimization enhances the accuracy of data prefetching, supporting better overall performance.
Efficient cache utilization addresses CPU-GPU bandwidth limitations, vital for high-demand computational needs.

Resumen

Background High-bandwidth memory (HBM) systems face persistent data transfer bottlenecks, particularly when CPUs are unable to supply data to GPUs at a sufficient rate. This limitation reduces overall computational efficiency and highlights the need for improved cache management strategies. Methods: Markov Chains represented transitions between frequently accessed memory blocks, enabling predictive sequencing of data needs. A neural network was then applied to model and optimise these Markov transitions, improving cache prefetching accuracy and further optimising data movement techniques. Results & Conclusions: The combined use of Markov-based memory modelling, NN optimisation, and supplementary data transfer techniques demonstrates strong potential to mitigate CPU–GPU bandwidth limitations. Together, these methods offer more efficient cache utilization and reduced bottlenecks in high-demand computational environments.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Samiel Azmaien

Journals

International Journal of Soft Computing and Engineering

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Enhancing GPU-HBM Data Transfer Efficiency Using Markov Chains and Neural Network-Driven Predictive Caching with Quantization and Pruning

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study