Lecture 12: Global Memory Access Patterns and Implications.
Lecture Summary
Banks


Example


Getting the results right (broadly for parallel computing)
Example


Data Hazards
Getting the results fast (for specifically GPU computing)


PreviousLecture 11: Execution Divergence. Control Flow in CUDA. CUDA Shared Memory Issues.NextLecture 13: Atomic operations in CUDA. GPU ode optimization rules of thumb.
Last updated