Lecture 1: Basic I (slides)
Lecture 2: Basic II (slides)
Lecture 3: Distributed-Memory Programming with MPI (I) (slides)
Lecture 4: Distributed-Memory Programming with MPI (II) (slides)
Lecture 5: Shared-Memory Programming with Pthreads (I) (slides)
Lecture 6: Shared-Memory Programming with Pthreads (II) (slides)
Lecture 7: Shared-Memory Programming with OpenMP (I) (slides)
Lecture 8: Shared-Memory Programming with OpenMP (II) (slides)
Lecture 9: Parallel Program Development (slides)
Lecture 10: Parallel Computing using GPU (slides)
Lecture 11: CUDA Memory and Data Locality (slides)
Lecture 12: CUDA Performance Considerations (slides)
Lecture 13: Parallel Convolution and Constant Memory (slides)
Lecture 14: Application Case Study of CUDA – Machine Learning (slides)