skj092/cuda-programming

CUDA Learning Challenge

Resources:

  1. https://leetgpu.com/playground
  2. Video Lecture: FreeCodeCamp
  3. Jeremy Howard: 2 lectures
  4. PMPP book (Programming Massively Parallel Processors)

Day 1: 28 Jan, 2025

  • Watched FreeCodeCamp CUDA lecture till Chapter 3 (1:36:48).
  • Found resources for writing CUDA programs.
  • Hello world in CUDA.
  • Copied one array to another in CUDA.

Day 2:

  • Add two vectors.
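
A minimal sketch of what a vector-add kernel can look like, not necessarily the code in this repository. The kernel name `vecAdd`, the block size of 256, and the use of managed memory (to keep the host code short) are assumptions made for illustration.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each thread adds one pair of elements.
__global__ void vecAdd(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // global thread index
    if (i < n) c[i] = a[i] + b[i];                  // guard the last, partial block
}

int main() {
    const int n = 1 << 20;
    const size_t bytes = n * sizeof(float);

    // Managed memory is visible to both host and device (assumption: chosen for brevity).
    float *a, *b, *c;
    cudaMallocManaged(&a, bytes);
    cudaMallocManaged(&b, bytes);
    cudaMallocManaged(&c, bytes);
    for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;  // round up so every element is covered
    vecAdd<<<blocks, threads>>>(a, b, c, n);
    cudaDeviceSynchronize();                         // wait before reading c on the host

    printf("c[0] = %.1f, c[n-1] = %.1f\n", c[0], c[n - 1]);

    cudaFree(a); cudaFree(b); cudaFree(c);
    return 0;
}
```

The index expression `blockIdx.x * blockDim.x + threadIdx.x` is the 1D mapping that the later matrix kernels extend to two dimensions.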

Day 3:

  • Find the square of every element of a matrix.

Day 4:

  • Element-wise multiplication of two matrices.
  • Naive matrix multiplication.
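
A sketch of a naive matrix multiplication kernel, assuming row-major storage and one thread per output element. The name `matmulNaive`, the matrix sizes, and the 16x16 block shape are illustrative choices, not taken from the repository.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// C (M x N) = A (M x K) * B (K x N), all row-major.
// Each thread computes one element of C by walking a row of A and a column of B.
__global__ void matmulNaive(const float* A, const float* B, float* C,
                            int M, int K, int N) {
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < M && col < N) {
        float acc = 0.0f;
        for (int k = 0; k < K; ++k)
            acc += A[row * K + k] * B[k * N + col];
        C[row * N + col] = acc;
    }
}

int main() {
    const int M = 64, K = 32, N = 48;
    float *A, *B, *C;
    cudaMallocManaged(&A, M * K * sizeof(float));
    cudaMallocManaged(&B, K * N * sizeof(float));
    cudaMallocManaged(&C, M * N * sizeof(float));
    for (int i = 0; i < M * K; ++i) A[i] = 1.0f;
    for (int i = 0; i < K * N; ++i) B[i] = 2.0f;

    dim3 block(16, 16);
    dim3 grid((N + block.x - 1) / block.x, (M + block.y - 1) / block.y);
    matmulNaive<<<grid, block>>>(A, B, C, M, K, N);
    cudaDeviceSynchronize();

    // With A all ones and B all twos, every element of C should equal 2 * K.
    printf("C[0] = %.1f (reference %.1f)\n", C[0], 2.0f * K);

    cudaFree(A); cudaFree(B); cudaFree(C);
    return 0;
}
```

Every thread reads a full row of A and a full column of B from global memory, which is the redundancy that shared-memory tiling later reduces.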

Day 5:

  • Solve all the problems of Chapter 3: PMPP

Day 6:

  • Add two vectors using shared memory (failed).
  • Add two matrices using shared memory (failed).
  • Print device properties.
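
For reference, a short sketch of querying device properties through the CUDA runtime API. The selection of fields printed here is an assumption about what is useful at this stage; `cudaDeviceProp` has many more.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int count = 0;
    cudaError_t err = cudaGetDeviceCount(&count);
    if (err != cudaSuccess) {
        printf("cudaGetDeviceCount failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    for (int d = 0; d < count; ++d) {
        cudaDeviceProp p;
        cudaGetDeviceProperties(&p, d);
        printf("Device %d: %s\n", d, p.name);
        printf("  compute capability:      %d.%d\n", p.major, p.minor);
        printf("  multiprocessors (SMs):   %d\n", p.multiProcessorCount);
        printf("  global memory:           %zu MiB\n", p.totalGlobalMem >> 20);
        printf("  shared memory per block: %zu KiB\n", p.sharedMemPerBlock >> 10);
        printf("  max threads per block:   %d\n", p.maxThreadsPerBlock);
        printf("  warp size:               %d\n", p.warpSize);
    }
    return 0;
}
```

The shared memory per block and max threads per block values bound the tile sizes that shared-memory kernels can use.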

Day 7:

  • Add two vectors using shared memory in Python.

Day 8:

  • Add two vectors using shared memory in CUDA.

Day 9:

  • Tried matrix multiplication using shared memory (failed).
  • I need more clarity on index mapping. Tomorrow I'll go back to writing Python and solve a simple problem using shared memory.

Day 10:

  • A shared memory matrix can be indexed in double-index format, like m[i][j].
  • Used double-index format to add two matrices using shared memory.
  • Tried and failed to add two matrices by loading tile pairs as in tiled multiplication, adding the tiles instead of multiplying them.
  • There are two ways to use shared memory: dynamic and static.
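
The two ways of declaring shared memory can be sketched side by side with a matrix addition. This is an illustration under assumptions: the kernel names `addStatic` and `addDynamic`, the `TILE` size of 16, and the test sizes are made up here. With a static declaration the size is known at compile time, so a 2D array with m[i][j] indexing works; with a dynamic declaration the size is passed at launch and the buffer is a flat 1D array.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

#define TILE 16

// Static shared memory: size fixed at compile time, 2D indexing available.
__global__ void addStatic(const float* a, const float* b, float* c, int rows, int cols) {
    __shared__ float sa[TILE][TILE];
    __shared__ float sb[TILE][TILE];
    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;
    if (row < rows && col < cols) {
        sa[threadIdx.y][threadIdx.x] = a[row * cols + col];
        sb[threadIdx.y][threadIdx.x] = b[row * cols + col];
        // Each thread reads back only what it wrote, so no __syncthreads() is needed here.
        c[row * cols + col] = sa[threadIdx.y][threadIdx.x] + sb[threadIdx.y][threadIdx.x];
    }
}

// Dynamic shared memory: one flat buffer whose byte size is the third launch parameter.
__global__ void addDynamic(const float* a, const float* b, float* c, int rows, int cols) {
    extern __shared__ float smem[];
    float* sa = smem;                             // first half of the buffer
    float* sb = smem + blockDim.x * blockDim.y;   // second half of the buffer
    int t = threadIdx.y * blockDim.x + threadIdx.x;  // flattened index within the block
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < rows && col < cols) {
        sa[t] = a[row * cols + col];
        sb[t] = b[row * cols + col];
        c[row * cols + col] = sa[t] + sb[t];
    }
}

int main() {
    const int rows = 100, cols = 70, n = rows * cols;
    float *a, *b, *c;
    cudaMallocManaged(&a, n * sizeof(float));
    cudaMallocManaged(&b, n * sizeof(float));
    cudaMallocManaged(&c, n * sizeof(float));
    for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

    dim3 block(TILE, TILE);
    dim3 grid((cols + TILE - 1) / TILE, (rows + TILE - 1) / TILE);

    addStatic<<<grid, block>>>(a, b, c, rows, cols);
    cudaDeviceSynchronize();
    printf("static:  c[n-1] = %.1f\n", c[n - 1]);

    size_t smemBytes = 2 * TILE * TILE * sizeof(float);  // room for both tiles
    addDynamic<<<grid, block, smemBytes>>>(a, b, c, rows, cols);
    cudaDeviceSynchronize();
    printf("dynamic: c[n-1] = %.1f\n", c[n - 1]);

    cudaFree(a); cudaFree(b); cudaFree(c);
    return 0;
}
```

Shared memory gives no speedup for element-wise addition, since each value is used once; the kernels only exercise the declaration and indexing patterns.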

Day 11:

  • Tried to understand the index mapping pattern for tiled matrix addition and multiplication.

Day 12:

  • Matrix addition using shared memory; understood clearly.
  • Matrix multiplication using shared memory.
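
A sketch of tiled matrix multiplication with static shared memory, in the style PMPP describes. The name `matmulTiled`, the `TILE` size of 16, and the test sizes are assumptions for illustration; the block must be launched as TILE x TILE threads. The index mapping is the key part: in phase t, each thread loads one element of the A tile (same row, column shifted by t * TILE) and one element of the B tile (same column, row shifted by t * TILE).

```cuda
#include <cstdio>
#include <cuda_runtime.h>

#define TILE 16

// C (M x N) = A (M x K) * B (K x N), row-major, one thread per output element.
__global__ void matmulTiled(const float* A, const float* B, float* C,
                            int M, int K, int N) {
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;
    float acc = 0.0f;

    for (int t = 0; t < (K + TILE - 1) / TILE; ++t) {
        int aCol = t * TILE + threadIdx.x;  // column of A this thread loads
        int bRow = t * TILE + threadIdx.y;  // row of B this thread loads

        // Out-of-range positions are filled with 0 so they do not affect the sum.
        As[threadIdx.y][threadIdx.x] = (row < M && aCol < K) ? A[row * K + aCol] : 0.0f;
        Bs[threadIdx.y][threadIdx.x] = (bRow < K && col < N) ? B[bRow * N + col] : 0.0f;
        __syncthreads();  // both tiles fully loaded before anyone reads them

        for (int k = 0; k < TILE; ++k)
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();  // everyone done reading before the next phase overwrites
    }

    if (row < M && col < N) C[row * N + col] = acc;
}

int main() {
    const int M = 70, K = 50, N = 90;  // deliberately not multiples of TILE
    float *A, *B, *C;
    cudaMallocManaged(&A, M * K * sizeof(float));
    cudaMallocManaged(&B, K * N * sizeof(float));
    cudaMallocManaged(&C, M * N * sizeof(float));
    for (int i = 0; i < M * K; ++i) A[i] = 1.0f;
    for (int i = 0; i < K * N; ++i) B[i] = 2.0f;

    dim3 block(TILE, TILE);
    dim3 grid((N + TILE - 1) / TILE, (M + TILE - 1) / TILE);
    matmulTiled<<<grid, block>>>(A, B, C, M, K, N);
    cudaDeviceSynchronize();

    // With A all ones and B all twos, every element of C should equal 2 * K.
    printf("C[last] = %.1f (reference %.1f)\n", C[M * N - 1], 2.0f * K);

    cudaFree(A); cudaFree(B); cudaFree(C);
    return 0;
}
```

Both `__syncthreads()` calls sit outside the bounds checks on purpose: every thread in the block has to reach them, including threads whose output element is out of range.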

Day 13:

  • Softmax of a matrix in Python and C.

Day 14:

  • Softmax of a matrix in CUDA, using one thread for each vector in the matrix.
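
A sketch of the one-thread-per-vector approach, assuming each vector is a row of a row-major matrix. The name `softmaxRows` and the test data are hypothetical. Each thread makes three passes over its row: find the max, exponentiate and sum, then normalize. Subtracting the max before `expf` keeps the exponentials from overflowing.

```cuda
#include <cstdio>
#include <cmath>
#include <cuda_runtime.h>

// One thread computes the softmax of one row.
__global__ void softmaxRows(const float* in, float* out, int rows, int cols) {
    int r = blockIdx.x * blockDim.x + threadIdx.x;
    if (r >= rows) return;

    const float* x = in + r * cols;
    float* y = out + r * cols;

    // Pass 1: row maximum, for numerical stability.
    float m = x[0];
    for (int j = 1; j < cols; ++j) m = fmaxf(m, x[j]);

    // Pass 2: exponentials and their sum.
    float sum = 0.0f;
    for (int j = 0; j < cols; ++j) {
        y[j] = expf(x[j] - m);
        sum += y[j];
    }

    // Pass 3: normalize.
    for (int j = 0; j < cols; ++j) y[j] /= sum;
}

int main() {
    const int rows = 4, cols = 8;
    float *in, *out;
    cudaMallocManaged(&in, rows * cols * sizeof(float));
    cudaMallocManaged(&out, rows * cols * sizeof(float));
    for (int i = 0; i < rows * cols; ++i) in[i] = (float)(i % 5);

    const int threads = 128;
    softmaxRows<<<(rows + threads - 1) / threads, threads>>>(in, out, rows, cols);
    cudaDeviceSynchronize();

    // Each row of the output should sum to approximately 1.
    float s = 0.0f;
    for (int j = 0; j < cols; ++j) s += out[j];
    printf("row 0 sums to %f\n", s);

    cudaFree(in); cudaFree(out);
    return 0;
}
```

This mapping is simple but leaves most of the GPU idle when the matrix has few rows, and neighboring threads read addresses a full row apart, so global memory accesses are not coalesced.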

Day 15:

Day 16:

  • Implemented online softmax in C and CUDA.
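
A sketch of online softmax, again with one thread per row as an assumption. The name `onlineSoftmaxRows` and the test data are hypothetical. The idea is to compute the running max and the running sum of exponentials in a single pass: whenever the max grows from m to m_new, the sum accumulated so far is rescaled by exp(m - m_new). The sketch assumes finite inputs.

```cuda
#include <cstdio>
#include <cmath>
#include <cuda_runtime.h>

// One thread per row; max and normalizer are computed together in one pass.
__global__ void onlineSoftmaxRows(const float* in, float* out, int rows, int cols) {
    int r = blockIdx.x * blockDim.x + threadIdx.x;
    if (r >= rows) return;

    const float* x = in + r * cols;
    float* y = out + r * cols;

    float m = -INFINITY;  // running maximum
    float d = 0.0f;       // running sum of exp(x[j] - m)
    for (int j = 0; j < cols; ++j) {
        float mNew = fmaxf(m, x[j]);
        // Rescale the old sum to the new maximum, then add the new term.
        d = d * expf(m - mNew) + expf(x[j] - mNew);
        m = mNew;
    }

    // Second pass: write the normalized values.
    for (int j = 0; j < cols; ++j) y[j] = expf(x[j] - m) / d;
}

int main() {
    const int rows = 4, cols = 8;
    float *in, *out;
    cudaMallocManaged(&in, rows * cols * sizeof(float));
    cudaMallocManaged(&out, rows * cols * sizeof(float));
    for (int i = 0; i < rows * cols; ++i) in[i] = (float)(i % 5);

    const int threads = 128;
    onlineSoftmaxRows<<<(rows + threads - 1) / threads, threads>>>(in, out, rows, cols);
    cudaDeviceSynchronize();

    // Each row of the output should sum to approximately 1.
    float s = 0.0f;
    for (int j = 0; j < cols; ++j) s += out[j];
    printf("row 0 sums to %f\n", s);

    cudaFree(in); cudaFree(out);
    return 0;
}
```

Compared with the three-pass version, this reads each row twice instead of three times, which matters when the kernel is limited by memory bandwidth.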

Day 17:

  • Profiler setup on softmax
  • PyTorch setup for cpp

Day 18:

  • Read the first half of PMPP Chapter 6.

Day 19:

  • Read PMPP Chapter 6.

Day 20:

  • Read Maharshi's blog post on softmax optimization in CUDA.
