Skip to content

chenweiphd/OpenRVGPUCourse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 

Repository files navigation

RISC-V based Open-Source GPU Architecture and Design Exploration

img

Passing the Torch to Cultivate the Next Generation of Turing Award Laureates

As a core direction in the strategic priorities of the global chip industry and technology powerhouses, RISC-V, with its efficient instruction set, royalty-free licensing, and flexible extensibility, is reshaping the technical landscape of GPUs in large language model computing. This course focuses on the hot interdisciplinary frontiers of RISC-V open-source architecture and GPU design, constructing an exploratory knowledge system spanning from instruction sets to many-core computing architectures. It aims to achieve entry into RISC-V GPU architecture in the shortest path while exploring the application potential of technologies such as 2.5D/3D/3.5D Chiplet and open-source EDA in open-source GPUs. Beyond passing the torch of knowledge, the course takes "cultivating the next generation of Turing Award laureates" as its ultimate goal. Through a trinity training model of "theory-practice-innovation," it inspires students' deep insights into core architectures and their ability to innovate disruptively.

The core content covers: integrated design of RISC-V Instruction Set Architecture (ISA) and GPU parallel computing architectures; open-source GPU core modules (e.g., stream processors, memory architecture, TensorCore); and principles and exploratory implementations of core compilers. Building on this, the course extends to cutting-edge research areas: On one hand, relying on C-Model simulation and FPGA prototypes/OpenEDA chip prototype verification platforms, students must complete the full chip design process from architectural modeling, functional verification to deployment implementation, honing their "from-scratch" system-level design capabilities through practice. On the other hand, the course focuses on pain points in computational demands of large language models (LLM/Transformer), guiding students to explore architectural optimization strategies for open-source RISC-V GPUs in scenarios like tensor computation and mixed-precision training, and even challenging frontier topics such as "custom instruction extensions" and "memory compute integration units."

image-20251018175829993

To meet the needs of cultivating "Turing Award-caliber" talent, the course breaks traditional teaching boundaries, adopting a "problem-driven + academic symbiosis" model: On one hand, it sets up project clusters for "Open-Source GPU Architecture Exploration," training students to team up and complete academically competitive prototype designs—from analyzing performance bottlenecks in existing open-source GPUs, to proposing new parallel computing units based on RISC-V extended instructions, and validating optimization effects through simulation and testing, simulating the entire process of top research teams' breakthroughs. On the other hand, the course deeply collaborates with the global RISC-V Foundation, open-source GPU communities, and top university labs, allowing students to directly contribute to open-source projects and even lead sub-project research, accumulating international academic influence. Additionally, the course invites leading figures in GPU architecture and chief architects of top AI chip companies to host specialized seminars, sparking intellectual exchanges on topics like "evolution of open-source RVGPU architecture" and "reconstruction of computing paradigms by open-source ecosystems," inspiring students' forward-looking expansion and exploration of technological trends.

The course emphasizes dual cultivation of research thinking and industrial literacy in the "Berkeley mode": fostering students' resilience in facing technical challenges, guiding reflection on the social value of technological innovation through "architectural design," and breaking down disciplinary barriers via "interdisciplinary collisions" (e.g., cross-domain discussions with computer architecture, artificial intelligence, and integrated circuits). Ultimately, students will possess the hard power to independently conduct cutting-edge architecture research—able to delve into micro-architectural details of RISC-V GPUs while gaining insight into the essence of computing beyond technical specifics; mastering solid simulation and verification toolchains while driving technical consensus in open-source communities.

From the intricacies of the RISC-V instruction set to the grand design of GPU many-core architectures, from the rigorous logic of simulation and verification to collaborative innovation in open-source ecosystems, this course not only imparts knowledge but also strives to ignite students' passion for computing architecture, cultivating them into next-generation computing architecture leaders with technical depth, innovative courage, and academic vision. We believe this course will produce pioneers defining processor and AI computing paradigms for the next decade, and perhaps even nurture future Turing Award laureates.

Syllabus(2025/Draft)

RISC-V based Open-Source GPU Architecture and Design Exploration(Open Course)

Week Date Daily Topic of Teaching Content Guest/Video Sharing Content or Exploration Project Technical Report or Exploration Project
课周 日期 授课内容当天主题 嘉宾分享或探索项目 技术报告或探索项目
9:00-9:50 10:00-10:50 11:00-11:50
Lesson 1 10月18日 Course Overview and GPU Introduction UC Berkeley: Berkeley Approach to Graduate Education in EECS(David Patterson Video) Data Format and Converter Design in GPU
RIOS与零一代表讲话 课程概述与GPU简介(陈) 加州伯克利:伯克利EECS教学模式(大卫·帕特森) GPU中的数据格式与转换
Lesson 2 10月25日 SoC Design Flow and OpenEDA UC San Diego: OpenROAD-The journey so far and roadmap (Andrew Kahng) OpenTensorCore Architecture Open3DNoC Architecture
SoC设计流程与OpenEDA(陈) 加州圣地亚哥:OpenROAD进展与路线图(Andrew Kahng) OpenTensorCore 架构 Open3DNoC 架构
Lesson 3 11月1日 RISC-V Instruction Set and Architecture Synopsys:Full Flow of Digital Chip Design and Cutting-Edge Challenges (Muming Tang) OpenRoad Architecture Analysis llama.cpp Arcitecture Analysis
RISC-V指令集与架构(任) 新思科技:数字芯片设计全流程与前沿挑战(汤木明 资深总监) OpenRoad架构分析 llama.cpp架构分析
Lesson 4 11月8日 GPU Architecture and Open-Source RISC-V GPU Tsinghua: Ventus GPGPU Software and Hardware Design(Hu He) Vortex存储互连与流式处理器代码分析
GPU架构与开源RISC-V GPU(任、陈) 清华大学:乘影通用图形处理器软硬件设计(何虎 副教授)
Lesson 5 11月15日 2.5D/3D/3.5D Architecture and Memory Compute Integration CreMemory: Accelerating Computing Performance with Hybrid-Bonding Memory Chiplet Technology Exploration Project
2.5D/3D/3.5D架构与存算一体(任、陈) 青耘科技:以混合键合内存芯粒技术加速计算性能(赵建中 首席技术专家)
Lesson 6 11月22日 Tensor Core and Transformer DeToolIC: Chiplet Interconnect and EDA Challenges and Solutions Exploration Project
张量核心(TensorCore)与Transformer 德图科技:Chiplet互连和电子设计自动化技术(EDA)挑战和解决方案(蒲菠 副总经理)
Lesson 7 11月29日 Memory Architecture and Network on Chip (NoC) Google:XLA Exploration Project
存储架构与片上网络(NoC)
Lesson 8 12月6日 GPU Compiler and Triton Exploration Project Exploration Project
GPU编译器与Triton
Lesson 9 12月20日 GPU Deployment Acceleration and DeepSeek Optimization Principles Exploration Project Exploration Project
GPU部署加速与deepseek优化原理
Lesson 10 12月21日 Exploration Project Report Exploration Project Report Exploration Project
各探索项目总结报告

Exploring Projects

OpenTensorCore

Open3DNoC

OpenRVGPU

OpenLLMRoad

Guest Expert

image-20251220192224902

Main Contributor of Lecture

image-20251018172931236

image

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •