feat: Add TensorRT-LLM recipes for G4 instances by tohaowu · Pull Request #57 · AI-Hypercomputer/gpu-recipes

tohaowu · 2025-12-04T22:47:53Z

Adds single-host TensorRT-LLM benchmark recipes for the following models on G4 instances:

- Qwen3-30B-A3B
- Qwen3-4B
- Qwen3-8B
- Qwen3-32B
- Llama3.1-70B
- DeepSeek-R1

Each recipe includes steps for VM creation, TensorRT-LLM setup, model quantization (if needed), and running benchmarks.

tohaowu wants to merge 1 commit into AI-Hypercomputer:add-g4-trtllm-recipes from tohaowu:add-g4-trtllm-recipes