Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Slow TPS

Hello,

We are running deepseek:70b model with SSD disk in g2-standard-96 instance but we get approximately 25 TPS in benchmark tests, which is impossible.What is the reason ?

Model: L4x8
Image: Deep Learning VM for PyTorch 2.4 with CUDA 12.4 M127

0 0 19
0 REPLIES 0