Hi Community,
I am getting a quota exhaustion message: "quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: gemini-pro".
However, I haven't even started using the gemini-pro model, had just been using bison so far and have a 60 QPM limit.
Any guidance on how to resolve this?
User | Count |
---|---|
2 | |
1 | |
1 | |
1 | |
1 |