I'm not able to find any Vertex AI quota that seems to have any usage on it, but I'm getting quota exceeded errors.
The error I'm getting is:
Statement failed: Vertex AI endpoint projects/alt-staging/locations/us-east4/publishers/google/models/text-embedding-004 quota has been exceeded. Please see Vertex AI error log, quota dashboard and https://cloud.google.com/vertex-ai/docs/quotas for details.
The code being executed is:
query = """
SELECT embeddings.values
FROM ML.PREDICT(MODEL EmbeddingsModel,
(SELECT CONCAT(COALESCE(@title, ''), ' ', COALESCE(@description, '')) AS content))
"""
params = {"title": title, "description": description}
param_types = {
"title": spanner.param_types.STRING,
"description": spanner.param_types.STRING,
}
with database.snapshot() as snapshot:
result = snapshot.execute_sql(query, params=params, param_types=param_types)
for row in result:
return row[0]
except Exception as e:
logger.error(f"Error fetching embeddings: {e}")
return None
Can somebody help me understand what quota I need to adjust?
Thanks!
User | Count |
---|---|
2 | |
1 | |
1 | |
1 | |
1 |