Which quota needs to be adjusted for executing Spa... - Page 2

TheBigMac · 04-02-2025 07:08 AM

I'm not able to find any Vertex AI quota that seems to have any usage on it, but I'm getting quota exceeded errors.

The error I'm getting is:
Statement failed: Vertex AI endpoint projects/alt-staging/locations/us-east4/publishers/google/models/text-embedding-004 quota has been exceeded. Please see Vertex AI error log, quota dashboard and https://cloud.google.com/vertex-ai/docs/quotas for details.

The code being executed is:
query = """
SELECT embeddings.values
FROM ML.PREDICT(MODEL EmbeddingsModel,
(SELECT CONCAT(COALESCE(@title, ''), ' ', COALESCE(@description, '')) AS content))
"""
params = {"title": title, "description": description}
param_types = {
"title": spanner.param_types.STRING,
"description": spanner.param_types.STRING,
}

with database.snapshot() as snapshot:
result = snapshot.execute_sql(query, params=params, param_types=param_types)
for row in result:
return row[0]
except Exception as e:
logger.error(f"Error fetching embeddings: {e}")
return None

Can somebody help me understand what quota I need to adjust?

Thanks!

Which quota needs to be adjusted for executing Spanner ML.PREDICT operations?