Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Which quota needs to be adjusted for executing Spanner ML.PREDICT operations?

I'm not able to find any Vertex AI quota that seems to have any usage on it, but I'm getting quota exceeded errors. 

The error I'm getting is:
Statement failed: Vertex AI endpoint projects/alt-staging/locations/us-east4/publishers/google/models/text-embedding-004 quota has been exceeded. Please see Vertex AI error log, quota dashboard and https://cloud.google.com/vertex-ai/docs/quotas for details.

The code being executed is:
        query = """
            SELECT embeddings.values
            FROM ML.PREDICT(MODEL EmbeddingsModel,
            (SELECT CONCAT(COALESCE(@title, ''), ' ', COALESCE(@description, '')) AS content))
        """
        params = {"title": title, "description": description}
        param_types = {
            "title": spanner.param_types.STRING,
            "description": spanner.param_types.STRING,
        }

        with database.snapshot() as snapshot:
            result = snapshot.execute_sql(query, params=params, param_types=param_types)
            for row in result:
                return row[0]
    except Exception as e:
        logger.error(f"Error fetching embeddings: {e}")
        return None

Can somebody help me understand what quota I need to adjust?

 

Thanks!

 

1 4 407
4 REPLIES 4