Hi EveryOne,
I have an issue related to a GKE Autopilot cluster:
Currently, I have this deployment:
"""apiVersion: apps/v1
kind: Deployment
metadata:
labels:
app: test
name: test
namespace: default
spec:
replicas: 1
selector:
matchLabels:
app: test
template:
metadata:
labels:
app: test
spec:
containers:
- env:
- name: RABBITMQ_HOST
value: 10.250.0.5
- name: QUEUE_NAME
value: testing
image: asia-southeast1-docker.pkg.dev/project-delivery-452706/delivery-worker/dev-delivery-worker-orchestration:1
imagePullPolicy: Always
name: test-image-1
resources:
requests:
cpu: "500m"
memory: "1Gi"
nodeSelector:
cloud.google.com/compute-class: Performance
kubernetes.io/arch: arm64
restartPolicy: Always
terminationGracePeriodSeconds: 30"""
When I use this architecture, I encounter the following error: 'Node scale up in zones asia-southeast1-c associated with this pod failed: GCE quota exceeded. Pod is at risk of not being scheduled.'
However, when I switch to the amd64 architecture, it has resources available and can schedule the pod normally. I’ve looked through some documentation and it mentions something about quotas, but I’m not sure what the specific issue is. Could you explain what the problem might be and how I can resolve it if I want to use the arm64 architecture for my pod? Additionally, what is this quota limit? Currently, when I check, none of my quotas are set to 'Unlimited,' and some haven’t even exceeded 10% of their usage.
Is it because the resources for the ARM64 architecture are insufficient to meet the demand in the Southeast Asia region?
Thank you."
Your Google Kubernetes Engine (GKE) Autopilot cluster is unable to provision additional nodes to accommodate your ARM64-based pod due to quota limitations in the asia-southeast1-c zone. Google Cloud enforces quotas to manage resource usage and ensure fair access across projects. These quotas are applied at both the regional and zonal levels and can vary based on the specific machine types and architectures. In your case, while your overall project quotas may appear underutilized, the quotas specific to ARM64 (T2A) machine types in the asia-southeast1-c zone might be exhausted or set lower than those for AMD64 (x86) machine types.
Request a Quota Increase:
Deploy in a Different Zone or Region
You're encountering a classic cloud resource quota issue, specifically related to the availability of ARM64 resources in your Google Cloud region (asia-southeast1) and zone (asia-southeast1-c). Let's break down the problem and how to resolve it:
Understanding the Problem
GCE Quota Exceeded:
ARM64 Resource Availability:
Specific Quota Limits:
Troubleshooting and Resolution
Check Specific Quotas:
Request a Quota Increase:
Consider Other Zones or Regions:
Reduce Resource Requests:
Remove the nodeSelector:
Regional Cluster:
Google Cloud Status Dashboard: