Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Can't use Dataproc 2.2

When trying to use Dataproc image version 2.2. the gcloud CLI raises a validation error.

OrrGranulate_0-1709137700277.png

Trying the same command with version 2.1 works properly.
Is there anything I'm doing wrong here?

0 2 493
2 REPLIES 2

While image deprecation is a common reason for validation errors in Dataproc, there are other potential causes for the error you're encountering with version 2.2:

  • Availability and Region: Image versions can be region-specific. Ensure that version 2.2 is actually available in your desired region (us-central1).
  • Syntax and Typos: Double-check the command syntax and make sure there are no typos in the version number or any other parameters.
  • Additional Restrictions: There may be additional restrictions or limitations on certain image versions. Consulting the Dataproc documentation or release notes might reveal these.

Troubleshooting Steps

  1. Verify Availability:

  2. Check Syntax and Typos:

    • Carefully review your gcloud dataproc clusters create command. Ensure no typos exist in the image version specification or other parameters. Pay close attention to the region specified.
  3. Consult Release Notes:

  4. Update/Upgrade if Necessary

    • If 2.2 is indeed unavailable in your region or has some limitations, choose a supported version from your gcloud dataproc images list results.
    • Consider upgrading to the latest Dataproc image for maximum benefits.

Example:

 
gcloud dataproc clusters create cluster-251b --region us-central1 --master-machine-type n2-standard-4 --master-boot-disk-size 500 \
--num-workers 2 --worker-machine-type n2-standard-4 --worker-boot-disk-size 500 \
--image-version 2.2.11 --project granulate

Is the `gcloud dataproc images list `command publicly available? I couldn't find any reference to it or access it via cloud shell.