I'm building a pipeline to load data from my GCS bucket into BigQuery with a Wrangler transformation.
Here are the Data Fusion instance details:
I have already granted these roles to the service account:
And this is the log from the failed pipeline run:
10/31/2024 0:43:02
ERROR
Spark Program 'phase-1' failed.
10/31/2024 0:43:02
ERROR
Spark program 'phase-1' failed with error: Application application_1730309833442_0002 finished with failed status. Please check the system logs for more details.
10/31/2024 0:43:03
ERROR
Pipeline 'defect-project-dept-data-pipeline' failed.
10/31/2024 0:43:03
ERROR
Workflow service 'workflow.default.defect-project-dept-data-pipeline.DataPipelineWorkflow.52a8dc23-96e5-11ef-ab12-c66d48daef21' failed.
10/31/2024 0:43:03
ERROR
Program DataPipelineWorkflow execution failed.
10/31/2024 0:43:05
WARN
Container container_1730309833442_0001_01_000002 exited abnormally with state COMPLETE, exit code 1.
10/31/2024 0:43:10
ERROR
Dataproc job 'FAILED' with the status details: Job failed with message [java.lang.reflect.InvocationTargetException: null]
Can anyone help me with this issue?
Thanks.
Hi @instantkit,
Welcome to Google Cloud Community!
It seems you're encountering several errors in your Data Fusion pipeline. They are mostly generic, so they need further investigation and could have several causes. Here are some suggestions that may help resolve the issue:
If the issue persists, I recommend reaching out to Google Cloud Support for further assistance, as they can provide insights into whether this behavior is specific to your project.
I hope the above information is helpful.
Would it be possible to get the YARN application logs for application_1730309833442_0002?
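In case it helps, here is a rough Python sketch of one way to pull those logs. It picks the application ID out of the pipeline log text and wraps the standard `yarn logs -applicationId <id>` command. The function names are made up for illustration. It also assumes you run it on the Dataproc cluster's master node, that the cluster still exists (an ephemeral Data Fusion cluster may already have been deleted after the run), and that YARN log aggregation is enabled.

```python
import re
import subprocess

# YARN application IDs look like application_<cluster timestamp>_<sequence>.
APP_ID_PATTERN = re.compile(r"application_\d+_\d+")


def find_application_ids(log_text):
    """Return the distinct YARN application IDs in log_text, in order of first appearance."""
    seen = []
    for app_id in APP_ID_PATTERN.findall(log_text):
        if app_id not in seen:
            seen.append(app_id)
    return seen


def build_yarn_logs_command(app_id):
    """Build the argument list for `yarn logs -applicationId <app_id>`."""
    if not APP_ID_PATTERN.fullmatch(app_id):
        raise ValueError(f"not a YARN application ID: {app_id!r}")
    return ["yarn", "logs", "-applicationId", app_id]


def fetch_yarn_logs(app_id):
    """Run the command and return its output.

    Assumption: the `yarn` CLI is on PATH, which is normally only true
    on a node of the Hadoop/Dataproc cluster itself.
    """
    result = subprocess.run(
        build_yarn_logs_command(app_id),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

For example, `find_application_ids` applied to the "Spark program 'phase-1' failed with error" line above gives the ID to pass to `fetch_yarn_logs`. The full stack trace behind the `InvocationTargetException` is usually in the driver container's section of that output.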