Hello guys,
I get this error when trying to run an ingest pipeline from on-prem SQL to BigQuery:
"Spark program 'phase-1' failed with error: The TCP/IP connection to the host xx.xx.xx, port 1433 has failed. Error: "connect timed out. Verify the connection properties. Make sure that an instance of SQL Server is running on the host and accepting TCP/IP connections at the port. Make sure that TCP connections to the port are not blocked by a firewall.".. Please check the system logs for more details.
I did a "connectivity test" in GCP where the destination was reachable. However when I try to ping the SQL server it cant reach from a Linux instance in the same network. I cannot reach it using Dataproc (SSH) either.
When looking at all completed Dataproc ingest jobs, they all have succeeded.
What can be the issue?
Solved! Go to Solution.
Here are several steps and considerations to help troubleshoot and resolve the issue:
Direct Port Connectivity Tests:
nc -zv xx.xx.xx 1433
). Successful tests indicate open network paths and ports, while failures suggest network or firewall issues requiring attention.SQL Server Configuration and Firewall:
Networking and Security:
Understanding Successful Dataproc Jobs:
Additional Investigation Points: