Hi all,
We have a Postgres database (Amazon-hosted) with one view (10 columns) that I need to transfer to BigQuery on a daily basis. What would be the best solution for this? It must be automatic.
Can I create an ODBC connection and somehow link the servers?
I'm very confused by GCP; there are so many services and functions, and I couldn't find a simple solution for my issue.
Thank you in advance for your answers!
Unfortunately, a direct ODBC connection between your PostgreSQL instance (hosted on AWS) and BigQuery isn't currently supported. However, there are several effective approaches:
Datastream: This is ideal for near real-time replication of entire databases, but might be a bit heavy-duty for just a single view.
Custom ETL Pipelines: This gives you complete control using tools like Cloud Composer (Apache Airflow) or Dataflow (Apache Beam). However, it requires more technical setup and maintenance.
Third-Party Tools: Services like Fivetran or Stitch specialize in data integration and could simplify the process, but often come with a cost.
My Recommendation: Dataflow + Cloud Scheduler + Pub/Sub (Google Cloud)
This approach is well-suited for your needs, as it leverages the power and flexibility of Google Cloud Platform:
How It Works
Thank you for your answer!
Hi @lucierabochova, if you need to automate the daily transfer of a PostgreSQL (Amazon RDS) view to BigQuery, here are a few options to consider:
Using Dataflow (Managed by Google Cloud)
Using Cloud SQL + Scheduled Queries
Using a Third-Party Connector (Faster & More Flexible)
Since you're finding GCP a bit overwhelming, you might want to start with a third-party tool like Windsor.ai for a quick setup. Once you're comfortable, you can explore GCP-native options like Dataflow or Cloud SQL for a more customized approach.
Hope this helps!
You can use Google Cloud Dataflow or Cloud Functions with a scheduled trigger to automate the transfer. Another simple option is BigQuery Data Transfer Service with AWS S3 as an intermediary (export Postgres data to S3, then load it into BigQuery). Direct ODBC linking is not natively supported for BigQuery.
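For a single 10-column view, the scheduled-function route can be quite small. Below is a minimal Python sketch of what such a function might do, not a definitive implementation. It assumes the psycopg2 and google-cloud-bigquery packages, and every name in it is hypothetical rather than taken from this thread: the view public.daily_report_view, the table my-project.analytics.daily_report, and the PG_DSN environment variable. It also assumes the view is small enough to fit in memory and that the Postgres instance is reachable over the network from wherever the code runs.

```python
import datetime
import decimal
import os

# Hypothetical names, not from this thread: replace them with your own.
SOURCE_QUERY = "SELECT * FROM public.daily_report_view"
DESTINATION_TABLE = "my-project.analytics.daily_report"


def to_jsonable(value):
    """Convert values that JSON cannot represent directly into strings."""
    if isinstance(value, (datetime.date, datetime.time)):
        return value.isoformat()  # datetime.datetime is a date subclass
    if isinstance(value, decimal.Decimal):
        return str(value)
    return value


def rows_to_records(column_names, rows):
    """Turn DB-API row tuples into a list of dicts keyed by column name."""
    return [
        {name: to_jsonable(value) for name, value in zip(column_names, row)}
        for row in rows
    ]


def transfer_view():
    """Read the whole view from Postgres and overwrite the BigQuery table."""
    # Imported here so the helpers above work without these packages.
    import psycopg2
    from google.cloud import bigquery

    # PG_DSN is an assumed environment variable holding the connection
    # string, e.g. "host=... dbname=... user=... password=...".
    conn = psycopg2.connect(os.environ["PG_DSN"])
    try:
        with conn.cursor() as cur:
            cur.execute(SOURCE_QUERY)
            column_names = [col[0] for col in cur.description]
            records = rows_to_records(column_names, cur.fetchall())
    finally:
        conn.close()

    client = bigquery.Client()
    job_config = bigquery.LoadJobConfig(
        autodetect=True,  # let BigQuery infer the schema from the rows
        write_disposition=bigquery.WriteDisposition.WRITE_TRUNCATE,
    )
    job = client.load_table_from_json(
        records, DESTINATION_TABLE, job_config=job_config
    )
    job.result()  # wait for the load job; raises if it failed
    return len(records)
```

Calling transfer_view() once a day from a scheduled job would replace the BigQuery table with the current contents of the view. WRITE_TRUNCATE is used here because a full daily refresh of a small view is simpler than tracking changes; if you need history, appending with a load-date column is an alternative. Column types other than dates and decimals (bytes or UUIDs, for example) would need their own conversion in to_jsonable.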