Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

How to stream MongoDB cdc changes to BigQuery using Dataflow?

RC1
Bronze 4
Bronze 4

Hi folks so I have a usecase where I need a realtime pipeline to push mongodb cdc data to big query for analytics. I went through Google cloud docs and I found a solution which covers some part of my problem [ https://cloud.google.com/dataflow/docs/guides/templates/provided-streaming#mongodb-to-bigquery-cdc ] . But here the pre requisites is to have "The change stream pushing changes from MongoDB to Pub/Sub should be running". In order to do this we have to use debezium or similar kind of service. We are mostly looking for a managed service for this usecase rather than self hosting debezium. Are there any services in GCP which can help me in this ? Also are there any plans for modifying apache beam mongo connector libraries  for cdc streaming ?

cc: @fguan 

 

1 6 4,491
6 REPLIES 6