Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Data Fusion - Preventing Duplicated from sinking to Bigquery & Denormalise

Please assist?
 
I would like to prevent duplicates entering into my Google Bigquery db. I have used the Distinct plugin and noticed that my final data is more than 50% less as my original data. Can i trust this plugin(see screen shots)?
 
In this ETL Flow, i would like to denormalise some tables to make it more compact in Bigquery. I would like to use the Joiner plugin, see attached. The idea is to join 2 or more tables in the ETL flow and have one table written in Bigquery for it. Please advise?
 
Screenshot 2022-07-21 at 12.22.15.pngScreenshot 2022-07-21 at 12.23.00.pngScreenshot 2022-07-21 at 12.29.02.png
0 3 1,113
3 REPLIES 3