Hello
My team and I want to create a process where a target table gets data from several sources.
All sources and target tables are BigQuery tables.
We want the target table to get data from each source independently from the other sources.
It's not possible to create several Dataform processes that have the same name and schema.
How do we create this kind of workflow?
Our Data warehouse is based on the same principle where a data model (Target table) is getting data from multiple sources (We have a 3 tier architecture). You would need to first create an intermediate layer(s) where you capture the data from each source independently using CDC/full refresh. This is important since Target table would be dependent on the upstream objects and that is why the upstream objects first should be maintained in dataform.
Later when you integrate the data from this multiple tables from this intermediate layer, you need to fetch the data as delta/full refresh.
For us the intermediate layer was much advantages such as :