Hi,
I am exploring Dataform for executing SQL workflows. After testing, I noticed costs attributed to Dataplex. Does Dataform use Dataplex internally? Is there an option to disable that? Thank you.
Best regards,
Dipu Chemmala Sadanandan
Hi dipucs,
Welcome to Google Cloud Community!
Yes, the managed Dataform service on Google Cloud is automatically integrated with Dataplex to provide enhanced data governance.
The costs you are observing are almost certainly from the Data Lineage API, which is a part of the Dataplex service family. After every successful run, Dataform automatically analyzes your SQL to determine column-level dependencies and sends this information to the Data Lineage API. This populates the lineage graphs you see in the BigQuery and Dataplex UIs. While basic metadata cataloging is very cheap, the processing of these lineage events is what incurs specific Dataplex charges.
You can disable this integration and its associated costs. The most direct and recommended way is to disable the Data Lineage API for your Google Cloud project. You can do this by navigating to "APIs & Services" > "Enabled APIs & services" in the Cloud Console, finding the "Data Lineage API," and clicking "Disable."
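If you prefer the command line to the Console, the same step can be sketched as below. This is a minimal sketch, not an official procedure: it assumes the gcloud CLI is installed and authenticated, that the Data Lineage API's service name is datalineage.googleapis.com, and that "my-project-id" is a placeholder for your own project ID. The helper name build_disable_command is hypothetical. The script only builds and prints the command, so nothing is disabled until you run it yourself.

```python
import shlex

# Service name of the Data Lineage API (assumed here; verify it in
# "Enabled APIs & services" before running the printed command).
DATA_LINEAGE_SERVICE = "datalineage.googleapis.com"


def build_disable_command(project_id: str) -> list[str]:
    """Return the gcloud argument list that disables the Data Lineage API.

    project_id is the Google Cloud project to change. This function only
    builds the command; it does not execute anything.
    """
    if not project_id:
        raise ValueError("project_id must not be empty")
    return [
        "gcloud",
        "services",
        "disable",
        DATA_LINEAGE_SERVICE,
        f"--project={project_id}",
    ]


if __name__ == "__main__":
    # "my-project-id" is a placeholder, not a real project.
    command = build_disable_command("my-project-id")
    # Print a shell-safe version of the command for review.
    print(shlex.join(command))
```

Review the printed command, then paste it into a shell (or pass the list to subprocess.run) once you are sure you want lineage collection turned off for that project.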
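However, be mindful of the consequences: once the API is disabled, Dataform runs no longer populate the lineage graphs you see in the BigQuery and Dataplex UIs.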
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.