Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Cloud composer data size

Hi,

Is there a way to check the total amount of data that is being loaded in GCP big query via cloud composer (Airflow environment) on a daily basis

0 1 106
1 REPLY 1

Hi @pavan26,

Welcome to the Google Cloud Community!

Here are some possible solutions for monitoring data ingestion in BigQuery:

  • INFORMATION_SCHEMA.JOBS: Use INFORMATION_SCHEMA.JOBS to find the total bytes processed for a specific job and get insights into your BigQuery data loading activities. You can query job_type, total_bytes_processed, and select additional columns according to your requirements, following the provided schema.
  • Cloud Monitoring: Leverage Cloud Monitoring to track key BigQuery metrics, including the amount of data and the number of rows uploaded to a specific table. Monitoring these metrics can serve as a proxy for tracking data usage during load jobs.

For more details, refer to the BigQuery monitoring documentation.

I hope the above information is helpful.