I am developing a Dialogflow CX app connected to a website datastore.
I have the app in two different projects (one for testing and one for production) and I have the same configuration for both (i.e. sites to include and sites to exclude).
Neither has been automatically refreshed in the last 14 days. The second was created more recently and has more content than the first.
Website Datastore Test:
Documents usage / project quota limit: 15440 / 200000
Data size: 1.24 GB

Website Datastore Prod:
Documents usage / project quota limit: 15440 / 200000
Data size: 4.99 GB

I need to know that the two datastores are at least close to the same. I can re-index the test datastore now, but moving forward, how do I know that they will re-index the same way and contain the same website data? How can I manage and monitor this?
I have Advanced Website indexing turned on.
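To make the question concrete, the kind of check I have in mind is a diff of the indexed URLs in each datastore. Below is a minimal sketch, assuming I can somehow export the list of indexed URLs from each datastore (the export step is not shown, and I do not know the right way to do it). The function name compare_url_sets and the example URLs are hypothetical.

```python
def compare_url_sets(test_urls, prod_urls):
    """Compare two lists of indexed URLs.

    Returns (only_in_test, only_in_prod, overlap), where overlap is the
    share of all distinct URLs that appear in both datastores.
    """
    # Normalize lightly: trim whitespace and trailing slashes, drop blanks.
    # Assumption: no other normalization (case, query strings) is wanted.
    test = {u.strip().rstrip("/") for u in test_urls if u.strip()}
    prod = {u.strip().rstrip("/") for u in prod_urls if u.strip()}

    only_in_test = sorted(test - prod)
    only_in_prod = sorted(prod - test)

    union = test | prod
    # Two empty lists count as identical.
    overlap = len(test & prod) / len(union) if union else 1.0
    return only_in_test, only_in_prod, overlap


if __name__ == "__main__":
    # Placeholder data standing in for the exported URL lists.
    test_urls = ["https://example.com/a", "https://example.com/b/"]
    prod_urls = ["https://example.com/b", "https://example.com/c"]

    only_in_test, only_in_prod, overlap = compare_url_sets(test_urls, prod_urls)
    print("Only in test:", only_in_test)
    print("Only in prod:", only_in_prod)
    print(f"Overlap: {overlap:.1%}")
```

If something like this could run after each refresh, a drop in the overlap would tell me the two datastores have drifted apart.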
Thank you in advance