I am developing a Dialogflow CX app connected to a website datastore.
I have the app in two different projects (one for testing and one for production) and I have the same configuration for both (i.e. sites to include and sites to exclude).
Both have not been automatically refreshed in the last 14 days, the second was created more recently and has more content than the first.
Website Datastore Test:
The Datastore Size continues to go down (now 1.13 gb) and when I ran manual refresh curl command it had no effect on the index.
The discrepancy between your test and production Dialogflow CX website datastores, despite seemingly identical configurations, highlights a potential issue with the website data itself or the indexing process. While both show no recent automatic refreshes, the production datastore's significantly larger size (4.99 GB vs 1.24 GB) strongly suggests different data being indexed. Let's address your concerns:
1. Why are the Datastores Different?
The most likely reasons for the difference are:
2. Ensuring Consistent Re-indexing:
To ensure both datastores stay consistent, follow these steps:
3. Monitoring and Management:
The indexing process isn't instant. Be patient and give it sufficient time to complete. If issues persist after following these steps, contact Google Cloud Support for assistance. They can investigate potential problems with the indexing process itself.
I hope the above information is helpful.
Ruth, thank you for your response.
1. I ensured that the configuration of sites was identical
2. I had to create a new datastore in order to avoid production issues
3. There is an issue of the datastore reducing in size, this behavior continues (the 4.99 gb is now 2.84 gb in size and is not shown on the activity graph. There is a cost to reindexing my website weekly which I did not expect when starting my project on Dialogflow.
4. I will look into the sitemap, thank you.
5. Another Critical Concern is that I tried the curl command for manual refresh, along with the manual recrawl uris feature in the console and neither worked (and the operations were "successful"). Documents were not added to the index and nothing showed up on the activity tab. This is extremely worrying.
User | Count |
---|---|
2 | |
1 | |
1 | |
1 | |
1 |