Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Cases where datastream overwrites a file twice causing double triggering of cloud function

RC1
Bronze 4
Bronze 4

Hi folks , I have a usecase where the files ingested by datastream is used to trigger(object finalize)  cloud function for further processing. I have seen cases where datastream writes a file (with same name) twice (to GCS) causing double triggering of cloud function. Does datastream have such issues ?

Solved Solved
0 1 204
1 ACCEPTED SOLUTION

Hi, are you sure these are duplicate files with the same content? Datastream will, by design, write multiple files into the same folder in Cloud Storage if either (a) the file exceeds the limit set in the Datastream configuration (size and/or time), or (b) if Datastream detects a change in the schema (each file only contains events that correspond the same schema).

View solution in original post

1 REPLY 1

Hi, are you sure these are duplicate files with the same content? Datastream will, by design, write multiple files into the same folder in Cloud Storage if either (a) the file exceeds the limit set in the Datastream configuration (size and/or time), or (b) if Datastream detects a change in the schema (each file only contains events that correspond the same schema).

Top Solution Authors