My problem is that my usecase requires the AI engine I use to provide
predictions with the entire duration of the action. It seems to me that
vertex AI picks a random frame in the span of the action and return it
as the same start/end values. Here's ...