I have video files on my local disk for which I would like to create multimodal embeddings using Vertex API. I am finding that the largest MP4 file I can send is about 240 seconds. I thought that the documentation said if you specified a start and end time you could handle video files of arbitrary length. Am I doing something obviously wrong or do arbitrary length files only work if they are located on Google Cloud storage?
Hi @picheny,
Welcome to Google Cloud Community!
You're right! Vertex AI documentation confirms that specifying start and end times allows management of video files of any length, whether stored locally or in Google Cloud Storage. If you're encountering a 240-second limit with local files, potential factors include HTTP request size limits, client library constraints on file size, and network stability issues causing upload timeouts.
Here are some troubleshooting steps:
requests.post
with stream=True
in Python.For understanding Vertex AI's video processing capabilities, you can explore these resources:
I hope the above information is helpful.
I am using the "get_embeddings" api inside of the vertex API. You pass it a "video" object. If the object is on google storage everything works, but if the file is remote, it fails if the file is too large. I don't see how to apply solution 1, file transmission is buried somewhere in the API.
Hi @picheny,
You're right that the "get_embeddings" API in Vertex AI doesn't support streaming video data from local files. This makes it difficult to handle large videos stored locally because the API requires all video data to be provided at once, without the ability to handle smaller segments or streams. This limitation is likely due to the API's internal processing of video data as a single entity, which would need significant changes to support streaming video segments effectively.
Using Cloud Storage is ideal, Uploading your videos to Google Cloud Storage is highly recommended. It offers scalability, reliability, and seamless integration with Vertex AI's "get_embeddings" API.
Here are some approaches to consider:
I hope the above information is helpful.
User | Count |
---|---|
2 | |
1 | |
1 | |
1 | |
1 |