This website uses Cookies. Click Accept to agree to our website's cookie use as described in our Privacy Policy. Click Preferences to customize your cookie settings.
Hi. I plan to utilize a multimodal vision-language transformer that only
(reasonably) supports English. The image translation feature in the web
version of Google almost perfectly suits my needs. Is this available for
use via an API? Thanks!
Not really. This was not on top of my priority list so I didn’t end up
spending more time on it. However, I think it is definitely feasible to
implement a solution.If you are interested in custom implementation,
please reach out! I have some ideas on...
This is cool! I was able to learn about the general gist of the API and
it looks good. Do you have any disclosable technical documentation on
how it works under the hood?
Hi @kvandres, thank you for the answer. However, my initial question is
left unanswered; I understand that training a custom model (using AutoML
or Vertex) is possible, but I want to know whether the end-to-end,
image-to-image translation feature cur...