I'm primarily working with video-intelligence on a dataset of videos for
some academic research. I'm a complete novice, but I've started to get
the hang of things, and I'm getting some really useful outputs from
speech-to-text, text recognition, and ...