Open Roles
We’re currently hiring for Generative Speech (LLM-based TTS). Other roles are planned and shown below for visibility.
Generative Speech — LLM-Based TTS
Research Engineer / Applied Scientist
Build controllable, identity-preserving speech synthesis for real-world video localization.
No formal application — just send what you have.
Optional: LinkedIn, CV, GitHub/paper link, or short intro.
Video Text Spotting — Detection, Tracking & Recognition
Applied Scientist / CV Engineer
Detect and track on-screen text in videos, then recognize it reliably across frames.
Optional: send a short intro + relevant work.
Video Motion Typography Translation — Layout-Aware Editing
Applied Scientist / Research Engineer
Translate animated on-screen typography while preserving motion, timing, and design.
Optional: send a short intro + relevant work.
Agentic LLM — Workflow Automation for Localization
ML Engineer / Applied Scientist
Build tool-using agents that orchestrate translation → terminology → QA → synthesis reliably at scale.
Optional: send a short intro + relevant work.