Tech Lead, ML Infrastructure
We are seeking an experienced Tech Lead to grow out and lead a team of MLOps engineers, data scientists and annotation tooling developers who will be building the backbone of all Machine Learning applied research at the organization. On the one hand, data annotation, validation, split management, curation, and database management for a web-hosted game development platform. On the other, training orchestration, queue management, load balancing, failure recovery, observability. Key Responsibilities Infrastructure buildout: Steward the end-to-end planning, execution, and delivery of the data and training infrastructure for the organization. Cross-Functional Coordination: Act as the primary point of contact for the team that serves infrastructure to multiple other machine learning teams. Cost Management: Own the cloud spend and implement cost-tracking, resource allocation and lifecycle management. Developer Experience: Service mindset in making the day-to-day of ML engineers smooth, balancing engineering rigor with ease of use.