Site Reliability Engineer, Distribution Engineering

NBCUniversal is seeking creative and driven Site Reliability Engineers to join our Distribution Engineering team. This team supports the infrastructure and systems that power NBCU’s broadcast, streaming, and monitoring platforms. Within Distribution Engineering, we’re hiring SREs across three closely integrated focus areas: Video Streaming, Monitoring & Control, and Playout.

As an SRE, you will be responsible for the engineering, operations, support, deployment, and maintenance of critical systems across on-premises and cloud environments. You will work in a fast-paced, agile environment where innovation and reliability are key.

Responsibilities
Develop automation to deploy, maintain, and monitor infrastructure and applications.
Troubleshoot and resolve issues in live, on-air environments.
Contribute to CI/CD pipelines, including code deployment, testing, and monitoring.
Create and maintain system metrics, dashboards, and alerting to ensure high availability.
Collaborate with engineering, operations, and vendor teams to support system health and performance.
Act as a Level 2 support resource for broadcast-related incidents, including root cause analysis and documentation.
Participate in an on-call rotation for 24/7 support coverage.
Evaluate new technologies and contribute to proof-of-concept deployments.
Document system configurations, incident resolutions, and operational procedures.

Job ID
744000069923465
DetailURL
https://jobs.smartrecruiters.com/NBCUniversal3/744000069923465
Job Level
Profession
LastUpdated
Search Meta
REF30987R Operations & Technology Engineering Engineering United States Connecticut Stamford
Job Reference number
REF30987R
Multi Location
No
Is Remote Job?
No