Monitoring Tools Analyst

About the role: Are you passionate about digital media, entertainment, and software services? Do you like big challenges and working within a highly motivated team environment?  Keen with respect to Observability and Reliability? Candidates for the Monitoring Tools Analyst role in our AIOps group will be responsible for the operation, continuous service improvements and data quality within the monitoring tools landscape. Other key responsibilities include: Responding to and administration of events and alerts from all toolsets Testing and management of monitoring software agents Synthetic test creation, modification, and reporting Trend analysis of alerts/events within toolsets and aggregation platform Working with incident management, problem investigation and application support teams when required Creation & maintenance of groups, dashboards and reports within monitoring toolsets Requirements: Technology Expertise and Ownership Expertise/operational support experience with monitoring toolsets such as OP5/Nagios, Datadog, or related monitoring toolsets for infrastructure and synthetic test monitoring Expertise with Agile DevOps methodologies, process & associated software (ServiceNow Agile, Jira) Expertise and troubleshooting skills for large-scale distributed computing systems and software Experience with automation, CI/CD pipeline & software testing (Ansible, Terraform, Puppet, Chef, and Jenkins) Working knowledge of OS management and features (Windows, Linux distributions) Knowledge of public cloud service offerings (AWS, Azure, Google) Familiarity with network technology concepts (TCP/IP, UDP, IPV4, IPV6, DNS, SSL, Firewalls, F5 LTM) Familiarity with general cybersecurity best practices, close collaboration with Cyber Security team Collaboration and Technical Communication Requirements Excellent written and verbal communication and presentation skills for both technical and non-technical audiences Ability to collaborate enthusiastically with DevOps teams, customers, and peers across our organization Ability to identify patterns and trends with monitoring and alerting Demonstrate a tolerance for stress and provide a supportive attitude for all colleagues

Job ID
744000012946845
DetailURL
https://jobs.smartrecruiters.com/NBCUniversal3/744000012946845
Job Level
Job Location
LastUpdated
Search Meta
51576428 Operations & Technology Engineering Information Technology United Kingdom All London
Job Reference number
51576428
Multi Location
No
Is Remote Job?
No