Responsibilities
- Manage automation platforms in support of infrastructure rollouts across cloud providers.
- Optimize the telemetry platform to identify customer-impacting events while providing relevant data to drive debugging.
- Partner with the engineering team to optimize the performance of services for cloud architecture.
- Debug live-site events and conduct follow-up postmortems and root cause analysis.
- Participate in an SLA-driven on-call rotation, which will include after-hours, overnight, weekend, and rotating holiday participation.

Required Skills and Experience
- A solid DevOps/sysadmin skill set. That means you are hands-on with Linux: you know three ways to find a file, can check system I/O utilization and measure network throughput, and are comfortable navigating large log files.
- Solid infrastructure automation experience. Python and Go are a plus.
- Knowledge of Kubernetes and the container ecosystem.
- Strong cross-group collaboration and communication skills.
- Familiarity with at least one of AWS, Azure, or Google Cloud.
- Experience debugging, diagnosing, and troubleshooting complex production software.
- B.S. degree in Computer Science or a related field.