About the Role
As a Site Reliability Engineer on TD SYNNEX's Innovation Site Operations team, you will be part of a group that is intensely focused on our customers' experience and the health of our StreamOne eCommerce solutions. Site Reliability Engineers influence how code is deployed, configured, and monitored, as well as the availability, latency, change management, emergency response, and capacity management of services in production.
What You'll Do:
Lead automation efforts for alert and issue remediation leveraging runbooks, tools, and documentation to help prepare on-call teams for future incidents.
Maintain documentation and runbooks so the Site Operations team can meet or exceed the defined IT KPIs.
Design, implement, and manage monitoring capabilities using APM, Alert, and Log Management solutions (PagerDuty, Dynatrace, Application Insights, and Graylog).
What We're Looking For:
You have extensive experience automating solutions to identified issues, bugs, and anomalies. You have a passion for replacing manual processes with efficient and concise automated solutions.
You have been responsible for running critical services that multiple customers depend upon. You understand the importance and impact that operational optimization can have on a product and the positive ripple effects that it can have across an entire organization.
Knowledge of Splunk, Graylog, Dynatrace, Nagios, Application Insights or equivalent monitoring tools.
What's In It For You?