Come Work with Us!
At RBC, our culture is deeply supportive and rich in opportunity and reward. You will help our clients thrive and our communities prosper, empowered by a spirit of shared purpose.
Whether you’re helping clients find new opportunities, developing new technology, or providing expert advice to internal partners, you will be doing work that matters in the world, in an environment built on teamwork, service, responsibility, diversity, and integrity.
Job Title: Lead Site Reliability Engineer
What is the opportunity?
The WM Application Support, Maintenance and Transformation team has embarked on multiple initiatives to advance SRE practices and improve the resiliency and robustness of applications and processes. This role will be responsible for the ideation, implementation, administration, coordination, and support of Site Reliability Engineering (SRE) solutions for the Portfolio Management, Client Reporting, Charles River CoE, Surveillance, and Trading areas within Wealth Management.
What will you do?
- Champion Stability and Reliability across the Portfolio Management, Client Reporting, Charles River CoE, Surveillance, and Trading applications and services
- Work in conjunction with the SRE hub to develop and adopt SRE solutions (automation, monitoring, alerting, self-healing and reliability testing) and prioritize the work in alignment with the application and business roadmaps
- Explore and evaluate new technologies and drive innovation by designing and implementing new practices and processes
- Improve and standardize the monitoring solutions across all applications to predict and prevent production issues
- Own and develop reports for SRE Metrics - gather and analyze metrics from both infrastructure and applications to assist in performance tuning and fault finding.
- Identify and establish SLIs/SLOs and error budgets for all critical business processes
- Assist in incident management and problem management for applications in scope and spearhead blameless post-mortems for high-impact incidents
- Provide guidance to other team members on managing end-to-end availability and performance of mission critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions
What do you need to succeed?
Must-have
- Thorough understanding of SRE principles and strong problem solving and analytical skills to triage issues
- Well-rounded understanding of Windows and Linux operating systems (including command line, firewalls, certificates, etc.), cloud technologies (OpenShift/Azure, containerized apps), and experience working with APIs (REST and/or SOAP endpoints)
- Ability to quickly pick up new tools, programming languages, libraries, frameworks, and other technical concepts as needed.
- Hands-on experience with a variety of industry-standard SRE tools (Ansible, Dynatrace, Moogsoft, PagerDuty, ServiceNow, Slack, Elastic Stack)
- Excellent written and verbal communication skills and the ability to deal with key partners across the organization: Business, Operations, Application Development, Maintenance, and Infrastructure Teams
Nice-to-have
- Strong development background in at least a couple of programming/scripting languages (preferably Python, JavaScript, Java, Shell scripting, PowerShell)
- Exposure to Docker, Kubernetes, GitHub, NexusRepo & IQ, DevSecOps, IBM Urbancode Deploy, Jenkins, Jira, Confluence, Jira Service Desk, Databases.
- Experience in vendor management, application development, databases, systems engineering, and/or systems analysis
- Understanding of banking/financial services industry
What’s in it for you?
We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.