Site Reliability Engineer

Calgary, AB, CA, Canada

Job Description

Position Summary:





Reporting to the Manager, Information Technology, this position plays a significant role at the intersection of Stream's Development and Operations teams by applying software engineering principles to solve complex problems and automate away toil for our proprietary simulation engine.


As a Site Reliability Engineer, you will help us build and maintain the next generation of our infrastructure and ensure system reliability, applying best practices and strong infrastructure-as-code principles to achieve rapid, reliable, and scalable SaaS delivery.

What you will be doing on a typical day:



  • Design, implement, and maintain resilient and scalable systems with a focus on high availability, performance, and low latency
  • Develop and maintain CI/CD pipelines using Jenkins, and automate infrastructure and configuration management with Terraform and Ansible
  • Provision, configure, and maintain our infrastructure on Rocky Linux
  • Manage and orchestrate containerized workloads and services using Rancher and Helm
  • Implement and manage robust secrets management solutions using Vault and secure authentication and authorization with Keycloak
  • Ensure the reliability and performance of our distributed messaging and event streaming systems with Pulsar
  • Participate in an on-call rotation to respond to incidents, troubleshoot complex issues in production environments, and perform root cause analysis
  • Work closely with development teams to define Service Level Objectives (SLOs) and Service Level Indicators (SLIs) and conduct post-incident reviews to drive proactive improvements

Your strengths include...



  • Self-starter; you can be given a task and come back with results
  • 8+ years of proven experience as an SRE, DevOps Engineer, or a similar role
  • Proficiency in at least one scripting or programming language (e.g., Python, Go, Bash, PHP)
  • Excellent problem-solving skills, a proactive attitude, and the ability to work under pressure
  • Excellent communication skills, including the ability to translate SRE "lingo" into clear messaging for non-technical folk

We need you to check these boxes for you to be successful:



  • Deep expertise in Linux systems, specifically Red Hat/Rocky Linux
  • Strong experience with IaC using Terraform and configuration management with Ansible
  • Hands-on experience with container orchestration and management tools like Rancher and Helm
  • Solid understanding of CI/CD principles and experience building pipelines with Jenkins
  • Familiarity with distributed systems
  • Experience with a messaging platform like Pulsar
  • Experience with secrets and identity management tools such as Vault and Keycloak

It would be even better if you had experience with:



  • Chaos engineering and performance testing
  • Above-average networking and security mastery
  • Relevant certifications in cloud or SRE practices

Company Overview




Stream Systems (www.streamsystems.ca) is a leading simulation software company that empowers businesses to make smarter, quicker, and more efficient decisions. Our SimOpti intelligence platform helps companies quickly pinpoint and tackle optimization and decision-making challenges by leveraging Machine Learning, Deep Reinforcement Learning, and AI to drive future growth. We provide our customers with cutting-edge dynamic simulation tools that facilitate rapid, informed decision-making, laying the groundwork for strategic planning throughout the entire value chain of their operations.

Hybrid Work Environment




This is a full-time position based on a 40-hour work week. Stream's head office is in Calgary, Alberta with remote workers located across the country. We are happy to provide a hybrid work environment, enabling you to work from home most of the week or in a virtual location of your choosing.

However, this role requires you to live in Calgary in order to participate in the on-call rotation.

We encourage a strong collaborative culture and provide workspaces in a variety of locations for team collaboration, design, planning sessions and social activities as available.

Benefits




Company benefits are available to our full-time, permanent employees and include extended health care, dental, long-term disability, AD&D, and life insurance for you and your dependents. The real benefits, in our opinion, lie in becoming part of a flexible, passionate, and dedicated team where we continue to learn from one another every day.

Background Screening/Intellectual Property




As a business intelligence and optimization company, we work with a large amount of customer Intellectual Property. Therefore, successful candidates will be required to complete a background check, including employment references, education verification, and a criminal record check.

Come do great things with us!


Job Detail

  • Job Id
    JD3092083
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type
    Full Time
  • Salary
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Calgary, AB, CA, Canada
  • Education
    Not mentioned