Sre – Observability Engineer

Montréal, QC, CA, Canada

Job Description

Job Summary


We are seeking a skilled Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will be responsible for ensuring the reliability, availability, and performance of our systems and services. This role requires a proactive approach to problem-solving and a strong understanding of both software development and IT operations. As an SRE, you will work closely with development teams to implement best practices for system reliability and scalability.

Responsibilities



Design, implement, and maintain scalable systems and infrastructure to support our applications. Monitor system performance, troubleshoot issues, and optimize resource utilization. Collaborate with development teams to enhance application performance and reliability through automation and continuous integration/continuous deployment (CI/CD) practices. Develop and maintain documentation for system architecture, processes, and procedures. Participate in on-call rotations to provide support for production systems. Implement monitoring solutions to ensure high availability of services and respond to incidents effectively. Conduct post-mortem analyses on incidents to identify root causes and implement preventive measures.

Requirements



Bachelor's degree in Computer Science, Engineering, or a related field. Proven experience as a Site Reliability Engineer or similar role in IT operations or software engineering. Strong knowledge of cloud platforms (e.g., AWS, Azure, Google Cloud) and container orchestration tools (e.g., Kubernetes). Proficiency in scripting languages such as Python, Bash, or Ruby for automation tasks. Familiarity with configuration management tools (e.g., Ansible, Puppet, Chef). Experience with monitoring tools (e.g., Prometheus, Grafana) for system health checks. Excellent problem-solving skills with a focus on improving system reliability and performance. Strong communication skills with the ability to work collaboratively in a team environment. Join us as we strive to enhance our systems' reliability while fostering a culture of innovation and excellence!
Job Type: Full-time

Pay: $100,000.00-$105,000.00 per year

Benefits:

Dental care Extended health care Life insurance Paid time off RRSP match
Work Location: In person

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD2749067
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Montréal, QC, CA, Canada
  • Education
    Not mentioned