Reliability Engineer

Mississauga, ON, Canada

Job Description


The Position Location: Mississauga or Costa Rica
Duration: Permanent
Position: Reliability Engineer

Reliability Engineering (RE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems, ensuring that Infrastructure services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to users' needs and a fast rate of improvement. Additionally RE’s keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. RE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.


  • Contribute to the maintenance of services once they are live by measuring and monitoring availability, latency and overall system health.
  • Look for continuous improvement activities both in technical, teamwork, collaboration and processes areas. Propose and contribute to continuous improvement activities.
  • Act as an analyst by transforming the customer needs into specific requirements to be implemented in components or managed services managed by the team or by other teams.
  • Remain proactive and aware of operational challenges and opportunities and work with support team staff to resolve incidents and major incidents.
  • Ensure implemented solutions and components comply with Quality/Regulatory standards, as applicable.

Job Requirements/Qualifications:
  • Demonstrated customer & delivery focus.
  • Well proven scripting and automation skills with strong knowledge in delivering and managing infrastructures code.
  • Ability to work effectively with team members and virtual teams from different locations and different cultural backgrounds.
  • Ability to function independently with low supervision and navigate ambiguity.
  • Strong problem-solving and decision-making skills.
  • Good oral and written communication skills in English. German, Spanish or Chinese (Mandarin) are significant pluses.
  • Moderate travel (20%) required and ability to work across multiple time zones, including on-call.
Education / Years of Experience:
4-7 years of relevant work experience or 2-5 years with Bachelor’s degree or 1-3 years with Masters degree. At least 1 years’ experience of working in one or more multinational work environments (e.g. healthcare industry experience is a plus) as a senior systems or software Engineer. Technology Skills:

  • Hands-on technical skills in automation, infrastructure as code, logging, monitoring and observability, infrastructure configuration, scripting languages and applications (python, Jenkins, Ansible, Terraform, CMP,AWX, Bitbucket, GitHub, ELK, Prometheus)
  • Knowledge about working with Infrastructure Systems internals, their administration and networking (Linux, Windows, Containers, .Application Hosting, AWS, Google, Azure, Nutanix, VMWare, Kubernetes)
  • Knowledge about applying design thinking, lean, prioritization and agile methodologies to evolve services offered to partners. (Scrum)
  • Knowledge about the definition of technical computing infrastructure entirely under the control of software with no operator or human intervention. (Infrastructure as Code and Configuration Management)
  • Knowledge about defining Service Level Objectives and Service Level Indicators. (SLOs, Metrics, SLI, Error Budget,
  • Knowledge about DevOps mindset, processes and tools.
  • Cross-Functional Technical Knowledge, tools/scripting/methodologies for: Configuration management, Infrastructure as Code, Automation Design, Infrastructure Development Life Cycle and hybrid Clouds.
  • Knowledge about algorithms, data structures, complexity analysis and software design.
Relocation benefits are not available for this job posting.
Who we are At Roche, more than 100,000 people across 100 countries are pushing back the frontiers of healthcare. Working together, we’ve become one of the world’s leading research-focused healthcare groups. Our success is built on innovation, curiosity and diversity. Roche Pharma Canada has its office in Mississauga, Ontario and employs over 850 employees. The Mississauga facility is bright, vibrant, fosters collaboration and teamwork, and is reflective of Roche's truly innovative culture. As of January 4, 2022, Roche requires all new employees who work in Canada to be fully vaccinated against COVID-19 on the date they take office. This requirement is a condition of employment at Roche that applies regardless of whether the position is on a Roche campus or remotely. If you have a valid reason for not being fully immunized, which is limited to certain specific medical reasons or other valid reasons protected by applicable human rights laws, you may request an exemption and / or adaptation measures regarding this vaccination requirement. Roche is an Equal Opportunity Employer.

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD2009722
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Mississauga, ON, Canada
  • Education
    Not mentioned