Technology Operations Readiness Lead

Toronto, ON, Canada

Job Description

Title: DevOps Technology Operations Readiness Lead
Location: Remote/On-site options available
Duration: 4 Months
Introduction
The successful candidate will play a pivotal role in ensuring seamless go-lives and stable operations for mission-critical systems. This role bridges engineering, operations, and business stakeholders to drive production readiness, operational excellence, and long-term system reliability. The ideal candidate brings strong technical expertise across SRE, DevOps, or technical operations, coupled with proven experience in preparing high-stakes systems for production success.
Required Skills & Qualifications

  • 5 years of experience in SRE, DevOps, or technical operations roles supporting production systems
  • Proven experience leading operational readiness and go-live efforts for critical, real-time platforms
  • Strong understanding of production observability, CI/CD workflows, and infrastructure management practices
  • Demonstrated success driving cross-functional coordination across technology, product, and operations teams
  • Hands-on familiarity with service-level objectives (SLOs), incident management frameworks, and chaos engineering methodologies
  • Experience operating in hybrid cloud or cloud-native environments (AWS, GCP, Azure, etc.)
  • Exceptional communication, stakeholder management, and documentation skills
Preferred Skills & Qualifications
  • Experience in regulated or high-availability industries (financial services, telecommunications, e-commerce)
  • Working knowledge of Kubernetes, Terraform, or observability stacks (Datadog, Prometheus, Splunk, etc.)
  • Certification in AWS, GCP, or DevOps/SRE disciplines
Day-to-Day Responsibilities
  • Lead go-live readiness and cutover planning for large-scale, real-time systems to ensure smooth production transitions
  • Coordinate across engineering, product, and operations teams to define, validate, and execute operational readiness criteria
  • Oversee end-to-end operations readiness programs, including incident management preparedness, service monitoring setup, and support documentation
  • Collaborate with DevOps and SRE teams to ensure alignment of CI/CD pipelines, infrastructure-as-code, and observability tooling with operational standards
  • Validate that monitoring, alerting, and SLOs are in place before production releases
  • Facilitate operational risk assessments, simulate failure scenarios (chaos testing), and ensure rollback/recovery procedures are tested and documented
  • Partner with platform engineering and cloud infrastructure teams to maintain hybrid or cloud-native environments with high availability and resilience
  • Champion a culture of continuous improvement, post-mortem learning, and proactive reliability engineering
Company Benefits & Culture
  • Inclusive and diverse work environment
  • Opportunities for professional growth and development
  • Comprehensive health and wellness benefits
For immediate consideration, please click APPLY to begin the screening process with Alex.

Job Detail

  • Job Id
    JD2920719
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type
    Full Time
  • Salary
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Toronto, ON, Canada
  • Education
    Not mentioned