Site Reliability Specialist

Ottawa, ON, Canada

Job Description


If you are talented and experienced as a site reliability specialist, Aplin has the right opportunity for you! Our Ottawa-based client is seeking a site reliability specialist. This is a 2-year contract with high potential for extension. This is a hybrid opportunity available to candidates in the Ottawa area.

Advantages and advantages:

  • Competitive Salary
  • high-energy, team-focused environment
Responsibilities:
  • Monitor log feeds, flows, and alerts for data interruptions and server health.
  • Perform the administration of monitoring platforms and related infrastructure, working with IT specialists in cyber, network, storage, security, virtual infrastructure, platform, and database to deliver their logging requirements.
  • Work with employees to refine monitoring dashboards.
  • Monitor the performance of systems and service agreements to ensure that ongoing service delivery standards, service levels, and IT policies are met.
  • Examine and improve current logs, and when possible, look into opportunities for automation or fine tuning.
  • Make and maintain management packs and scripts.
  • Support the migration of data sources and log traffic.
  • other related activities and deliverables as required.
Qualifications:
  • A university degree or college diploma in computer science, networking, engineering, or a related field
  • A minimum of five (5) years of relevant work experience with log management and monitoring
  • must be able to attain a secret security clearance.
  • Splunk and Syslog-NG experience is required.
  • knowledge of SCOM (2012 R2, 2016, and 1807) and vendor management packs.
  • demonstrated experience with SCOM report customization and SQL Report Server administration.
  • experience with SolarWinds.
  • Experience with basic admin-level Windows or Linux basic operations around file systems
  • experience with PowerShell scripting.
  • Knowledge of load testing, monitoring, and performance management tools for every layer of the environment
  • demonstrated experience using scripting languages such as Bash or Python, specifically for system automation.
  • Demonstrated experience in log management, service, network, and application monitoring is an asset.
  • Experience with testing high availability environments and performing disaster recovery tests is a plus.
  • Demonstrated knowledge and expertise in site reliability engineering, creating SLOs and SLAs, enhancing observability, and incident management work is an asset.
  • Demonstrated experience with infrastructure scripting and automation or familiarity with infrastructure as code is an asset.
Aplin, one of Canada\'s Best Managed Companies, is an employment agency that finds top talent for exceptional organizations across North America. There are no fees to apply to our jobs or engage with our recruiters to find a new career. Companies hire us to help them grow their teams. Visit our website to learn more:

David Aplin Group

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD2111690
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Ottawa, ON, Canada
  • Education
    Not mentioned