System Administrator, Data Science And Advanced Analytics

Toronto, ON, Canada

Job Description

:
Are you ready to be at the forefront of healthcare innovation? Since 2020, the Data Science and Advanced Analytics (DSAA) team at Unity Health Toronto (UHT) has been developing and implementing machine learning solutions - transforming complex healthcare data into actionable insights that drive better decision-making, enhance hospital efficiency, and improve patient care.
The DSAA team is looking for a skilled System Administrator to join their infrastructure team and support technologies primarily used by their data scientists, software engineers, and data engineers. As a System Administrator, you'll work with a diverse range of tools and platforms - including HashiCorp Nomad, XNAT, event streaming technologies like Kafka, Posit Teams, Prometheus, LXD, Nix, Keycloak, HashiCorp Vault, Postgres databases, data federation tools like Data Virtuality and specialized waveform platforms like Atrium DB.
On a typical day, you may be involved with the following:
Provide day to day maintenance on Linux systems, e.g., authentication services, security services, network drives etc.;
Perform accounts administration which includes creating, disabling and expiring users' accounts;
Configures bare metal, VM, and container infrastructure.
Works with the data scientists and product developers to understand their needs and requirements for implementing or improving underlying host systems.
Monitoring to ensure systems are performing, as required, checking to identify that there are no issues and/or errors on the systems in their logs;
Manage and monitor containers and container orchestration system to ensure systems are online and performing as required.
Troubleshoots problems that arise with the Linux systems, including: authentication systems, container-based services, web servers, hardware networking etc.;
Performs preventative maintenance on servers, when required;
Adopt emerging technologies in the AI/ML space to support data science;
Plan and automate deployment of hardware and software infrastructure;
Maintain active involvement in designated activities of new projects going live, e.g., testing process, etc.;
Works with other staff in the department to put together test cases and user acceptance testing, implementing results, in a timely manner;
Using and supporting version control systems (Gitlab) and infrastructure as code (e.g., with Terraform, Ansible);
Responds to downtime incidents on an on-call basis
Qualifications:
Completion of a recognized Bachelor's degree or a diploma in Computer Science, networking or related field required;
Five (5) years' experience in field required; with Linux certification, preferably GCUX, required;
In depth knowledge of Debian/Ubuntu environment required;
In depth knowledge of container and reproducibility solutions, (e.g. LXD Docker/Podman, and Kubernetes/Nomad, Nix), required;
Demonstrated flexibility and ability to adapt to change required;
Demonstrated strong analytical, organization, conceptual and decision making skills with the ability to work within a team environment required;
Familiarity with data science, product development, and deployment teams and technologies (in particular R and Python and their respective communities and ecosystems) is an asset;
High familiarity with monitoring technologies such as netdata, Promethus, Glitchtip, Grafana;
Specializations in or experience with cloud development/deployment, medical imaging, or high frequency data (waveform) infrastructure an asset;
High familiarity with version control systems (Gitlab) and critical extensions thereof (e.g., MLFlow, CI/CD) is an asset;
Demonstrated excellent verbal and written communication skills required;
Ability to handle situations involving unplanned outages required;
Well developed problem solving skills required;
Demonstrated commitment to continuous professional learning required;
Experience in an on-prem or hybrid (on-prem/cloud) corporate environment;
Experience with HPC or cloud computing is an asset
Unity Health Toronto is committed to creating an accessible and inclusive organization. We strive to provide a recruitment process that is barrier-free and in compliance with the Accessibility for Ontarians with Disabilities Act (AODA) and the Ontario Human Rights Code. We understand that you may require an accommodation at any stage of the recruitment process. When you are contacted, please inform the Talent Acquisition Specialist and we will work with you to meet your accommodation needs. We want to emphasize that all accommodation requests are handled with the utmost confidentiality, respecting your privacy and dignity.
#LI-MR1

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD2457477
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Toronto, ON, Canada
  • Education
    Not mentioned