System Administrator High Performance Computing

Montréal, QC, CA, Canada

Job Description

#

General Information



Req #

WD00091520

Career Area:

Services

Country/Region:

Canada

State:

Quebec

City:

Montreal

Date:

Tuesday, December 16, 2025

Working Time:

Full-time

Additional Locations

:
Canada - British Columbia (Mobile) - Prince George Canada - Quebec - Montreal #

Why Work at Lenovo



We are Lenovo. We do what we say. We own what we do. We WOW our customers.

Lenovo is a US$69 billion revenue global technology powerhouse, ranked #196 in the Fortune Global 500, and serving millions of customers every day in 180 markets. Focused on a bold vision to deliver Smarter Technology for All, Lenovo has built on its success as the world's largest PC company with a full-stack portfolio of AI-enabled, AI-ready, and AI-optimized devices (PCs, workstations, smartphones, tablets), infrastructure (server, storage, edge, high performance computing and software defined infrastructure), software, solutions, and services. Lenovo's continued investment in world-changing innovation is building a more equitable, trustworthy, and smarter future for everyone, everywhere. Lenovo is listed on the Hong Kong stock exchange under Lenovo Group Limited (HKSE: 992) (ADR: LNVGY).



This transformation together with Lenovo's world-changing innovation is building a more inclusive, trustworthy, and smarter future for everyone, everywhere. To find out more visit

www.lenovo.com

, and read about the latest news via our

StoryHub

.
#

Description and Requirements



This role will be onsite in Prince George, British Columbia



We are currently hiring for an High Performance Computing System Administrator in British Columbia, Canada, to work on-site support to our customers.


As part of the Lenovo Professional Services team your responsibilities will include:

Monitoring, maintaining, and managing the physical infrastructure of a data center, ensuring its smooth operation, reliability, and security. Monitoring power and cooling systems and network connectivity Hardware and system software debugging and troubleshooting. OS Management Addressing hardware and software issues Responding to alerts, performing preventative maintenance, rolling out and upgrading firmware versions, and managing any issues that may arise to minimize downtime and optimize data availability Opening hardware trouble tickets against different vendors. Following up and reporting the progress on all issues. Respond to users and provide support to them on the daily operations of the cluster. Daily system administration tasks, including granting, deleting access, Investigate and correct defects in the cluster as reported, adhering to the service levels. Resolve errors through developing, testing and implementing changes to the system. Provide corrective and preventive maintenance, troubleshoot and isolate defects. Perform Software and firmware testing for any fixes, upgrades, security patch.

Working directly with the customer you will be responsible for:



The installation, configuration, and the support of services as required within the central customer Research Computing Services platform team. Working with vendors and customer Technology Office to design, implement and upgrade services using change management and revision Controlling processes to ensure that changes are properly tracked and available for audit when required. Analyzing and troubleshooting system issues, defining, and resolving complex issues. Developing innovative solutions to continuously improve HPC and address any shortfalls in provision. Working closely with other customer staff, including Infrastructure Technology, Security and Governance teams. Understanding the importance of security and seek specialist security advice to secure systems. Delivering a high-quality service through a collaborative approach and outstanding analytical skills.


The role gives you a great deal of independence and opportunity to take the lead and advise. You will be expected to work effectively in providing remote technical services in the areas of HPC & AI platforms and solutions. Also, you will be responsible for implementing and supporting HPC solutions at customer sites, involving Server, Storage, Network, Power and Cooling, OS, and cluster management software.

Basic Qualifications



3+ years of experience experience in Linux (ie Suse, RHEL & CentOS) and system administration

Preferred Qualifications:

Experience in HPC system troubleshooting, monitoring, and support Experience in System administration Able to perform OS installation and upgrades with no supervision Able to perform high level problem determination Customer service skills including written and oral communication with client


Additional Locations

:
Canada - British Columbia (Mobile) - Prince George * Canada - Quebec - Montreal

Beware of fraud agents! do not pay money to get a job

MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD3308773
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Montréal, QC, CA, Canada
  • Education
    Not mentioned