Toyota Tsusho Systems Europe

HPC Cluster Engineer

Full-Time in Zaventem, BR - Mid Level - Customer Service

HPC Cluster management:

• Administer the HPC cluster for Computer Aided Engineering (CAE) and the render cluster
• Maintenance of in-house shell scripts
• Failed computation investigation, problem determination, incident resolution, system support, coordination with vendor
• L1/L2 support on the HPC cluster for the customer
• Maintain applications running on the cluster
• Manage network aspects (DNS, DHCP, internet access, …) with Network Team
• Perform daily monitoring and ensure high availability of the cluster
• Manage patching and upgrade of the managed environment
• Monitor regular backups and ensure cluster high availability
• Establish long-term, centralized environment management
• Collaborate with other technical teams when required

Provide support when necessary for the customer’s project:

• HPC cluster migration to AWS Cloud

Support the customer when needed on the following (out of maintenance scope):

  • Maintain other servers, such as:
    • ECU compiler server
    • Terrace server
  • Data synchronization support
  • Manage patching of Linux systems, including offline systems
  • Installation and configuration of hardware, OS and software + tuning for all R&D Linux workstations
  • Support artificial intelligence engineers in setting up development environments on GPU HPC
  • Support the setup of a driving simulator based on a real-time OS
  • Ensure Linux environments match company security standards