Cloud Infrastructure Engineer (Nvidia) - #1109044

Assurity Trusted Solutions Pte Ltd


Date: 8 hours ago
District: Singapore
Contract type: Full time
Work schedule: Full day

Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a dynamic digital and cyber landscape, where trust & collaboration are key, ATS continues to drive mutually beneficial business outcomes through collaboration with GovTech, government agencies and commercial partners to mitigate cyber risks and bolster security postures.

Responsibilities:

  • Design, deploy, and optimize Kubernetes clusters using the Nvidia software stack to support large language model applications.
  • Collaborate with cross-functional teams to integrate Nvidia GPU resources effectively within Kubernetes environments, ensuring optimal performance.
  • Implement and manage infrastructure as code (IaC) for Nvidia GPU configurations, focusing on scalability and high availability.
  • Monitor, troubleshoot, and resolve issues related to both Kubernetes clusters and Nvidia GPU resources to maintain a reliable and performant infrastructure.
  • Stay abreast of industry best practices and emerging technologies related to Kubernetes and the Nvidia GPU ecosystem.
  • Work closely with development teams to automate deployment processes, leveraging Nvidia GPU capabilities, and streamline workflows.
  • Implement security best practices to safeguard Kubernetes environments, Nvidia GPU resources, and sensitive data.
  • Participate in the on-call rotation and respond promptly to incidents, minimizing downtime for language model applications.
  • Contribute to capacity planning and performance tuning activities, considering the demands of large-scale language model applications utilizing Nvidia GPU acceleration.
  • Document infrastructure configurations, processes, and procedures, facilitating knowledge sharing and team member onboarding.


Requirements

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • Proven experience in designing, implementing, and managing on-premises infrastructure solutions.
  • Strong knowledge of server virtualisation, storage systems and network infrastructure.
  • Hands-on experience with cloud-native technologies and deployment strategies.
  • Proven experience designing, deploying, and managing Kubernetes clusters on platforms such as SUSE Rancher or Red Hat OpenShift.
  • Strong understanding of containerization (e.g., Docker), orchestration tools such as Kubernetes, and Nvidia GPU acceleration technologies.
  • Proficiency in scripting, automation and configuration management using tools such as Chef, Ansible, Terraform, or similar.
  • Familiarity with infrastructure-as-code principles and tools (e.g., Helm, Kubernetes manifests).
  • Experience with large-scale language model applications, particularly leveraging Nvidia GPU acceleration, is highly desirable.
  • Solid knowledge of networking concepts, Kubernetes networking models, and integration with Nvidia GPU resources.
  • Excellent problem-solving and troubleshooting skills, with a proactive approach to system optimization.
  • Strong communication skills for effective collaboration in a team-oriented, agile environment.

Join us and discover a meaningful and exciting career with Assurity Trusted Solutions!

 

The remuneration package will be commensurate with your qualifications and experience. Interested applicants, please click "Apply Now".

 

We thank you for your interest and please note that only shortlisted candidates will be notified.

 

By submitting your application, you agree that your personal data may be collected, used and disclosed by Assurity Trusted Solutions Pte. Ltd. (ATS), GovTech and their service providers and agents in accordance with ATS’s privacy statement which can be found at: https://www.assurity.sg/privacy.html or such other successor site.



Benefits

  • A wholly-owned subsidiary of GovTech.
  • We promote a learning culture and encourage you to grow and learn.

