Operations Manager, GPU - #1130455

Singapore Telecommunications


Date: 12 hours ago
District: Singapore
Salary: $9,400 - $14,000 / month
Contract type: Full time
Work schedule: Full day
Singapore Telecommunications

Operations Manager, GPU Operations is responsible for leading the day-to-day operations of Singtel’s GPU-as-a-Service (GPUaaS).

 In this role, you will lead the operations team to ensure the highest levels of system uptime, availability, security, and performance, and to deliver a stable and reliable GPUaaS service that meets defined service level objectives (SLA/SLO).

 You will also act as the primary point of contact for the GPU infrastructure engineering team, working closely with them to implement platform upgrades, observability enhancements, security features, and continuous operational improvements across the GPUaaS platform.

 Responsibilities

  • Overall coordinator and primary point of contact for end-to-end GPU-as-a-Service (GPUaaS) operations, including data centre operations, reporting in accordance with the established organisational reporting structure

  • Lead day-to-day operations of GPU-as-a-Service and data centre operations, including hardware, environmental controls, networking, security and software.

  • Technical manager managing operations team, vendors and consultants to administer GPU-as-a-Service (GPUaaS) operations during regular operations and in emergency situations.

  • Coordinate with internal teams, vendors, and consultants for the operation and implementation of GPU-as-a-Service (GPUaaS) enhancements and related data centre initiatives.

  • Implement, validate, and continuously improve plans to ensure the highest levels of operational stability for  GPU-as-a-Service (GPUaaS), data centre operations, including scenarios involving GPU cluster hardware, software, and related equipment, as well as data centre infrastructure such as power or cooling outages.

  • Lead the resolution of incidents impacting GPU-as-a-Service (GPUaaS) environments, including GPU cluster hardware, software, and related equipment, as well as data centre infrastructure such as power or cooling outages; perform root cause analysis (RCA) as required and ensure findings are reported to customers and internal stakeholders within an appropriate and timely manner.

  • Present GPU-as-a-Service (GPUaaS) operational status and plans, including data centre operations, to senior management and relevant stakeholders.

  • Ensure incidents are responded and attended to, or escalated for resolution based on criticality, impact and SLA.

  • Build and lead a high-performing operations team to foster a culture of innovation, collaboration, and continuous improvement.

  • Set clear goals and objectives, mentor team members, and drive professional development initiatives.

  • Lead security incident management processes, focusing on identification, containment, and resolution of threats.

  • Enforce best practices for security and compliance within the GPU-as-a-Service (GPUaaS) environment.

  • Stay abreast of industry security trends and implement measures to safeguard customer data and platform integrity.

  • This role may require availability outside standard work hours, including nights, weekends and public holidays.

 Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or related field.

  • Minimum of 8 years in data centre operations and management with at least 3 years in a leadership/managerial position.

  • Knowledge and experience in data centre infrastructure, including servers, networking, storage, physical and cybersecurity. Bonus for knowledge and experience in GPU cluster.

  • Well versed in various equipment maintenance and upkeep, including electrical and mechanical.

  • Experience in leadership/managerial roles with excellent team management skills.

  • Organized and adaptive to changes in work schedules and arrangements.

  • Strong interpersonal and professional communications skills, as well as presentation skills.

  • Proficiency in managing customer interactions and improving service delivery to enhance customer experience.


How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume

Similar jobs

Senior Technician

Meggitt Aerospace Asia Pacific Pte Ltd, Singapore
1 hour ago
The Services and Support Operations (SSO) is a global leader of integrated MRO services and support. We are renowned for our exceptional customer service and expertise at every stage of the aircraft lifecycle, providing innovative OEM-quality service solutions across a...

Service Delivery Consultant ((Cryptography)

Talent Trader Group, Singapore
$4,500 - $5,500 / month
2 hours ago
Responsibilities: Provide technical expertise and guidance to clients on the design, implementation, and maintenance of cryptographic systems Conduct risk assessments, vulnerability analyses, and compliance reviews for client cryptographic infrastructure Develop and deliver customised training programmes to educate clients on best...
Talent Trader Group

Ocean Freight Coordinator (Ref: GMT)

Celecti Pte. Ltd., Singapore
$2,300 - $2,800 / month
3 hours ago
Location: Toh Guan 5-Day Work Week A global logistics company is hiring an Ocean Freight Coordinator to manage export shipments, documentation, and customer coordination. What You’ll Be Doing: Prepare and check export/import documents (e.g., HB/L, MBL, manifest, pre-alerts, customs declaration,...
Celecti Pte. Ltd.