Data Centre Operations Engineer - #1130441
Singapore Telecommunications
As a DC GPUaaS Operations Engineer for SingTel’s GPU-as-a-Service (GPUaaS), you will help implement processes and integrate operations to advance customers’ AI and HPC capabilities. You will be exposed to physical Data Centre implementation and operation as well as Data Centre software solutions in SingTel’s GPU-as-a-Service (GPUaaS). This position requires a forward-thinking individual who thrives in dynamic environments and is committed to driving continuous improvement in GPU Data Centre technologies for AI and HPC environments. This is an excellent opportunity for individuals eager to begin their career in state-of-the-art Data Centre technology and develop expertise in AI and HPC Data Centre platforms.
Responsibilities
Data Centre Operations Management
Ensure incidents are responded to and attended to, or escalated for resolution, based on criticality, impact and SLA.
Perform hands-on operations on air and liquid cooling systems and electrical systems in a data centre environment.
Participate actively in the continuous improvement of operations and processes, with an understanding of GPU-oriented data centre requirements.
Coordinate and obtain security clearances for visitors and vendors to enter Singtel’s GPU-as-a-Service (GPUaaS) data centre.
Manage vendors and ensure they adhere to WSH and house regulations for the duration of their work in the data centre.
This role may require availability outside standard work hours, including nights, weekends and public holidays.
Data Centre Facilities Management
Monitor the regular functioning of data centre facilities and infrastructure, both upstream and downstream (power, cooling, leakage detection, environmental control, etc.).
Update and maintain all DC-related documentation, including generating reports when required.
Coordinate with various stakeholders to resolve technical and process issues in SingTel’s GPU-as-a-Service (GPUaaS) data centre.
Ensure adherence to all standard operating procedures (SOP), methods of procedure (MOP), and emergency response procedures (ERP) established for critical GPU-as-a-Service (GPUaaS) operations.
Utilize expertise in power and cooling requirements for both air-cooled and liquid-cooled servers to support ongoing operational enhancements and strategic planning.
Apply an understanding of the latest data centre technologies to coordinate maintenance and shutdowns with stakeholders and vendors, ensuring optimal system reliability.
Prepare the monthly Facilities Management report on data centre health status.
Identify potential occupational safety and health risks within the data centre.
Visually inspect servers and cooling distribution units in the data centre.
Troubleshoot servers together with the remote engineering team.
Requirements
Diploma in Mechanical/Electrical Engineering/Building Services or a related discipline.
Broad understanding of data centre infrastructure electrical and mechanical systems, fire safety and protection, building management systems (BMS), equipment maintenance, space planning, and the development of mission-critical facilities.
Experience in the maintenance and upkeep of various data centre equipment, with a focus on electrical and mechanical systems.
Team-oriented, with a demonstrated ability to work effectively independently.
Organized and adaptable to changes in work schedules and arrangements.
An enthusiastic and inquisitive mindset, with a strong willingness to learn and acquire new skills in state-of-the-art, GPU-oriented data centre technology.