Full-Time System Administrator
Job Description
SYSTEM ADMINISTRATOR
The future is what we make it. When you join Honeywell, you become a member of our global team of thinkers, innovators, dreamers and doers who make the things that make the future (#Futureshapers). That means changing the way we fly, fueling jets in an eco-friendly way, keeping buildings smart and safe and even making it possible to breathe on Mars. Working at Honeywell isn’t just about developing cool things. That’s why all of our employees enjoy access to dynamic career opportunities across different fields and industries. Are you ready to help us make the future?
The Enterprise Datacenter and Networks Organization designs, implements and operates a state-of-the-art Information Systems Infrastructure serving around 90,000 employees, in over 600 locations across 65 plus countries. We work with the latest and emerging technologies to deliver scalable, high performing, always available infrastructure services around the world. We deliver services that connect, host, virtualize, store, collaborate, integrate, compute and transact business solutions that utilizes cloud first, automation, analytics, “as a Service”, BIG data and software defined.
The HPC System Administrator will be responsible for the physical operational infrastructure support and configuration of the HPC environment in the Phoenix, AZ greater area and will support the Global HPC/GPU Compute Infrastructure with 100,000 cores at Honeywell. The candidate will manage HPC networks and storage, distributed DGX GPU compute resource, evaluate and establish AI/Deep Learning software stack, accelerate CFD applications, integrate on-premise hardware and operations with cloud-based solutions and assist evaluating and implementing emerging HPC technologies. This role includes application rationalization components to best assist business teams on migration, upgrades or decommissioning of legacy Infrastructure. This role will be responsible for managing, installing and troubleshooting HPC-based applications and assisting with automation activities.
Key Responsibilities:
- Assess, order, install, maintain and decommission the physical compute, storage and network HPC infrastructure in the Phoenix, AZ greater area (2 different locations)
- Coordinate with vendors the deployment and configuration of HPC infrastructure
- Move in/out physical media and load it for processing on air-gapped/insolated HPC systems
- Overall management and operational support of Global HPC and GPU Cluster Infrastructure in on-call shifts
- Work with HPC SGI-based and NVIDIA clusters, networks and storage
- Work with AI/Deep Learning Software stack
- Troubleshoot HPC performance issues related to on-premise and virtualized workloads
- Facilitate integration of HPC workloads to/from on-prem and off-prem cloud
- Participate in calls with other members to address and troubleshoot application and or infrastructure related issues
YOU MUST HAVE
- USA Security Clearance by DCSA is a mandate
- A bachelor’s Degree; preferably in Information Technology, Computer Science, Engineering or Business-Related Discipline
- 3+ years of experience in operational support of Linux OS, preferably REHL. Others might be considered.
- 2+ years of experience in other infrastructure areas such as storage and networking
- Experience in enterprise HPC Cluster ecosystems will be a big plus
- Excellent oral, written and collaborative communication skills
- The ability to partner effectively across IT teams, suppliers and business customers on cross-functional projects and process improvements
- Strong interpersonal skills – effective listening and teaming
- Self-motivated, demonstrated bias for action
- Skilled in partnering with internal customers at all levels to define problems, identify solutions, and facilitate change
WE VALUE
- Technical certifications in HPC, IT infrastructure, AI, Machine Learning, Deep Learning
- Technical certifications in RHEL or other Linux flavors
- Excellent leadership communication and executive presence
- Strong influencing, program and change management skills
- Strong business acumen and customer focus
- Creation of resulted-oriented Management Operating System
- Effectively demonstrates ability to deliver on complex situations or problems
- Creative and collaborative problem-solving capability
- Consistently makes timely decisions even in the face of complexity, balancing systematic analysis with decisiveness
- Excellent oral, written and collaborative communication skills, including executive level communications
How to Apply
an apply for this role at the Honeywell Careers site: Search results | Find available job openings at Honeywell180 total views, 0 today