
This job listing has expired and may no longer be relevant!

14 Aug 2024

Full-Time Manager, AI System Infrastructure and MLOps Engineering

Chan Zuckerberg Initiative – Posted by htaylor – Redwood City, California, United States

Job Description

The Team

Across our work in Science, Education, and within our communities, we pair technology with grantmaking, impact investing, and collaboration to help accelerate the pace of progress toward our mission. Our Central team provides the support needed to push this work forward.

The Central team at CZI consists of our Finance, People & DEI, Real Estate, Events, Workplace, Facilities, Security, Brand & Communications, Business Systems, Central Operations, Strategic Initiatives, and Ventures teams. These teams provide strategic support and operational excellence across the board at CZI.

The AI/ML Infrastructure team works on building shared tools and platforms to be used across the Chan Zuckerberg Initiative, partnering and supporting the work of an extensive group of Research Scientists, Data Scientists, AI Research Scientists, as well as a broad range of Engineers focusing on Education and Science domain problems. Members of the shared infrastructure engineering team have an impact on all of CZI’s initiatives by enabling the technology solutions used by other engineering teams at CZI to scale.

The Opportunity

As a hands-on Manager of the AI System Infrastructure and MLOps Engineering team, you will be joining the AI/ML and Data Engineering team in CZI Central Tech, with the responsibility for the stability and scalable operations of our leading edge GPU Cloud Compute Cluster. This supports our AI Researchers in their development and training of state-of-the-art models in artificial intelligence and machine learning to solve important problems in the biomedical sciences aligned with CZI’s mission, contributing to greater understanding of human cell function.

As the Engineering Manager of the AI Infrastructure and MLOps Engineering team, you will be responsible for a variety of MLOps and AI development projects that empower our AI Researchers and help to accelerate Biomedical research across the whole of the AI lifecycle. You will guide our AI Systems Infrastructure and MLOps efforts focused on our GPU Cloud Cluster operations, ensuring that our systems are highly utilized, performant, and stable. You will work in collaboration with other members of our own AI Engineering team as well as the Science Initiative’s AI Research team as they iterate on and train their deep learning code, optimizing systems operations and helping to troubleshoot problems encountered by jobs running on the cluster.

What You’ll Do

Help to build out the MLOps and Systems Infrastructure Engineering team, growing the team to support the large-scale capacity systems and AI training efforts we will be undertaking.
Drive our MLOps processes and System Infrastructure Engineering efforts in ensuring that our GPU Cloud computing systems are highly utilized and stable, and proactively guide our team in implementing the instrumentation and observability tooling integral to our AI Platform.
Own the on-call efforts for our GPU Cloud computing systems, building out the MLOps and Systems Infrastructure Engineering alerting and monitoring efforts for our leading-edge Kubernetes-based AI platform, including troubleshooting problems encountered on the GPU platform infrastructure and with jobs running on the cluster and computing systems.
Take responsibility for a variety of AI/ML development infrastructure, instrumentation, and telemetry projects that empower our team in supporting our users across the AI/ML lifecycle, taking a key role in simplifying and optimizing the systems and processes that are integral to our GPU Cloud Cluster operations, in a hybrid operations model where MLOps meets SRE.
Mentor and manage your team in fulfilling their roles to the best of their abilities, provide skill and career coaching to help team members keep growing along their own career and life paths, and keep the team engaged in meaningful and interesting projects in service of our north star philanthropic mission.

What You’ll Bring

Hands-on AI/ML Model Training Platform Operations experience in an environment with demanding data and systems platform challenges
MLOps experience working with medium to large scale GPU clusters in Kubernetes, HPC environments, or large scale Cloud based ML deployments (Kubernetes Preferred)
BS, MS, or PhD degree in Computer Science or a related technical discipline or equivalent experience
2+ years of experience managing MLOps teams
7+ years of relevant coding and systems experience
7+ years of systems Architecture and Design experience, with a broad range of experience across Data, AI/ML, Core Infrastructure, and Security Engineering
Strong understanding of scaling containerized applications on Kubernetes or Mesos, including solid understanding of AI/ML training with containers using secure AMIs and continuous deployment systems that integrate with Kubernetes or Mesos. (Kubernetes preferred)
Proficiency with Amazon Web Services (AWS), Google Cloud Platform (GCP), or Microsoft Azure, and experience with On-Prem and Colocation Service hosting environments
Solid coding ability with a systems language such as Rust, C/C++, C#, Go, Java, or Scala
Extensive experience with a scripting language such as Python, PHP, or Ruby (Python Preferred)
Working knowledge of Nvidia CUDA and AI/ML custom libraries.
Knowledge of Linux systems optimization and administration
Understanding of Data Engineering, Data Governance, Data Infrastructure, and AI/ML execution platforms.
PyTorch, Keras, or TensorFlow experience is a strong nice-to-have


How to Apply

https://grnh.se/dc9971171us

Job Types: Full-Time. Salaries: 100,000 and above.


Pink Jobs

Welcome to Pink Jobs, the free job listing and vacancy website focused on diversity and inclusion.

Use the search bar at the top of this page to search for positions of employment near you.

It functions in a very similar way to other job sites by letting you join as a member and list your resume/CV for recruiters to browse.

We want this site to help diversity- and inclusion-friendly individuals and employers find candidates and jobs near them. We hope we are giving you the tools to do this.

The site also includes information about Pink Jobs, lets you contact us, and provides information on diversity and inclusion in employment and on inclusive employers.


From LGBTQ Jobs, to full Diversity and Inclusion Support

Embracing Diversity and Inclusion: A Closer Look at Pink-Jobs.com and Its Role in the Job Market

Introduction

In recent years, the corporate world has seen a significant shift towards embracing diversity and inclusion. This change is not just a moral imperative but also a strategic business decision. Companies are realizing that a diverse workforce brings different perspectives, experiences, and ideas, which can drive innovation and improve decision-making. In this landscape, specialized job boards like Pink-Jobs.com have emerged as important tools in promoting workplace diversity, particularly for the LGBTQ+ and other minority communities.

Pink-Jobs.com: Fostering Inclusivity

Pink-Jobs.com, known for its focus on the LGBTQ+ and other minority communities, stands out as a beacon of inclusivity in the job market. It’s not just a job board; it represents a commitment to providing equal employment opportunities regardless of sexual orientation or gender identity. This platform allows employers who prioritize diversity to connect with job seekers from the LGBTQ+ and other minority communities, fostering a more inclusive work environment.

The Importance of Niche Job Boards

Niche job boards like Pink-Jobs.com are pivotal for several reasons. Firstly, they provide a safe and welcoming space for job seekers who might face discrimination in the broader job market. Secondly, they help employers who are committed to diversity and inclusion to target their recruitment efforts more effectively, ensuring that their job postings reach a diverse audience.


Impact on the Hiring Landscape

The emergence of platforms like Pink-Jobs.com has had a profound impact on the hiring landscape. They challenge traditional hiring practices by highlighting the need for more inclusive recruitment strategies. These platforms remind us that talent exists in all communities and that inclusive hiring practices are crucial for uncovering this often-untapped potential.

Beyond Tokenism: Real Inclusion

However, it’s important to note that simply posting jobs on such platforms is not enough. Real inclusion means creating an environment where all employees feel valued and are given equal opportunities to succeed. Companies need to implement policies and practices that support diversity and inclusion at every level of the organization.

The Future of Diverse Hiring

Looking forward, the role of job boards like Pink-Jobs.com is likely to become even more significant. As companies continue to recognize the value of a diverse workforce, the demand for such specialized platforms is expected to grow. These platforms not only help in meeting diversity targets but also play a crucial role in shaping a more inclusive corporate culture.

Conclusion

Pink-Jobs.com and similar platforms are more than just job boards; they are catalysts for change in the corporate world. They encourage companies to think beyond traditional hiring practices and embrace the rich diversity of talent available in the LGBTQ+ and other minority communities. As we move towards a more inclusive future, the role of these platforms in shaping diverse and welcoming workplaces cannot be overstated.

Pink Jobs The world's largest equal opportunity focused job board. 100% Free. #diversity #inclusion
