-
/Responsibilities: Platform Operations & Implementation Assist in day-to-day operations for on-premises and cloud-based Kubernetes clusters. Develop and integrate critical components for networking, CI/CD tooling, OS
-
: Infiniband, Mellanox, OFED, Voltaire, Force10 Proven experience in: HPC architecture and performance tuning Cybersecurity in HPC/cloud environments Infrastructure as Code (AWS, Terraform, Ansible, Packer
-
centered on Microsoft Azure and related AI cloud technologies with High Performance Computing (HPC) for on-premises AI environments and Azure, AWS, and GCP for off-premises environments. Major Duties
-
at scientific edge systems using large-scale HPC/AI computational and storage systems. Design and evaluation of ephemeral, user-configurable, and composable data and storage systems. Evaluation of cloud data
-
for the design and analysis of computational methods that accelerate data analytics and machine learning, especially as they apply to scalable high-performance computing, cloud computing, and large interconnected
-
on-premises and in the cloud. Work with researchers and developers to configure complex pipelines to streamline multiple code repositories for automated deployments/upgrades, and work with others across
-
. Familiarity with version control systems, CI/CD pipelines, and cloud-based development environments. Special Requirements: This position requires the ability to obtain and maintain a clearance from the
-
: Experience with large-scale experiments on HPC or cloud platforms. Strong publication record commensurate with career stage. Familiarity with distributed training frameworks (e.g., DeepSpeed, Megatron-LM, Ray
-
). Knowledge of high-performance computing or cloud environments for large-scale data. Strong collaboration skills and ability to work in interdisciplinary teams. Special Requirements: Applicants cannot have
-
. Preferred Qualifications: Familiarity with running on scheduled clusters or cloud services. Experience in developing and applying biostatistics, AI, and ML-based solutions to healthcare data. Experience