-
unique opportunity to engage in transformational research that advances the development of AI-ready scientific data, optimized workflows, and distributed intelligence across the computing continuum. In
-
of relevant experience in Linux systems administration or HPC systems engineering. Preferred Qualifications Demonstrated experience leading the design and deployment of HPC or large-scale distributed computing
-
. Demonstrated experience developing and running computational tools for high-performance computing environments, including distributed parallelism for GPUs. Demonstrated experience in common scientific programming
-
/O solutions (e.g., HDF5, ADIOS2), and distributed computing tools relevant to data preparation. Evidence of ability to conduct independent research and publish in peer-reviewed venues. Preferred
-
for Science @ Scale: Pretraining, instruction tuning, continued pretraining, Mixture-of-Experts; distributed training/inference (FSDP, DeepSpeed, Megatron-LM, tensor/sequence parallelism); scalable evaluation
-
, Mixture-of-Experts; distributed training/inference (e.g. FSDP, DeepSpeed, Megatron-LM, tensor/sequence parallelism); scalable evaluation pipelines for reasoning and agents. Federated & Collaborative