Sort by
Refine Your Search
-
Listed
-
Category
-
Field
-
supporting new research and engineering using ORNL’s Frontier exascale supercomputer for its dense GPU-based HPC resources to train, deploy models and create large-scale production datasets for high-impact
-
supercomputer for its dense GPU-based HPC resources to deploy models and create large-scale production datasets for high-impact sponsor missions. The candidate will be expected to handle sponsor requirements and
-
CPU and GPU based HPC systems. Exploration of the capabilities of DPU/IPU SmartNICs to support network security isolation, platform level root-of-trust, and secure platform management/partitioning
-
. Demonstrated experience developing and running computational tools for high-performance computing environment, including distributed parallelism for GPUs. Demonstrated experience in common scientific programming
-
or OpenMP. Experience in heterogeneous programming (i.e., GPU programming) and/or developing, debugging, and profiling massively parallel codes. Experience with using high performance computing (HPC
-
(GPUs). Motivated self-starter with the ability to work independently and to participate creatively in collaborative teams across the laboratory. Ability to function well in a fast-paced research
-
supercomputer for its dense GPU-based HPC resources to train large GeoAI models and deploy models and create large-scale production datasets for high-impact sponsor missions. Major Duties/Responsibilities: Deploy
-
software engineering practices. Experience with GPU computing (e.g., CUDA, HIP), parallel computing (e.g., MPI, Actor Model). Familiarity with containerization (e.g., Docker, Podman, Apptainer), networking
-
reasoning or tool-augmented LLMs, RL (RLHF/RLAIF/online RL), or foundation models for science, Software engineering skills (Python) and experience with modern DL stacks (PyTorch) and multi-GPU training
-
container technologies in HPC environments. Experience with multiple system deployment mechanisms (Warewulf, PXEboot, Cobbler, Bright). Experience with GPU clusters (NVIDIA, AMD) for AI/ML and scientific