System Admin GPU Job at Noblesoft Technologies, Atlanta, GA

VVI2QlJjaE5pZytzV0FIdkF6eVFWeCtUV2c9PQ==
  • Noblesoft Technologies
  • Atlanta, GA

Job Description

Title : System Admin GPU

Location :Atlanta GA (onsite)

Required Skills

  • Proven ability to orchestrate bare metal linux systems at scale including building automation for firmware updates, bios config management, configuring PXE environments.
  • Deep Linux systems experience including troubleshooting network interfaces, developing and applying configuration management, security best practices and monitoring and alerting.
  • Strong automation mindset. Expert knowledge in 1 or more orchestration tools such as MaaS, Salt, Chef, Ansible or Puppet.
  • Strong communication skills. Your job will involve writing detailed documentation for others to pick up or leading knowledge sharing sessions with operations teams.
  • Bonus skills include:
    • Hands-on experience in High Performance Computing (HPC) clustered environments from Nvidia or AMD. Experience in performing automated wide scale testing on NCCL or other frameworks.
    • Network engineering experience with VyOS platforms.

What You'll Be Working On:

  • Provisioning and automating GPU Bare Metal deployments
  • DevOps - Assist customer support and CloudOps teams with GPU specific knowledge/debugging during customer escalations
  • Performance testing, analysis and monitoring
  • Firmware, BIOS, Kernel upgrades and testing
  • Strong understanding of Linux based operating systems
  • Deep experience with the internals of QEMU, KVM, Linux kernel and libvirt. Strong proficiency in C.
  • Strong knowledge of DO's proprietary services and how they intersect with our virtualization stack.
  • Accelerate the virtualization of next generation GPU enabled platforms that power AI/ML workloads.
  • Work with hardware engineering teams and vendors to validate GPU fabric performance. Optimize performance while maintaining DOs high security standards.
  • Collaborate with open source Linux, QEMU and libvirt communities to drive the evolution of Linux virtualization technologies and incorporate them into the DO fleet.
  • Backport, build, and deploy software patches in order to support new features, backport bug fixes, and resolve security issues.

Job Tags

Contract work,

Similar Jobs

University of Massachusetts Boston

Assistant Professor (Economics) Job at University of Massachusetts Boston

The Department of Economics at UMass Boston invites applications for a tenure track Assistant Professor position in Health Economics and related fields to begin September...  ....edu UMass Boston is an urban public research university with a teaching soul, whose impact... 

Costello's Ace Hardware

Masonry Technician (Rockville, MD) Job at Costello's Ace Hardware

 ...Position Overview: As aMasonryTechnician, you will play a critical role in maintaining and improving the structural integrity and safety of residential and commercial chimneys. Your primary responsibilities will include relining chimneys, repairing smoke chambers, and... 

South Texas Health System - Clinics

Tech - Vascular Ultrasound Job at South Texas Health System - Clinics

 ...field, disease, and new procedures as they evolve. Vascular/Echo Techs are responsible for interacting in a positive manner with all...  ...Duties/Responsibilities: Operating imaging equipment such as ultrasound Embalming bodies using techniques such as arterial embalming... 

Tricon Solutions

Documentation Specialist Job at Tricon Solutions

 ...Job Title: Documentation Specialist Location: 125 Colfax StreetSpringdale, PA 15144 Duration...  ...version control. Conduct regular reviews and updates of documentation to reflect...  .... May require collaboration across multiple departments and remote teams.... 

ARC Document Solutions, LLC

Outside Sales Rep - Graphics and Color Job at ARC Document Solutions, LLC

 ...resilient organization that values excellence and responsiveness. Riot Creative Imaging (www.riotcolor.com) , our specialized visual color graphics division, excels in transforming spaces through immersive environmental graphics and sustainable printing solutions . As a...