Remote Senior HPC Systems Administrator Job at RedLine Performance Solutions, Remote

dmlUYy9MZGZLWWtvTXZTeU1xdjNWSzdMTnc9PQ==
  • RedLine Performance Solutions
  • Remote

Job Description

RedLine Performance Solutions (RedLine) has been in the HPC solutions engineering services business for over 25 years and is consistently determined to keep the “bar of excellence” quite high for new hires. This enables RedLine to accomplish what other firms cannot and promotes a high level of staff retention. We offer services ranging from full life cycle HPC systems engineering to remote managed services to HPC program analysis.
RedLine is looking for a Senior High Performance Computing (HPC) Systems Administrator to join our team. The administrator for this team will be providing support and administration for a large on-premise HPC cluster and a small cloud-based HPC cluster. The Senior HPC Systems Administrator will be an experienced individual with a strong security, Linux, HPC, configuration management, systems automation, and networking background. 
Job Details

  • This position requires mission-critical monitoring and maintenance and will require off hours support in a team rotation .
  • US citizenship   and the ability to obtain a Public Trust clearance is a requirement to apply.
  • The preference is for the candidate to be in the Phoenix, AZ area, however the position can be remote with the possibility of some travel.
  • This full-time position includes a comprehensive benefits package featuring paid time off, a 401(k) match, health insurance, and a full range of additional benefits.
Job Responsibilities:
  • Provide HPC cluster administration using technologies such as HPCM, Lustre, Slingshot, Cray OS, and Slurm
  • Engage with the customer to identify the needs and user stories to build enhancements and upgrades for the HPC clusters
  • Work with configuration management solutions to develop Ansible playbooks to support image generation and server support
  • Work with version control systems to perform and review Git pull requests from the team to ensure that the cluster support follows best practices
  • Update and expand existing systems monitoring capabilities
  • Develop automation tools for cluster administration
  • Participate in resource optimization and job scheduling software and policies
  • Support HPE-based Cluster Management solutions
  • Provide technical support to researchers using HPC resources, troubleshoot problems, and develop appropriate computational strategies.
J ob Requirements: 
  • Minimum of 7 years SLES, RedHat and CentOS Linux system administrator experience in an HPC environment.
  • Experience with schedulers/batch systems (e.g., SLURM, PBS, LSF)
  • Experience with managing parallel and cluster file systems (e.g., GPFS, Lustre)
  • Network management experience, including in an HPC context (e.g., InfiniBand, OmniPath)
  • Demonstrated ability to configure, deploy, and manage a major system area such as batch system, network, data storage, backup system, database system, or distributed computing
  • Scripting experience (e.g., bash, Python, Perl).
Preferred Skills:
  • Experience supporting HPC cloud environments (e.g., Azure)
  • Server provisioning and image management
  • Experience with Lmod/Lua
  • Experience with MPI technologies
  • One of the ISC2 certifications (e.g., CISSP, SSCP) or Security+ certification
  • Experience integrating applications with cloud provider software stack.

Jobicy JobID: 134809

Job Tags

Full time,

Similar Jobs

Shanghai Plus Ivy Education Company

Private English Tutor for Children (Ages 7-12) Job at Shanghai Plus Ivy Education Company

 ...genuinely passionate about education.Strong intercultural communication skills and a professional demeanor. Job Posting: Private English Tutor for Children (Ages 7-12)About the Role:We are a dynamic English training institution seeking passionate and dedicated Private... 

Carelinks ABA

Registered Behavior Technician (RBT)/Behavior Technician (BT) Job at Carelinks ABA

 ..., compassionate individuals to join our growing team as Behavior Technicians (BTs) and Registered Behavior Technicians (RBTs). We specialize in providing...  ...professional growth. New to the field? Well provide the full RBT training program - and offer a completion bonus for... 

HDR

Bridge Coordinator/EIT Job at HDR

 ...progression. Familiarity with engineering software packages such as: LEAP Bridge Enterprise, FB-MultiPier, LPile, AASHTOWare BrR, midas Civil, MDX, SAP2000, CSiBridge, ADINA, RM Bridge, spColumn, LUSAS, STLBridgeLRFD, or STLBridge, MS Office, MathCAD and AutoCAD,... 

CRB

EHS Manager I Job at CRB

 ...projects are pioneering solutions addressing important issues such as food scarcity and global health. Job Description The EHS Manager coordinates implementation of the site EHS plan in accordance with company policies and procedures; provides information to the... 

Hungry Howie's Pizza

Delivery Driver Job at Hungry Howie's Pizza

 ...delivering the product to customers. You will also need to process cash and card payments.To be successful as a Pizza Delivery Driver...  ...and complaints. Benefits: The position pays cash daily, with drivers earning between $12-$20 per hour! Flexible Scheduling...