HPC System Administrator
This is for a High Performance Computing unit; generic Linux system administration skills will not be sufficient.

Job Summary: Provides computer-related system services for a broad and diverse variety of supported operating systems, operating system software, and utility programs.

Posting Position Title: High Performance Computing System Administrator

Departmental Focus: Within the department of Information Technology Services, provides hardware, software, and end user support for a growing number of High Performance Computing (HPC) clusters used in faculty research. Designs and builds High Performance Computing clusters. This includes system administration, hardware maintenance, applications, networking, account management, and security. Designs and builds large-scale, high-performance storage systems. Analyzes and tunes storage systems, networks, and workload managers. Works closely with vendors to resolve hardware and software problems. Provides end user support through a problem tracking system. Maintains system administrator and end user documentation. Remains current with emerging technologies and trends related to High Performance Computing. Possible off-shift work and on-call coverage.

Principal Responsibilities:
1. Provides technical expertise in resolving user system deficiencies and determines appropriate action.
2. Provides system services and analyzes system performance for stakeholders and intended end users. Performs all activities necessary to activate a new operating system or a new release of an existing system, including analysis, design, implementation, and related documentation. Analyzes system performance and modifies programs to increase the efficiency of operation. Reinstates the integrity of the system as quickly as possible following an outage in order to minimize time and data loss.
3. Recommends and authorizes system upgrades and software installations.
4. Designs, develops, and implements new system tools.
5. Analyzes the execution time of commonly used instructions to identify and replace those that are inefficient or slow in operation.
6. Analyzes and evaluates problems, takes steps to circumvent them, and restores systems to operating condition.
7. Contributes to the determination of specifications and determines the combination of options needed to tailor an operating system to meet business needs.
8. Conducts training and user education.
9. Researches new technologies, processes, and methodologies.

Required Education and Experience: Bachelor's degree. Four years of experience as a systems programmer with knowledge of one or more high-level languages, or an equivalent combination of education and experience.

Required Skills & Abilities:
Required Skill/Ability 1: Proven ability working in a high performance computing research environment.
Required Skill/Ability 2: Proven ability working with remote management of large-scale server environments.
Required Skill/Ability 3: Ability to work well individually and in a team environment.
Required Skill/Ability 4: Strong scripting skills (shell, Perl, Python).
Required Skill/Ability 5:

Preferred Education, Experience and Skills:
1. Six years of experience as a Linux systems administrator.
2. Expert knowledge of Red Hat/CentOS Linux.
3. Expert knowledge of HPC storage technologies.
4. Extensive knowledge of workload management and accounting systems.
5. Extensive knowledge of HPC provisioning and monitoring tools.
6. Extensive knowledge of high-speed HPC networks.
New Haven | Expired