Rutgers, The State University of New Jersey, is seeking a System Administrator for the Office of Advanced Research Computing (OARC). The System Administrator supports the development of university-wide advanced cyberinfrastructure resources which are necessary to foster a scientific community of excellence in computation and data that empowers research, learning, and societal engagement. This position also serves as OARC’s primary contact for the Caliburn cluster infrastructure support and will work closely with the OIT Data Center team on Caliburn support.
Among the key duties of this position are the following:
Provides expertise in specific technologies used by multiple units around the University, such as parallelization of code, Hadoop, and high-performance networking.
Maintains contact with researchers around the University, developing recommendations for software that OARC should license, and hardware and services that require support.
Assesses the needs of research projects, recommending an appropriate mix of departmental, University, national, and commercial resources, and helping researchers set up and use those resources.
Minimum Education and Experience:
Bachelor's degree in Computer Science or similar technical field or an equivalent combination of education and related experience.
A minimum of five (5) years of relevant experience in system administration, systems programming or research computing.
Required Knowledge, Skills, and Abilities:
Excellent at configuration management, with good hands-on skills in Puppet, Ansible, Chef, Salt, or similar systems.
Excellent at system administration of Linux systems.
Excellent at the Linux command line.
Strong programming experience in one or more languages
Basic data center knowledge
Strong communication skills and a sense of ownership and drive.
Comfortable using version control systems
Comfortable understanding of networking
Comfortable with non-tech people and have a sense of humor. (Note: This is not a client-facing support role; we have a group of truly smart folks who handle that, but all OARC staff interact with our customers to some extent.)
Have systematic problem-solving skills.
Previous experience with automation and configuration management of old and new networking gear
Previous experience as a developer
Experience in HPC system administration, including Linux, cluster job schedulers, parallel storage and file systems, general IP networking, high-speed, low-latency network interconnects (e.g., InfiniBand), and co-processors (e.g., GPUs), plus familiarity with common scripting languages (Bash, Python, etc.) and parallel programming methodologies (e.g., MPI).
Experience as a system administrator of a high-performance computing system
Master’s degree in a technical discipline preferred.
Comfortable owning a project just as you are comfortable working as part of a team.
Need to do quality work and want your work to reflect well on you.
Possess curiosity and a desire to learn
Have strong opinions about monitoring and alerting
Interest in designing, analyzing and troubleshooting large-scale distributed systems
Have ‘owned’ the continuous integration process in a previous role.
Basic understanding of 3-phase AC electrical power distribution and usage
Comfortable with InfiniBand or similar data interconnects
Comfortable with HPC schedulers such as Slurm, PBS or similar.
Contributed to an Open Source Project or two.
Supported Data Scientists in a previous role.
Experience managing projects
Expert at distributed file systems like Ceph or GPFS.
Expert with Hadoop and other NoSQL systems.
Expert at parallel programming using MPI, OpenMP or similar.
Expert with HPC schedulers such as Slurm, PBS or similar.
Comfortable working directly with researchers to find solutions to demanding computational problems.
Physical Demands and Work Environment:
Ability to lift and carry up to 50 lbs. Sitting, standing, walking, talking, and hearing.
Visual acuity to perform activities such as: viewing a computer terminal, reading, analyzing written information/data, etc.
Posting Number: 20ST1671
Location: Busch (RU-New Brunswick)
Internal Number: 119645
About Rutgers University
Rutgers, The State University of New Jersey, is a leading national public research university and the state's preeminent, comprehensive public institution of higher education. Rutgers is dedicated to teaching that meets the highest standards of excellence; to conducting research that breaks new ground; and to turning knowledge into solutions for local, national, and global communities. As it was at our founding in 1766, the heart of our mission is preparing students to become productive members of society and good citizens of the world. Rutgers teaches across the full educational spectrum: preschool to precollege; undergraduate to graduate and postdoctoral; and continuing education for professional and personal advancement. Rutgers is New Jersey's land-grant institution and one of the nation's foremost research universities, and as such, we educate, make discoveries, serve as an engine of economic growth, and generate ideas for improving people's lives.