Department: NAISE - NU ANL Inst Sci Eng
Salary/Grade: ITS/78
Job Summary: Oversees a major department/school-wide program or project & provides direction & thought leadership to develop strategic goals & objectives; defines associated metrics to measure results; & creates & obtains resources to ensure success. Reviews, creates & nurtures internal & external alliances, collaborations and partnerships. Completes system installation and management, system monitoring and performance tuning, diagnostics and troubleshooting, user support coordination, backup and storage management, data center maintenance, and other related tasks.
Please note: This is a 1-year term position. Opportunity for renewal will be based on performance and available funding. Travel from NU to ANL may be required.
Specific Responsibilities: Administration - Ensures data/media recoverability by implementing a schedule of system backups and database archive operations.
- Supports media management through internal methods and procedures or through offsite storage and retrieval services.
- Develops and promotes standard operating procedures.
- Conducts routine infrastructure audits to ensure compliance with established standards, policies, and configuration guidelines.
- Develops and maintains a comprehensive infrastructure configuration database/library of all supporting documentation.
- Creates and maintains user accounts, security, permissions, and file systems in accordance with standards and operational requirements.
Development - Performs infrastructure installations and upgrades.
- Schedules installations and upgrades and maintains them in accordance with established IT policies and procedures.
Performance - Monitors and tunes the system to achieve optimum performance levels.
- Ensures infrastructure integrity by evaluating, implementing, and managing appropriate solutions.
- In collaboration with system administrators, performs ongoing support and maintenance for systems and related applications.
- Prioritizes, troubleshoots and resolves issues.
Miscellaneous - Performs other duties as assigned.
Minimum Qualifications: - Successful completion of a full 4-year course of study in an accredited college or university leading to a bachelor's or higher degree in a major such as computer science, information technology, or a related field; OR an appropriate combination of education and experience.
- 2 years of system administration or other relevant experience required.
- Please see information in tables below:
Infrastructure (extends across applications): - Amazon Web Services (AWS)
- Code Repositories (Git, Subversion)
- Linux Operating System
- Server hardware
- SQL/MySQL/Postgres
- Storage hardware
Programming Languages and Frameworks: - Django
- HTML 5
- JavaScript
- Python
- Shell Scripting
Analytical: - Critical thinking
- Debugging
- Decision making
- Problem solving
- Troubleshooting
- Use-case analysis
Project: - Agile environment
- Code documentation
- Collaboration and teamwork
- Cost/benefit analysis
- Evaluate resources
- Facilitate collaboration
- Functional documentation
- Iterative & incremental development
- Organizational skills
- Scrum project framework
- Workflow development & documentation
Minimum Competencies: (Skills, knowledge, and abilities.) - Extensive experience using and customizing the Linux operating system command-line interface.
- Proficient in scripting languages such as Python or Go.
- Proficient in Bash shell scripting.
- Familiar with system administration and metrics-gathering software (Prometheus or similar).
- Familiar with log management systems (logrotate or similar).
- Familiarity with JIRA or similar user-submitted ticket management system.
- Ability to design, create, and debug automated recovery services that run on remote systems to self-diagnose and recover the system.
- Experience with key and certificate security authorization (SSH keys and certificates).
- Basic experience with network management (TCP, UDP, reverse proxy, ports, sockets, etc.)
Preferred Qualifications: - Prior experience as System Admin or DevOps for remotely managed systems.
- Experience diagnosing Linux operating system or service application issues and performing root-cause analysis.
- Experience using and (ideally) creating system health dashboards that capture the metrics and health of remote systems to help identify and diagnose ecosystem problems.
- Experience rolling out operating system and software service updates to many remotely managed machines.
Preferred Competencies: (Skills, knowledge, and abilities) - Familiar with MQTT, RabbitMQ or similar message-broker services.
- Experience with micro-computing systems such as Raspberry Pi.
- Experience managing remote (sometimes limited connectivity and bandwidth) hardware.
- Experience customizing a Debian Linux operating system.
- Experience managing operating system and system services version control to facilitate consistent and stable remote systems.
- Extensive experience with network management.
Northwestern University is an Equal Opportunity, Affirmative Action Employer of all protected classes, including veterans and individuals with disabilities. Women, racial and ethnic minorities, individuals with disabilities, and veterans are encouraged to apply. Hiring is contingent upon eligibility to work in the United States.