Description
The System Administrator provides Tier-3 operational and maintenance support for enterprise IT and data center monitoring systems. This role ensures system availability, security compliance, and continuous monitoring across regional and global infrastructures.
Key Responsibilities
- Administer and maintain enterprise monitoring and logging platforms including:
- Nagios (multi-region)
- NetFlow/sFlow collectors
- HP Network Node Manager (HPNMi)
- HP Network Automation (HPNA)
- Syslog and Cribl
- Nagios (multi-region)
- Maintain regional and global instances to ensure continuous availability and mission support.
- Perform system scaling, configuration changes, and performance tuning as operational requirements evolve.
- Execute patching and upgrades of applications and systems for:
- Security compliance
- Bug remediation
- Security compliance
- Provide advanced troubleshooting and root cause analysis for mission-impacting incidents.
- Assist with alert tuning, false positive reduction, and onboarding of new monitoring alerts.
- Support enterprise environmental logging and monitoring infrastructures to meet uptime SLAs.
- Track incidents, service requests, and changes
- Support log retention, storage, and restoration processes.
- Assist users with software installation, usage, and operational best practices.
Requirements
Top Secret SCI w/ Polygraph Clearance Required
Required Skills
- Linux system administration
- Enterprise monitoring tools (Nagios, NetFlow, Syslog)
- Network troubleshooting fundamentals
- Patch and configuration management
- Incident response and operational support in secure environments