Position Summary:
The System Administration Engineer will be responsible for maintaining and managing IT infrastructure systems including Red Hat Enterprise Linux (RHEL) servers, Microsoft Windows servers, Active Directory, DNS, Group Policy Objects (GPO), and automation tools like Terraform and Ansible. The engineer will ensure the health, security, and performance of all systems, while collaborating with cross-functional teams to deploy, monitor, and troubleshoot infrastructure.
Key Responsibilities:
- System Administration & Infrastructure Management:
- Administer and support RHEL and MS Windows Server environments, ensuring optimal performance, security, and availability.
- Perform system installation, configuration, and upgrades for both RHEL and Windows servers.
- Manage Active Directory (AD), DNS, and DHCP services, ensuring reliable and secure network operations.
- Create and maintain Group Policy Objects (GPO) to manage security settings and system configurations.
- Develop and manage Infrastructure as Code (IaC) using Terraform for automating provisioning and deployment of cloud and on-premises infrastructure.
- Automate system configurations, deployments, and patching tasks using Ansible, ensuring consistency and compliance across environments.
- Create scripts to automate routine system administration tasks (shell scripting, PowerShell, etc.).
- Continuously monitor system performance, security, and availability; proactively troubleshoot and resolve issues.
- Implement performance tuning and optimization of server resources (CPU, memory, storage, and network).
- Set up and manage monitoring tools to detect and alert system anomalies.
- Ensure compliance with internal security policies and external regulatory requirements.
- Perform regular system updates, security patches, and vulnerability assessments.
- Work with the security team to implement best practices for system hardening and securing servers.
- Ensure that backup and disaster recovery solutions are in place and tested regularly for both RHEL and Windows Server environments.
- Create and implement disaster recovery plans to minimize downtime during critical events.
- Collaborate with DevOps, Cloud, and Network teams to ensure seamless integration of infrastructure services.
- Provide on-call support during off-hours for urgent system outages or issues.
- Assist development teams with troubleshooting and deploying applications on supported servers.
- Maintain up-to-date system documentation, including server configurations, system setups, and standard operating procedures (SOPs).
- Generate and present system performance, security, and compliance reports to management.
Required Qualifications & Skills:
- Experience: 5 to 8 years of hands-on experience in system administration with a focus on RHEL and MS Windows Server environments.
- Technical Skills:
- Strong experience with RHEL system administration, configuration, and troubleshooting.
- Proficient in managing MS Windows Server environments (2008/2012/2016/2019).
- In-depth knowledge of Active Directory, DNS, Group Policy Objects (GPO), and DHCP management.
- Familiarity with Terraform for Infrastructure as Code (IaC) and cloud provisioning.
- Expertise in using Ansible for automation and configuration management.
- Experience with Linux/Windows Shell scripting and automation via tools like PowerShell.
- Strong understanding of networking protocols (TCP/IP, HTTP, HTTPS, DNS, etc.) and basic network troubleshooting.
- Experience with system monitoring tools such as Zabbix or similar.
Preferred Qualifications:
- Certification in Red Hat Certified Engineer (RHCE) or similar.
- Certification in Microsoft Certified Solutions Expert (MCSE) or equivalent.
- Experience with Oracle or other cloud platforms for server management and automation.
- Knowledge of containerization technologies such as Docker or Kubernetes.
Soft Skills:
- Strong problem-solving and analytical abilities.
- Excellent communication skills, both verbal and written.
- Ability to work independently and in a collaborative team environment.
Detail-oriented with a strong commitment to quality and process improvements