About the job
Infrastructure Operations Engineer
This role works under the direction of technical team leaders to provide solutions for 24/7 incident management and request fulfillment. The primary scope of cloud operations is the day-to-day operation of on-premises infrastructure and the optimization of technical service delivery to R&D partners.
Role and Responsibilities:
- As part of incident management, the role provides:
  - First-line response for datacenter and regional office technical incidents. These include, but are not limited to, endpoint, performance, hardware, network, and application issues.
  - Coordination of critical incident handling together with subject matter experts.
  - Knowledge base maintenance.
  - Delivery against business objectives and team targets as per the defined SLOs.
- On the request handling front, the role entails:
  - Learning and executing playbooks and established operating procedures to complete service requests within the defined SLOs. These include, but are not limited to, requests for systems, network, and shared services.
  - Eliminating toil through automation and process optimization.
  - Documenting and updating playbooks.
- The leadership team will also assign projects and other duties as the need arises.
Required Experience and Skills:
- Experience in network configuration and troubleshooting (VPN, DNS, LAN)
- Knowledge of virtualization
- Project management experience
- Familiarity with server environments, preferably CentOS Linux, including experience installing, configuring, and managing server systems (Windows Server or Linux)
- Experience in solutions integration, tools development, and programming, preferably Python.