Octopus by RTG is enabling a key partner organization to build their digital hub in Egypt looking for the right pioneers to work on exciting AI Projects.
Octopus is proud to be part of the Robusta Technology Group (RTG), a leading tech consultancy group. With a decade of experience and a successful track record of delivering over 300 projects across Europe, the Middle East, and North America, RTG has established itself as a preferred employer in the Egyptian market. Octopus and Robusta are building a bridge between Europe and Africa, creating tailored hub solutions to connect companies with top talent across the globe.
Octopus is specialized in rapidly assembling remote global tech teams that are fully aligned with the culture and practices of a particular brand. By providing tailored hubs to suit its clients needs, Octopus gives companies all the advantages of remote work and offshoring without all the negatives.
We are looking for a Resident DevOps & Monitoring Engineer to manage the day-to-day deployment, monitoring, and coordination of a vendor solution hosted within a client’s secure on-premises environment. This role requires strong cross-functional collaboration skills, a proactive approach to incident handling, and solid experience with DevOps in on-prem setups.
Key Responsibilities:
- Deployment & Configuration:
- Package Docker images, maintain Kubernetes manifests and Helm charts
- Align and manage versions of Postgres, MongoDB, and DB2 connectors
- Monitoring & Observability:
- Set up monitoring systems across applications and infrastructure
- Capture metrics, logs, and traces; configure sensible alert thresholds
- Internal Service Coordination:
- Submit, track, and follow up on service requests across IT, Data, Security, Ops, and QA
- Ensure timely completion and resolution across departments
- Incident Response & RCA:
- Detect and resolve production outages or performance issues
- Lead coordination efforts with on-site teams and drive root cause analysis
- Stakeholder Management:
- Act as the communication bridge between client Ops, Security, QA, and the vendor’s engineering/product teams
- Process Optimization & Documentation:
- Develop deployment guides, runbooks, checklists, and automation scripts
- Drive process improvement through standardization and documentation
- On-Call Support:
- Be available for critical incidents and lead resolution coordination when needed
Requirements
- Containers & Orchestration:
- Proficient with Docker, Kubernetes, Podman
- Strong understanding of networking fundamentals
- Operating Systems:
- Experience with RHEL and Windows Server environments
- Databases:
- Familiarity with Postgres, MongoDB, DB2, and SQL querying
- Scripting & Tooling:
- Python, Git, Pandas
- Monitoring Expertise:
- Hands-on experience with implementing observability across metrics, logs, traces, and alerting systems
- AI/ML Observability (Nice to Have):
- Understanding of monitoring ML models (accuracy, drift, hallucinations) and ensuring data pipeline integrity
- Security & Compliance:
- Knowledge of secure zones, change management protocols, and vulnerability remediation practices
- Soft Skills:
- Strong written and verbal communication
- Excellent stakeholder management
- Clear technical documentation and meeting facilitation skills