Posted:1 hour ago
By:Hiring Kenya
Company Details
Industry:
Banking
Description:
4G Capital (4th Generation Capital) is Africa's fastest fintech providing ethical credit services to those who require it most. We provide rapidly accessible and affordable unsecured loans with strict affordability criteria to prevent unmanageable debt. Our customers are mainly small businesses and entrepreneurs who use our credit to grow their businesses and provide for the unforeseen.
Job Description
- This is an exciting opportunity for a seasoned SRE who thrives at the intersection of cloud engineering, operational excellence, and platform reliability. You will be responsible for strengthening and maturing our SRE practice while ensuring our cloud-native systems on Google Cloud Platform (GCP) remain stable, observable, scalable, and cost-efficient.
- You will define the standards for reliability engineering across the organisation, develop strong observability and monitoring systems, guide capacity planning, and own cloud cost optimisation. Working closely with DevOps, DevSecOps, IT Operations, Product, and business teams, you will help ensure our systems are predictable, resilient, and capable of supporting the company’s rapid growth.
What You’ll Do
- Define and own 4G Capital’s SRE standards, principles, and best practices.
- Build robust monitoring, observability, and alerting systems that provide proactive insights into system health.
- Establish and manage reliability metrics (SLIs, SLOs) aligned to business priorities.
- Lead capacity planning, performance engineering, and demand forecasting for production systems.
- Champion cloud cost optimisation (FinOps), ensuring full visibility and effective cost management across GCP workloads.
- Provide reliability and operability guidance during architecture and design discussions.
- Operate and continuously improve GCP-based production platforms, including Cloud Run, networking, and event-driven services.
- Design and maintain infrastructure using Terraform and infrastructure-as-code methodologies.
- Collaborate cross-functionally to embed reliability, performance, and cost discipline across the organisation.
- Participate in incident response, post-incident reviews, and continuous improvement initiatives.
What We’re Looking For
- You’re an engineer who blends deep cloud expertise with a passion for operational excellence. You understand the importance of reliability in powering financial inclusion and thrive in environments where systems must scale gracefully and perform predictably.
Ideally, you’ll bring:
- Extensive experience operating production systems at scale.
- Strong hands-on GCP experience in live, high-availability environments.
- Proven expertise with Terraform and Infrastructure as Code.
- Demonstrated ability to build monitoring, observability, and alerting frameworks.
- Practical experience implementing SLIs, SLOs, and service reliability metrics.
- Experience in capacity planning, performance engineering, and scaling strategies.
- Solid understanding of FinOps, cloud cost optimisation, and cost governance.
- Professional-level Google Cloud certification (Cloud DevOps Engineer, Cloud Architect, etc.).
- A collaborative, structured, and outcomes-driven engineering style.
Salary: Discuss During Interview
Education: Diploma
Employment Type: Full Time