JobGrid Role Overview
SRE (Terminal) at MLabs in London, United Kingdom: an on-site role in IT / DevOps / SRE, listed at Senior level in the source. JobGrid normalizes the role facts from the source record, adds nothing beyond what the structured source provides, and routes candidates to the original public application page with non-personal referral parameters.
- London, United Kingdom; on-site workplace; category: IT; subcategory: DevOps / SRE; seniority: Senior.
- Source freshness: posted 2026-06-03; last checked 2026-06-04.
- No salary or employment type is provided in the structured payload, so JobGrid does not add those details.
- Application goes through JobGrid's path to the original public application page with non-personal referral parameters.
Location: New York, United States • London, United Kingdom (Office)
On-Site | Full-time
Compensation: Competitive Compensation
Our client is a high-growth software development organization and a key contributor to one of the largest and fastest-growing decentralized crypto social networks globally. The platform has achieved massive scale, generating significant revenue and global attention since its inception.
To support this rapid expansion and ensure the continuous uptime of its high-stakes, high-throughput environment, our client is seeking a battle-tested Site Reliability Engineering (SRE) Expert. This individual will be handed ambiguous, critical infrastructure challenges and will be trusted to navigate them end-to-end: scoping solutions, making sound architectural trade-offs, and executing with precision.
Key Responsibilities
- Own Foundation & Architecture: Design, scale, and maintain highly available cloud infrastructure using multi-region or active-active patterns.
- Incident Response & Reliability: Lead critical incident response efforts, participate in real on-call rotations, and drive comprehensive, blameless post-mortems to continuously harden the system.
- Automation & Tooling: Write clean, production-grade automation code (Python, Go, or similar) for infrastructure tooling, operators, and seamless systems integration.
- Risk & Security Management: Exercise sharp judgment regarding system risks, balancing rapid deployment velocity with robust infrastructure safety and stability.
- Operational Excellence: Raise the engineering and operational bar across the organization through the implementation of rigorous standards, modern tooling, and technical mentorship.