Job Description

About the company

GetBlock is a leading RPC node provider and Web3 infrastructure platform trusted by devs worldwide. Since 2019, we have empowered crypto innovators with instant and reliable access to 120+ blockchains through robust production-ready APIs. Our team’s deep blockchain expertise and passion for innovation drive us to build the infrastructure layer for the next generation of blockchain applications. At GetBlock, we move fast, think creatively, and deliver high-impact products.

Our culture blends technical excellence with entrepreneurial spirit, making us a dynamic force in the Web3 space.

About the role

We’re hiring an SRE Lead to own end-to-end production reliability for a multi-region platform serving high-load RPC traffic across 120+ blockchains. You’ll lead the blockchain nodes reliability team, responsible for scalability, stability, and cost-efficient infrastructure, from gateways and orchestration (Nomad & Kubernetes) to nodes and observability.

This is a hybrid SRE leadership role: driving SLOs, incidents, postmortems, and capacity planning while staying hands-on with clusters and internal tooling. You’ll work closely with Backend, DevOps, and Support teams, and own the on-call system, incident processes, and continuous reliability improvements to keep performance high and customer trust strong.

Responsibilities

People & Process

Lead and grow the SRE team: hiring, onboarding, 1:1s, performance reviews, and career development.

Own SRE operating cadence: prioritization, planning, execution, and visibility of reliability work.

Maintain high standards for production readiness: runbooks, operational checklists, change management, and quality gates.

Reliability & Operations

Own production reliability end-to-end across gateways, clusters, and blockchain node fleets.

Define and evolve SLIs/SLOs for uptime, response time, RPS, and time-to-resolve; partner with engineering teams to meet targets.

Own incident management standards: alerting strategy, escalation, incident coordination, and communications.

Run and improve postmortems: ensure follow-ups are executed and reliability debt is reduced over time.

Lead capacity planning and performance work across regions and chains; balance reliability, speed, and cost.

Technical Leadership

Lead design reviews and set engineering standards for reliability, scalability, and operational excellence.

Drive architecture decisions across Nomad + Kubernetes environments, gateways, and observability stack.

Build and evolve internal tooling that improves reliability and operational efficiency (automation, health systems, diagnostics, self-service).

Qualifications

3+ years in SRE / infrastructure / production engineering, including 1+ year leading people

Strong Linux, networking, and production incident debugging skills

Experience running and scaling distributed, multi-region, high-load systems

Hands-on with orchestration (Nomad and/or Kubernetes) and modern gateways/proxies

Solid observability practices (metrics, logs, traces, alerting, incident response)

Experience using AI agents to improve operational efficiency and reliability automation

Strong communication and ability to lead technical decisions end to end

Nice to have:

Web3 / RPC infrastructure and blockchain node operations

HashiCorp stack (Nomad, Consul, Vault), Prometheus ecosystem

Terraform / IaC, capacity & cost modeling, DDoS and abuse protection

Building internal platforms: self-service tools, runbooks, reliability automation

Why GetBlock?

Be Part of Something Bigger

GetBlock builds infrastructure that makes blockchain faster and more accessible worldwide. Our goal is to lead the RPC solutions segment through innovation, flawless uptime, and a personalized approach to clients. We're growing alongside Web3 and invite you to join us!

Remote by Design

Join a fully distributed, international team working across time zones and continents.

We believe great talent lives everywhere, and English is our shared language.

Trust over Control

No trackers, no micromanagement. We hire responsible, self-driven professionals who take ownership and make decisions with confidence.

Flexibility that Fits

We embrace the geographical and cultural diversity of our team, allowing you to choose your holiday days and coordinate your schedule with your manager.

Inclusive Corporate Culture

Monthly town halls, English-speaking clubs, and other corporate events to stay connected and foster team spirit, despite the distance.

Balanced Workload

We maintain a flexible work schedule with approximately 40 hours per week, based on an 8-hour workday. Hours can be adjusted within your department to suit personal preferences and team needs.

We offer

Competitive market rate salary with performance-based incentives.

20 days of annual leave, plus an additional 12 days off to use for your holidays or personal days.

Well-being programs to support your health and balance.

Coworking space compensation for a productive work environment.

Paid sick leave to ensure you can rest when needed.

A company that invests in your growth, with personalized roadmaps to guide your professional development.

An actively growing company with great opportunities for both horizontal and vertical career development.

Opportunity to shape the initiatives you’re working on and make a real impact.

Originally posted on Himalayas
