Site Reliability Engineer - SRE, Remote, at Rappi - Hireline LATAM

Site Reliability Engineer - SRE at Rappi

Salary: not disclosed

Remote: LATAM

Full-time employee

English level: Intermediate

Rappi is looking for a seasoned engineer who wants to join us in implementing one of the most innovative methodologies for stability and reliability: Chaos Engineering.

What you'll do:
Rappi is one of the most challenging tech platforms in LATAM, and several key aspects make it a unique place for personal and professional growth in a tech career:
  • In operation in nine different markets
  • One of the biggest AWS infrastructures in LATAM
  • Around 40k containers scaling flexibly on a typical day
  • Over 1k microservices
  • Over 1.6k productive databases
  • Over 1Tb of monthly data to process
What we expect from you (skills needed):
  • ~5 years of experience coding in any language (preferably on the backend, but frontend experience will be considered)
  • ~3 years of experience in *nix system administration / infrastructure
  • ~3 years of experience managing IaaS / SaaS cloud platforms (AWS is a plus)
  • Solid troubleshooting skills for production issues
  • Proficiency deploying and managing containerized applications (tools like ECS, Kubernetes, Docker Swarm…)
  • Solid understanding of networking and Layer 4 (and up) routing
  • Scripting proficiency (preferably Python)
  • Experience integrating third-party services and applications (Slack, Splunk Logs / Observability)
  • Working knowledge of metrics collection and analysis (e.g. Prometheus (Thanos), Splunk Observability (SignalFx), Datadog…)