Sarvaha would like to welcome a skilled Observability Engineer with a minimum of 3 years of experience to contribute to designing, deploying, and scaling our monitoring and logging infrastructure on Kubernetes. In this role, you will play a key part in enabling end-to-end visibility across cloud environments by processing Petabyte data scales, helping teams enhance reliability, detect anomalies early, and drive operational excellence.
Sarvaha is a niche software development company that works with some of the best funded startups and established companies across the globe. Please visit our website at
https://www.sarvaha.com to know more about us.
What You’ll Do
- Configure and manage observability agents across AWS, Azure & GCP
- Use IaC techniques and tools such as Terraform, Helm & GitOps, to automate deployment of Observability stack
- Experience with different language stacks such as Java, Ruby, Python and Go
- Instrument services using OpenTelemetry and integrate telemetry pipelines
- Optimize telemetry metrics storage using time-series databases such as Mimir & NoSQL DBs
- Create dashboards, set up alerts, and track SLIs/SLOs
- Enable RCA and incident response using observability data
- Secure the observability pipeline
You Bring
- BE/BTech/MTech (CS/IT or MCA), with an emphasis in Software Engineering
- Strong skills in reading and interpreting logs, metrics, and traces
- Proficiency with LGTM (Loki, Grafana, Tempo, Mimi) or similar stack, Jaeger, Datadog, Zipkin, InfluxDB etc.
- Familiarity with log frameworks such as log4j, lograge, Zerolog, loguru etc.
- Knowledge of OpenTelemetry, IaC, and security best practices
- Clear documentation of observability processes, logging standards & instrumentation guidelines
- Ability to proactively identify, debug, and resolve issues using observability data
- Focused on maintaining data quality and integrity across the observability pipeline
Why Join Sarvaha?
- Top notch remuneration and excellent growth opportunities
- An excellent, no-nonsense work environment with the very best people to work with
- Highly challenging software implementation and deployment problems
- Hybrid Mode. We offered complete work from home even before the pandemic.