Experience -4-6 years
Budget – Best in Market


Must have :

Hands on experience in any one Scripting language. Java / JS, Python, GO lang, SPLUNK & AppDynamics, Couch (NoSQL), KafKa or RabbitMQ, Cloud Infra knowledge, ITIL / SRE processes, Documentation skills (Content writing) (Java people with Monitoring & Support skills)

Good to have skills: SRE implementation knowledge

Job description :

The site reliability engineer will focus on adding resiliency to the systems and builds software that solves IT operations & engineering problems. Our SREs should empower our colleagues with a highly available and rich in feature products. Some of their responsibilities include,
1.Building software to help operations and support teams
2. Fixing support escalation issues
3. Optimizing on-call rotations and processes by adding more automation and enriching alerts
4.Update runbooks and create documents to ease Incident handling
5.Create knowledge base from experiences of work in both development and production environment and regularly update the same.
6.Conduct and own the outcome of host post-incident review.

Interested candidates share resume to ritushree.das@xebia.com