Role
o??Design, develop, and maintain high-performance and scalable platform
o??Collaborate with development and operations teams to drive standardization for cloud platform across
o??Provide technical leadership and mentorship to other engineers
o??Create high-quality technical documentation, including requirements specifications, use cases, test strategies, performance benchmarks, deployment plans, and feasibility studies.
o??Troubleshoot and resolve production issues, ensuring system stability and reliability.
o??Continuously seek opportunities to improve system performance, security, and user experience.
All About You
o??Experience with cloud platforms (AWS, Azure) and containerization technologies (Kubernetes, Docker).
o??Experience with modern monitoring and observability tools (Dynatrace, Prometheus, Grafana, Datadog.).
o??Strong understanding of distributed systems, high availability, and failure recovery.
o??Familiarity with chaos engineering practices and tools (e.g., Gremlin, Chaos Monkey).
o??Strong leadership and team collaboration skills.
o??Deep understanding of service-level management, incident response, and root cause analysis.
o??Excellent problem-solving and troubleshooting skills.
o??Strong programming and scripting skills (e.g., Python, Go, Bash, Java, C#).
o??Familiarity with CI/CD pipelines and automation frameworks.