Sertis is a leading Data and AI company based in Bangkok. We're looking for a Senior-Lead Site Reliability Engineer to improve the efficiency and reliability of our software development and deployment processes. You'll work closely with our Machine Learning, Software Engineering, Quality Assurance, and Data Engineering teams to automate and streamline our systems and services.
Requirements
5-8 years of hands-on experience in designing, building, maintaining cloud infrastructure, and applying DevOps and SRE practices in large-scale systems
In-depth knowledge about container orchestration principles and techniques, including hands-on experience with Docker and platforms such as Kubernetes
In-depth knowledge of cloud infrastructure and its components, including virtual machine, serverless, storage, networking, and security
Strong automation and IaC skills, including experience with tools such as Terraform, AWS CDK, Flux/ArgoCD, Helm, and Gitlab CI
Ability to design and build new infrastructure and continuously improve the CI/CD pipelines
Experience with monitoring (Prometheus/Grafana preferred) and defining SLAs
A secure by design mindset and understanding of the importance of security in the development and deployment process
Strong problem-solving skills and experience in troubleshooting production issues
Ability to work collaboratively with different teams
Leadership skills and ability to mentor junior and mid-level Site Reliability Engineers
Benefits
Hybrid working environment
Flexible office hours
Opportunity to work and learn from the best in the industry