Senior Site Reliability Engineer (SRE) | LATAM
Who We Are
At MAS Global Consulting, we are a premium digital engineering partner delivering technology solutions to some of the world’s most innovative companies, from high-growth startups to Fortune 500 enterprises.
With a people-first culture and a commitment to excellence, we combine nearshore talent, agile delivery, and technical depth to build scalable, high-impact software solutions.
Our teams comprise experienced technologists who are passionate about innovation, collaboration, and delivering measurable value to our clients.
Who You Are
You are an experienced Site Reliability Engineer who thrives on building resilient, scalable, and highly reliable systems. You have a strong background in cloud infrastructure, automation, and DevSecOps practices, with a focus on improving system stability, performance, and operational efficiency.
You enjoy working closely with product and engineering teams, translating operational needs into reliable solutions, and continuously optimizing workflows through automation and modern reliability engineering principles.
What You’ll Do
As a Senior Site Reliability Engineer, you will play a key role in ensuring the reliability, scalability, and security of production environments. You will drive automation initiatives, improve monitoring strategies, and support the overall stability of critical systems while promoting best-in-class DevSecOps practices.
Key Responsibilities
- Design, build, and deploy solutions that enhance system reliability and optimize operational efficiency.
- Develop and optimize CI/CD pipelines to ensure secure and efficient delivery processes.
- Provide technical guidance and mentorship on DevSecOps and SRE best practices.
- Collaborate with product teams to understand system requirements and reliability needs.
- Conduct root cause analysis and post-mortems to prevent incident recurrence through code-driven solutions.
- Implement robust monitoring, alerting, and security scanning mechanisms.
- Support incident resolution and assist operational teams in troubleshooting production issues.
- Promote and implement modern technologies and workflows to improve system performance.
- Automate processes to reduce manual operational effort and improve response times.
- Provide after-hours emergency support when required.
What You Bring
Technical Skills
- 5+ years of experience in Site Reliability Engineering, DevOps, or other reliability-focused roles, including designing, building, and deploying solutions that improve system reliability and operational efficiency.
- Strong experience improving reliability through root cause analysis, post-mortems, and code-based prevention of recurring incidents.
- Proven ability to design and guide effective CI/CD pipelines while applying DevSecOps best practices.
- Solid experience working with AWS in scalable and highly available environments.
- Hands-on experience with Terraform and Ansible for infrastructure automation.
- Experience implementing monitoring and security scanning solutions.
- Strong background managing containerized environments using Docker and Kubernetes.
- Ability to identify and implement automation to reduce manual support workload.
- Experience with Git, GitLab, and Artifactory for version control and artifact management.
- Proficiency in Linux and/or Windows scripting to support operational processes.
- Experience supporting incident resolution, collaborating with product teams, and providing after-hours support when required.
- English proficiency at Intermediate (B2) level or higher.
