Technology · Remote - LatAm, Remote - Mexico · Fully Remote
DevOps Engineer
The Role:
The DevOps Engineer will design and maintain infrastructure, manage cloud migrations, ensure security compliance, and improve CI/CD processes while promoting DevOps best practices.
Responsibilities:
- Participate in the design, implementation, and maintenance of our infrastructure, ensuring reliability, scalability, and security.
- Support, monitor, and enhance the live infrastructure and platform solutions, ensuring high availability and performance.
- Participate in infrastructure migrations from on-premises to AWS or Google Cloud Platform (GCP), or from AWS to GCP, ensuring a smooth transition and effective use of GCP services.
- Maintain robust CI/CD and IaC pipelines, collaborating closely with development teams to streamline deployment processes.
- Maintain and enhance our security posture, ensuring compliance with industry standards and frameworks (e.g., SOC 2, ISO 27001).
- Diagnose and resolve infrastructure outages and incidents, ensuring timely resolution and root cause analysis.
- Ensure comprehensive documentation of infrastructure, systems, and processes to support onboarding, troubleshooting, and scalability.
- Promote and implement DevOps and Site Reliability Engineering (SRE) best practices across the organisation.
- Participate in a rotational 24/7 on-call schedule and perform root cause analysis of incidents in the production environment.
Requirements:
- 5+ years of experience in SRE, DevOps, or systems engineering roles supporting high-volume, mission-critical applications in production environments.
- Strong Linux systems administration experience, including firewalls and hardening.
- Experience supporting containerized workloads (Docker, Kubernetes, EKS, GKE).
- Proficiency with Infrastructure as Code (IaC) tools, particularly Terraform.
- Familiarity with configuration management (Ansible) and IT automation tools.
- Experience with network design, administration, and troubleshooting.
- Knowledge of programming languages and runtimes (e.g., JavaScript/Node.js, PHP, Python).
- Good scripting skills (shell, PowerShell, Python).
- Experience with version control systems, ideally Git.
- Experience with web server configuration (Apache, Nginx).
- Experience with database management (MySQL, Postgres, MongoDB), including high availability and backup solutions.
- Hands-on experience managing cloud providers, with significant experience in AWS and Google Cloud Platform (GCP).
- Familiarity with GCP services such as Compute Engine, Kubernetes Engine (GKE), Cloud Storage, BigQuery, Cloud Operations, and IAM.
- Experience with CI/CD pipelines and tools such as Jenkins, Harness, Bitbucket, Git, and JFrog.
- Hands-on experience with observability stacks: Prometheus, ELK, Splunk, CloudWatch, Stackdriver, and Grafana.
- Familiarity with Agile and ITSM processes (incident/change/problem/configuration management), preferably using BMC Remedy.
- Strong understanding of DevOps and SRE principles.
Preferred:
- AWS/GCP certifications or relevant Site Reliability/Cloud Solution Architect certifications.
- Jenkins/Harness certifications.
- Kubernetes certifications.
Soft Skills:
- Strong written and verbal communication skills for effective collaboration with global engineering and operations teams.
- High ownership mindset with the ability to work independently and proactively improve system reliability.
- Self-motivated, organized, analytical, and driven by continuous improvement.
- Strong understanding of, and experience operating in, an agile development environment.
Category: Technology
Locations: Remote - LatAm, Remote - Mexico
Remote status: Fully Remote
Employment type: Full-time