Position Overview: This is for a "Follow the Sun" model with support in New Zealand, the Philippines and Columbia. We are seeking an experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have extensive experience in DevOps practices, continuous integration and continuous deployment (CI/CD) pipelines, and container orchestration with Kubernetes. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our integration platforms, with a focus on Java Spring applications. Key Responsibilities: Infrastructure Automation: Design, implement, and maintain infrastructure as code (IaC) using tools such as Terraform, Ansible, or Chef to automate the deployment and management of cloud infrastructure. CI/CD Pipeline Management: Develop and optimize CI/CD pipelines using GitHub Actions or other similar tools to automate build, test, and deployment processes for Java Spring applications. Kubernetes Orchestration: Deploy, configure, and manage Kubernetes clusters to orchestrate containerized workloads, ensuring high availability, scalability, and reliability. Monitoring and Alerting: Implement monitoring and alerting solutions using tools like Prometheus, Grafana, or ELK stack to proactively identify and address performance issues and service disruptions. Incident Response and Troubleshooting: Respond to and resolve incidents in a timely manner, conducting root cause analysis and implementing preventive measures to minimize the risk of recurrence. Performance Optimization: Identify opportunities for performance optimization and efficiency improvements in the infrastructure and application stack, collaborating with development teams to implement solutions. Security and Compliance: Implement security best practices and compliance standards (e.g., GDPR, HIPAA) in the infrastructure and application environments, ensuring data privacy and regulatory compliance. Documentation and Knowledge Sharing: Document system configurations, procedures, and troubleshooting steps, and share knowledge with the team to foster collaboration and continuous learning. Requirements: Bachelor's or Master's degree in Computer Science, Engineering, or a related field. Extensive experience in DevOps practices, including infrastructure automation, configuration management, and CI/CD pipelines. Proficiency in GitHub pipelines and CI/CD practices, with hands-on experience in configuring and managing GitHub Actions. Strong expertise in container orchestration with Kubernetes, including cluster management, deployment, scaling, and monitoring. Solid programming skills in Java and experience with Java Spring framework. Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform. Knowledge of networking concepts, security principles, and best practices. Excellent problem-solving skills, attention to detail, and ability to work effectively in a fast-paced environment. Strong communication and collaboration skills, with the ability to work closely with cross-functional teams. Location: Remote, Colombia
Contract Type: Full-time
Salary: 2,500 - 3,000 USD (Full-time)
How to Apply: Interested candidates are invited to submit their resume and a cover letter detailing their relevant experience to ******. Please include "Site Reliability Engineer" in the subject line. #J-18808-Ljbffr