SRE (Site Reliability Engineer)

Details of the offer

Our Travelstart team is seeking an SRE (Site Reliability Engineer) for our Dev Team. The role ensures the reliability, performance, and scalability of the Travelstart systems and bridges the gap between software development and system operations, focusing on automating infrastructure and processes to improve reliability and efficiency. (This is a hybrid role.)
Key Responsibilities
Infrastructure Automation:
Design, develop, and maintain automated infrastructure provisioning and management systems (e.g., Terraform, Ansible, CloudFormation).
Create and manage configuration management tools (e.g., Puppet, Chef) to ensure consistent environments.
Automate routine tasks and processes to reduce manual intervention and errors.

System Reliability:
Monitor system performance and identify potential issues proactively.
Implement incident response procedures and participate in incident investigations.
Conduct root cause analysis to prevent recurring problems.
Develop and maintain service level agreements (SLAs) and ensure they are met.

Performance Optimization:
Optimize system performance through tuning, caching, and load balancing.
Conduct performance testing and benchmarking.
Identify and address bottlenecks in the system.

Scalability:
Design and implement scalable architectures to handle increasing traffic and data volumes.
Ensure the system can accommodate growth and peak loads.

Collaboration:
Work closely with development teams to ensure that new features and changes are reliable and scalable.
Collaborate with operations teams to maintain system stability and availability.
Participate in knowledge sharing and training activities.

Required Skills and Experience
Strong understanding of cloud platforms (e.g., AWS, Azure, GCP) and infrastructure-as-code tools.
Proficiency in scripting languages (e.g., Python, Bash).
Experience with containerization technologies (e.g., Docker, Kubernetes).
Knowledge of networking and security concepts.
Experience with monitoring and alerting tools (e.g., Prometheus, Grafana).
Problem-solving and analytical skills.
Ability to work independently and as part of a team.
Desired Skills and Experience
Experience with DevOps methodologies and tools.
Knowledge of specific OTA technologies and systems.
Experience with chaos engineering and failure testing.
Certification in cloud platforms or DevOps.
By effectively fulfilling these responsibilities, the SRE will play a crucial role in ensuring the reliability, performance, and scalability of the Travelstart systems, ultimately enhancing customer satisfaction and business success.
About Travelstart
Travelstart is Africa's leading online travel agency (OTA), helping today's business and leisure travellers search, compare and book the best flights, buses, hotels, car hire, holiday packages and activities all in one place.
We focus on affordable travel and on simplifying the booking experience for our customers. Visit the Travelstart website or download the Travelstart app to find some of the lowest fares around. Pay quickly and safely online with your preferred payment method and you're off!


Nominal Salary: To be agreed

Source: Whatjobs_Ppc
