Capital H Staffing And Advisory Solutions | Site Reliability Engineer (SRE) (Ch1078)

Details of the offer

Our client is an innovative cloud-based company that leverages its software to address the legal contracting, compliance, and legal practice challenges faced by listed companies and multinationals. They are seeking a Site Reliability Engineer to join their dynamic team of professionals, delivering transformative growth by creating intelligent tech solutions that revolutionize the practice of law.
The ideal candidate will have a background in software engineering, system administration, and experience managing large-scale, high-availability systems. The SRE will be responsible for ensuring the reliability, scalability, and performance of our infrastructure, collaborating with development teams, and driving continuous improvement in system operations. This role offers a fantastic opportunity to work in a professional environment while enjoying the flexibility of working from home.
Key Responsibilities:
Infrastructure Management: Design, build, and maintain highly available and scalable infrastructure using cloud platforms (OCI, AWS, GCP, Azure) and on-premises environments.
Monitoring & Incident Response: Implement and maintain monitoring, logging, and alerting systems to detect and respond to system issues promptly. Lead incident response efforts and perform root cause analysis.
Automation: Develop and deploy automation tools to streamline operations, reduce manual intervention, and improve system reliability.
Performance Optimization: Analyze system performance metrics and make recommendations to improve application and infrastructure performance.
Security & Compliance: Ensure systems meet security, compliance, and regulatory requirements by implementing best practices and conducting regular audits.
Collaboration: Work closely with development teams to ensure new features and services are scalable, reliable, and maintainable.
Disaster Recovery: Develop and maintain disaster recovery plans, including data backups and system redundancy strategies.
Continuous Improvement: Identify areas for improvement in the existing infrastructure, then propose and implement solutions to enhance system reliability and performance.
Documentation: Create and maintain detailed documentation for system configurations, procedures, and processes.

We are looking for an analytical thinker with excellent communication and problem-solving skills who can work effectively in a fast-paced environment.
Required Skills:
Experience: Minimum of 5 years of experience in Site Reliability Engineering, DevOps, or a related field.
Cloud Platforms: Extensive experience with cloud services such as OCI, AWS, Google Cloud, or Azure.
Automation & Scripting: Proficiency in scripting languages (Python, Bash, etc.) and configuration management tools (Terraform, Ansible, Chef, Puppet).
Monitoring & Logging: Experience with monitoring tools (Zabbix, Prometheus, Grafana, Wazuh) and logging systems (ELK stack, Splunk, Elastic).
Networking: Strong understanding of networking concepts, including DNS, load balancing, firewalls, and VPNs.
Containers & Orchestration: Experience with containerization (Docker) and orchestration tools (Kubernetes).
CI/CD: Familiarity with continuous integration/continuous deployment (CI/CD) pipelines and tools like Jenkins, GitLab CI, or CircleCI.
System Administration: Strong background in Linux/Unix system administration.
ITSM: Experience with IT Service Management platforms, including optimizing and supporting tools such as JIRA and Freshdesk.
Incident Management: Proven ability to handle high-pressure incidents and provide clear communication to stakeholders.

Preferred Qualifications:
Education: Bachelor's or master's degree in Computer Science, Engineering, or a related field.
Certifications: Relevant certifications such as Oracle Cloud Infrastructure Architect Associate, AWS Certified Solutions Architect, or Google Cloud Professional DevOps Engineer.
Programming: Experience with software development in languages such as Python, Go, Java, or Ruby.
Database Management: Experience managing and optimizing databases (OracleDB, SQL).
Experience in High-Traffic Environments: Prior experience working in environments with large-scale, high-traffic systems.

General: Only shortlisted candidates will be contacted. Should you not hear from us within 30 days, you may consider your application unsuccessful.
In keeping with our client's employment equity requirements, only South African citizens will be considered. Please include your current salary and salary expectations.



Nominal Salary: To be agreed


