Site Reliability Engineer Job at EVONA, San Francisco, CA

N1gvWWl3bGVpS0hweXhLL1M2N002eDRxMWc9PQ==
  • EVONA
  • San Francisco, CA

Job Description

Site Reliability Engineer (SRE)

Location : San Francisco Bay Area

Role Overview :

We are seeking a highly skilled Site Reliability Engineer (SRE) to join a dynamic team at a rapidly growing technology company. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of mission-critical systems, while implementing automation and optimizing cloud infrastructure. This role offers the opportunity to work with cutting-edge AI/ML technologies , leveraging them to solve complex challenges in cloud infrastructure management and performance optimization.

Key Responsibilities :

  • System Reliability & Performance : Design, implement, and maintain scalable systems, ensuring high availability, performance, and disaster recovery across production environments.
  • Automation & Tool Development : Develop automation tools to streamline operations, improve system reliability, and reduce manual interventions.
  • Cloud Infrastructure Management : Create and manage cloud instances (e.g., dev, staging, production) using AWS, GCP, or Azure, optimizing infrastructure performance and cost.
  • Integration of AI/ML Models : Collaborate with engineering teams to integrate machine learning models into production environments, ensuring that these models scale efficiently and perform optimally.
  • Incident Management : Respond to and resolve incidents, minimizing downtime and ensuring quick recovery. Lead post-incident reviews and implement preventive measures.
  • Continuous Improvement : Identify areas of improvement and drive initiatives to enhance system reliability, performance, and security.
  • Security & Compliance : Ensure that infrastructure and applications adhere to security best practices and compliance standards.

Qualifications :

  • Educational Background : Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • Experience : Proven experience as a Site Reliability Engineer or in a similar role within a SaaS environment , managing and optimizing cloud infrastructure (preferably AWS, GCP, or Azure), and familiarity with integrating AI and machine learning technologies.
  • Technical Skills :
  • Proficiency in programming and scripting languages such as Python, Go, or Bash.
  • Experience with containerization and orchestration tools like Docker and Kubernetes.
  • Solid understanding of networking, security , and performance optimization practices.
  • Knowledge of CI/CD pipelines and DevOps practices to ensure smooth development and deployment cycles.
  • Problem-Solving : Strong analytical and problem-solving skills with attention to detail.
  • Collaboration & Communication : Excellent interpersonal skills, with the ability to work collaboratively in cross-functional teams and communicate technical concepts clearly.

Benefits :

  • Competitive Salary : Attractive compensation package, including equity options.
  • Health & Wellness : Comprehensive health, dental, and vision insurance, along with other benefits.
  • Work Environment : A collaborative and innovative work environment within a growing company.
  • Growth Opportunities : Opportunities for career growth, professional development, and a chance to shape the future of the company’s technology and infrastructure.

Job Tags

Similar Jobs

Tiger Recruitment

Executive Assistant - Investment and Technology Development Firm Job at Tiger Recruitment

 ...Executive Assistant - Global Investment and Technology Development Firm Manhattan, NYC Full Time, Permanent Hybrid Salary: $120,000 - $140,000 p.a. Our client, a Global Investment and Technology Development Firm based in NYC, is looking for a proactive, resourceful... 

Two95 International Inc.

IT disaster Recovery Analyst - Portland, OR Job at Two95 International Inc.

 ...Title: IT Disaster Recovery Analyst Location: Portland, OR Position: Contract Rate: $Open Required Skills: - Bachelors degree with a minimum of 5 years experience with IT technologies, analysis of business process to technology interdependency mapping... 

Canadian Executive Search Group (USA) Inc / Division of Arro...

Maintenance Technician Job at Canadian Executive Search Group (USA) Inc / Division of Arro...

 ...CES/AWS is seeking a Maintenance Technician for an Automotive Manufacturing client in Blue Springs, MO Location: Blue Springs, MO...  ...understanding of electrical systems, including motors, drives, and control circuits. Please submit your resume to carmen@... 

CRG

Agile Coach Job at CRG

 ...Agile Coach Location: Chicago, Illinois (Hybrid) Duration: 6 Months, Contract Pay: $55-60/hour W2 JOB DESCRIPTION...  ...experience Consulting Experience MBA or advanced degree in relevant technology or business-related field Category Code: JN008... 

Darwill

Paid Media Specialist (Google Ads and Paid Social) Job at Darwill

 ...experience ~ Demonstrated ability to drive results and achieve key performance indicators (KPIs) such as return on ad spend (ROAS), cost per acquisition (CPA), and conversion rates ~ Experience in the Healthcare, Home Services, or Automotive industries is a plus ~...