Senior Site Reliability Engineer
Optimizely
Hanoi, VN
5 ngày trước

Introduction

SREs at Optimizely are focused on making us the most reliable, performant, and trustworthy Digital Experience Optimization platform ever.

Our engineering teams have built data pipelines that process 10 billion events daily and applications that support powerful experimentation and collaboration workflows at scale.

Our platforms are built on AWS and GCP. We use technologies such as Kafka, Samza, HBase, MySQL, and Postgres. We build and manage our systems using TravisCI, Jenkins, Docker, Kubernetes, Terraform, and Chef.

We use a combination of managed and self-hosted approaches. This is a unique opportunity to lead the engineering organization in areas of standardized automated infrastructure and service provisioning and orchestration, service-oriented architectural excellence, and forward-looking planning and execution of large technical projects.

Job Responsibilities

  • Help build a Site Reliability Engineering culture across the organization by sharing your best practices, approaches, documentation, and code with other engineering teams
  • Apply automation and software to any tasks or parts of the system that would benefit from it or are performed manually
  • Ensure effective performance and 24x7 availability of all production systems
  • Monitor alerts coming out of all Optimizely’s platforms, and coordinate with Operations / SRE / TSS / Engineering teams as necessary to take preventive or corrective action to resolve any incidents, with a goal to minimize MTTA / R
  • Put in place and manage an effective on-call rotation within the team
  • Work with engineering teams to set up proper monitoring and alerting thresholds across all Optimizely services and applications so SRE team is focusing on key areas to stabilize the platforms
  • Document your system knowledge as you acquires it over time, create runbooks, and ensure critical system information is readily available to those who need it
  • Accountability for platform uptime SLAs.
  • Knowledge and Experience

  • Proven experience with AWS / Azure cloud infrastructure and DevOps
  • Experience using Kubernetes to build containerized applications
  • Good understanding Identity Governance catalog
  • Experience building secure multi-tier web applications
  • Experience configuring continuous integration and continuous delivery (CI / CD) systems such as TeamCity
  • Proficiency with databases such as SQL Server, Postgres, and MongoDB
  • Proficiency with ELK
  • Strong desire to learn and collaborate with the team
  • Must have a strong passion for continuous improvement
  • Ability to work with remote coworkers in other time zones
  • Familiarity with Agile development methodologies such as Scrum
  • Fluent in English both written and oral.
  • Bonus points :

  • Experience building scalable multi-region applications
  • Proficiency in scripting / Programming like PowerShell, Bash, or Python
  • Experience configuring software monitoring tools such as DataDog, Kibana, ELK, etc.
  • Proficiency in using configuration management tools such as Terraform.
  • Education

    Bachelor of Computer Science or equivalent industry experience

    Competencies

  • Displaying Technical Expertise
  • Critical Thinking
  • Testing and Troubleshooting
  • Demonstrating Initiative
  • Utilizing Feedback
  • About us :

  • 5 working days / week with flexible working time and no overtime;
  • Annual luxury Kick-off vacation;
  • International, professional, creative working environment and talented teams
  • Onsite opportunities in Europe and US;
  • Common cultural-sportive- art Clubs and activities, sponsored and / or supported by the Company (Ex : Football, GYM, Swimming, Guitar, English ).
  • Powerful workstation : Core i7-9700, 16-32 GB RAM, 02 x QHD 2560x1440 monitors (2K resolution);
  • 100% official salary during the probation period, 13th-month salary, annual salary raise;
  • 12 days of annual leave and 2 days of company holidays (New Year's eve 31 / 12, Juneteenth day 18 / 6, Work Anniversary)
  • Up to 03 extra paid-leave days per year
  • A free Hacking day per month for self-studying and researching any IT-related subjects;
  • Social, Health, and Unemployed Insurance are based on 100% Gross salary and fully paid by Company;
  • Extra bonus at $ 60 per special occasion (Birthday, Labor Day, National Day, Solar New year, Lunar New Year);
  • Lunch allowance at $30 per month;
  • Baby allowance for a child under 03 years old is $ 12 per month;
  • AON Premium Healthcare Insurance package for employees and their children up to 18 years old.
  • Daily various foods, drink, and seasonal fresh fruits;
  • And many other benefits, let's join us to discover!

    Báo cáo công việc này
    checkmark

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    Nộp đơn
    Email của tôi
    Bằng cách nhấp vào "Tiếp tục", tôi đồng ý với neuvoo để xử lý dữ liệu của tôi và gửi cho tôi thông báo qua email, như được nêu chi tiết trong Chính sách bảo mật của neuvoo. Tôi có thể rút lại sự đồng ý của tôi hoặc hủy đăng ký bất cứ lúc nào.
    Tiếp tục
    Mẫu đăng ký