AC
Компания RADLogics INC (www.radlogics.com) ищет два специалиста - Site Reliability Engineers (SREs)
SREs are responsible for keeping all user-facing services and other RADLogics production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our environments and the Bitbucket codebase. We specialise in systems, whether it be networking, the OS, or some more specific interest in scaling, algorithms, or distributed systems.
RADLogics is a unique site, we support customers worldwide whether in the cloud (SAAS) or on-prem.
AS a SRE in RADLogics you will:
Take care of new deployments at clients
Support (follow the sun) the WW customers
Monitoring ownership – manage and create new dashboards (per client) on both Grafana and DataDog (per organisation).
Support the RnD teams when needed.
Skills required:
“Get the job done” mind-set
Knowledge and proficiency with a variety of Ops and Automation tools
Great at writing scripts
Comfortable dealing with frequent testing and incremental releases
Understanding of Ops challenges and how they can be addressed during design and development
Soft skills for better collaboration across the team
Ability to postmortem the unexpected incidents to solve future hazards
Skilled in evaluating new possibilities and capacity planning aptitudes
Comfortable with handling the operations, monitoring and alerting
Knowledge and experience in building processes and automation to support other teams
Technical skills:
OS – windows – 2 years as an admin preferable
OS – linux - 2 years as an admin preferable
Scripting : Python / bash / powershell – at least 2 years in one of them.
Network – understanding network (FW / LB / testing / …) – nice to have
AWS – 2 years experience
CM as a code: Puppet / Chef or Ansible - at least 1 year.
Implement "Infrastructure as Code" using Terraform and jenkins CI/CD for automation – 1 year experience with both tools at least !
Kubernetes and containers – 1 year experience
Monitoring and Metrics in DataDog, Grafana and integrations with Slack/PagerDuty – 1 year or more.
Logging infrastructure – DataDog – Nice to have
Backend storage management and scaling – Nice to have
Disaster Recovery and High Availability strategy – nice to have
Strong communication skills (verbal / written)
Languages (good English!!!)
Условия работы:
Удаленная работа, с последующим выходом в офис
ЗП и бонусы индивидуально обсуждаются (в рынке)
от 180 до 250 тыс руб. (на руки)
Контакты для связи и резюме: alexander@radlogics.com