Site Reliability Engineer - Remote, Null
Social network you want to login/join with:Site Reliability Engineer - RemoteClient:ESL FACEIT GroupLocation:RemoteJob Category:EngineeringEU work permit required:YesJob Reference:3c177bbf8881Job Views:11Posted:22. 01. 2025Expiry Date:08. 03. 2025Job Description:At EFG (ESL FACEIT Group), we create worlds beyond gameplay where players and fans become community.
We pride ourselves in having a corporate social responsibility which is that "IT'S NOT GG (Good Game), UNTIL IT'S GG FOR ALL".
We are passionate about the culture we foster that ultimately helps to create and shape the world of esports, gaming tournaments, leagues, events, and holistic ecosystems staged for our millions of players, fans, and heroes. The Team:As a Site Reliability Engineer at EFG, you will be designing, analyzing, and troubleshooting large-scale distributed systems.
You will demonstrate a systematic problem-solving approach, and the ability to debug and optimize code and to automate routine tasks.
You will ensure that EFG's services and systems are reliable, that they have uptime appropriate to users' needs, and they have a fast rate of improvement. Apart from monitoring our systems' capacity and performance, you will also focus on optimizing existing systems, on building infrastructure, and on eliminating work through automation.
You will work collaboratively with the software engineering teams to deploy and operate our systems, and you will help to automate and streamline our operations and processes.
Within this role, you will be given real responsibilities, and you have the opportunity to drive change and have a big impact on our products and platform. What you will do:Maintain and improve the monitoring and observability tools (Grafana/Prometheus/Thanos/Jaeger);Work closely with your team and with other cross-functional teams to help design, maintain, and operate systems at scale;Develop and drive adoption of SRE best practices across the company;Lead the incident management process and adoption;Use your troubleshooting skills to help identify and fix operational issues;Work with Cloud Native technologies such as Kubernetes, Envoy, Istio, Prometheus, and Helm;Work with the "Hashi Stack" (Terraform, Packer, Vault);Experiment with and introduce cutting-edge technologies. Requirements:Proven experience as a Site Reliability Engineer, DevXP Engineer, or Software Engineer, focusing on building and maintaining scalable infrastructures;Excellent working knowledge of at least one of the major cloud providers (GCP/AWS/Azure);Experience with cluster management systems (Kubernetes);Knowledge of incident management: ability to investigate, troubleshoot, recover, and prevent the recurrence of incidents that interfere with the normal delivery of IT services;Proficient in Go language and some level of proficiency in at least another language: Java, Python, Rust…;Knowledge of GitOps practices;Production scale experience with one of the following: MongoDB, Redis, MySQL;Experience contributing to open source technologies would be an added bonus.
#J-18808-Ljbffr
Diventa il primo a rispondere a un'offerta di lavoro!
-
Perché cercare un lavoro con PostiVacanti.it?
Ogni giorno nuove offerte di lavoro È possibile scegliere tra un'ampia gamma di lavori: il nostro obiettivo è quello di offrire la più ampia selezione possibile Ricevi nuove offerte via e-mail Essere i primi a rispondere alle nuove offerte di lavoro Tutte le offerte di lavoro in un unico posto (da datori di lavoro, agenzie e altri portali) Tutti i servizi per le persone in cerca di lavoro sono gratuiti Vi aiuteremo a trovare un nuovo lavoro