SRE Tech Lead - Cloud Infrastructure
Dublin, County Dublin, Ireland · Full Time
- Experience: Any
- Salary: Not disclosed
- Openings: 1
- Posted: 1 hour ago
Job description
Role overview
Join a Technical Infrastructure SRE function focused on keeping infrastructure and applications reliable, efficient, and cost-effective at global scale. The team supports a rapidly expanding worldwide user base by managing production stability, capacity planning, traffic scheduling, fault tolerance, disaster recovery, emergency handling, automation, and operations platform development.
This role also centers on building the foundational engineering for infrastructure products and components. The objective is to improve the operations and maintenance (O&M) architecture, create automated operations platforms, and apply data-driven and intelligent operations practices to solve large-scale cluster management challenges. The long-term goal is to provide stable, efficient, and low-cost serverless infrastructure for Mid-Platform and Business teams while helping the organization build a leading SRE capability.
What you will do
- Guide and grow a team of engineers responsible for scalable and dependable infrastructure platform systems.
- Stay hands-on with technical work while also managing people and team execution.
- Offer technical direction and practical support to team members and partner teams.
- Work closely across functions and with internal and external stakeholders to advance engineering initiatives.
- Drive innovation within the team by introducing new ideas, methods, and technologies.
Minimum qualifications
Candidates should bring strong experience in analyzing and resolving issues in distributed systems. A bachelor’s or master’s degree in Computer Science, software development, systems engineering, or a closely related technical discipline is required, as is practical programming experience in Python or Go (Golang).
Preferred background
Strong communication and collaboration skills are important, especially when working across data science and infrastructure groups. Hands-on experience designing, building, scaling, and debugging platform solutions is highly valued, along with a solid grasp of code optimization and the automation of recurring work. Familiarity with storage, database, and messaging systems is preferred, including HDFS, object storage, file storage, key-value, table, and graph stores, Redis, MySQL, MongoDB, message queues (MQ), and Kafka. Experience with Kubernetes, Docker/containers, AIOps, Spark, Flink, function-as-a-service platforms, RPC frameworks, and service mesh technologies is also preferred.
About the company
The employer is a global short-form video platform with offices across major cities worldwide. Its mission is to inspire creativity and bring joy through a diverse and fast-moving workplace.
Why this role stands out
The organization emphasizes curiosity, humility, resilience, and continuous iteration. It also highlights an inclusive culture where different backgrounds and perspectives are valued and where teams work together to create meaningful impact for users and communities around the world.
Diversity and inclusion
The company is committed to building an inclusive environment and supporting a workforce that reflects the many communities it serves.