Senior Site Reliability Engineer
We are an experienced innovation company!
We exist to unlock a better way to experience the world. Valtech is
one of the largest global business transformation agencies. We design,
build, and deliver transformative digital solutions for the world's
best-known brands.
We engage with our clients worldwide to discover new opportunities,
design & build digitally driven solutions, and run &
continuously optimize outcomes. We build intuitive, frictionless, and
connected experiences that improve human lives and make our client's
businesses grow.
The ideal candidate's skills
You are someone with 5 years of experience in the field of software engineering, devops engineer, qa engineering and/or cloud engineering of which at least the last 2 years as a dedicated Site Reliability Engineer. You feel comfortable to take the lead, make decisions and know how to mobilize and motivate people to set things in motion. In your current role, people come to you for advice on what to look for to determine the robustness of their production environments, advice for reliable deployment procedures, assistance in analysis of failure scenarios and ideas on how to mitigate or remediate those.
/ You are assertive with good communicative skills, capable of taking the lead and coaching a development team to make the right choices.
/ You have experience with incident management on a production environment of a public facing online service with high business value and preferably high traffic in a 24x7 fashion
/ You have experience in working in corporate environments
/ You have experience programming and scripting
/ You have at least basic knowledge of serverless services in one or more public cloud providers (AWS, Azure, GCP).
/ You have extensive knowledge of and experience with various monitoring systems, amongst which APM systems such as Datadog, New Relic, Dynatrace, Prometheus, Grafana
/ You have knowledge of and experience with various pipelining tools, such as GitHub, Azure DevOps, Gitlab, Jenkin
/ You have knowledge of and experience with microservices related technology: Docker, Kubernetes
/ You have a good conceptual understanding of software architecture and system thinking
/ You have worked as an engineer in a DevOps context
/ You have an excellent command of English (C1 or above)
Are familiar with the following technologies:
- Datadog (or APM equivalent)
- Argo CD
- Java / Springboot
- Kafka
- Kubenetes / EKS
- AWS
/ Have worked within the context of publicly accessible, highly available eCommerce platforms.
/ Have experience working in an international context with on- and off-shore teams.
The role
As a Site Reliability Engineer (SRE), you are the bridge between
software development and operations. You help us to deliver reliable
speed to our clients, allowing them to leverage the benefits of
continuous deployment without losing grip on customer experience. You
will work with our multidisciplinary teams in an essential DevOps way of
working where your main responsibility is to keep everyone focused on
production, while creating the facilities to do so.
Your responsibilities will be:
/ Work with teams to define SLIs and SLOs
/ Creating systems for observability
/ Work with teams to analyze failure scenarios and possible mitigations.
/ (Assisting to) create runbooks to remediate or prevent failure scenarios.
/ Reduce work that does not add value
/ Participate and facilitate incident management including On Call Duty
What do we offer in return?
- Private health insurance
We hope you will never need it, but nevertheless, we offer private health insurance to all our employees.
- Education program
We never stop learning, that’s why we offer our employees an educational program with training and certification.
- Wellbeing program
We all deserve to live a healthy and well-balanced life. It's not an option, it's a necessity!
- Free beverages
Enjoy free coffee, drinks, and snacks at work, or join one of our famous company dinners.
- Social events
We enjoy spending time together, not only at work. Ski trips, carting, laser-tag, wine tasting, picnics, cooking classes… you name it – we’ve done it! There are plenty of cool events to join and to get to know your colleagues.
- Competitive conditions
Besides a competitive salary and 24 days of vacation, you will join annual company events with the whole team.
- Challenging projects
Ready for a challenge? We guarantee you’ll find challenging projects at Valtech!
- Cool colleagues
What’s the most important thing in a job? Cool colleagues with whom you spent most of the time during the week. We have a lot of them!
- Honest feedback
Honesty, openness and respect are among our core values. We encourage an open feedback culture in order to build trust and grow together
Our company values
We SHARE our knowledge with our clients and colleagues all over the world. We value different opinions and embrace open discussions. We DARE to go into unknown territories. We dare to speak up and be totally honest. We CARE about the end-user experience, about our clients' businesses, and about the quality of the things we make. We want to make the world a better place through the work we do.
Say hello to your future. Apply!
- Department
- Engineering
- Role
- Cloud Engineer
- Locations
- Skopje, Bitola
- Remote status
- Hybrid Remote
- Employment type
- Full-time
About Valtech North Macedonia
We are a global digital agency focused on business transformation.
Having our consultants as the heart of our business, we believe that our teams are strengthened by recruiting from a truly diverse pool of exceptionally unique and talented individuals.
We are a people company. We share our knowledge and learnings; we dare to speak up and take risks. We care about one another, our clients, and the world.
#wearevaltech
Senior Site Reliability Engineer
Loading application form
Already working at Valtech North Macedonia?
Let’s recruit together and find your next colleague.