With over 20 years of expertise, our client is a global leader in Vertical AI SaaS solutions, revolutionising how top firms across industries operate. This is more than just a job: it is a chance to join a company that values accountability, collaboration, and growth in a diverse and inclusive culture. The client provides a flexible, connected work environment that prioritises both work-life balance and career development, so you can thrive professionally and personally. Applicants from outside Portugal are welcome, and the client offers a comprehensive relocation package along with immigration and administrative support.
Key Responsibilities
1. Collaborate on New Features: Work with the Development and Product teams to design and build new features.
2. Troubleshoot and Fix Issues: Investigate and solve reliability problems in systems; work with other software engineers across the organisation to produce and roll out fixes.
3. Standardise Processes: Help create consistent practices across different teams and services, working with Site Reliability Engineers (SREs).
4. Improve Automation: Find ways to automate tasks like deployment, service management, and monitoring of services, and create tools to make this happen.
5. Define & Monitor Reliability Metrics: Establish and track Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) to measure system performance and ensure reliability.
6. Lead Post-Mortems & Incident Reviews: Conduct post-mortems to analyse outages and incidents, identify root causes, and drive improvements to prevent future failures.
7. Agile Work Style: Balance planned project work (sprints) with daily operational tasks.
8. On-Call Support: Be part of a 24/7 on-call rotation with 12-hour shifts.
What You Bring
1. Experience with Reliable Systems: You have worked on building systems that are fault-tolerant and scalable.
2. Knowledge of Databases: You’re familiar with relational databases such as SQL Server and PostgreSQL, as well as NoSQL databases.
3. Expertise in Tools: You’re skilled in tools for managing configurations and deployments, such as Ansible, Jenkins, and Azure DevOps.
4. Azure Experience: You have hands-on experience using Azure to run production workloads.
5. Scripting Skills: You can write scripts in Python, Perl, Go, or similar programming languages.
6. Understanding of CI/CD: You know how to set up and manage continuous integration and continuous deployment pipelines.
7. Windows Infrastructure Experience: You have experience managing Windows infrastructure running IIS (Internet Information Services).
8. Problem Solver: You enjoy fixing reliability issues and creating long-term solutions.
9. Automation Focus: You believe in automating tasks whenever possible.
10. Fluent in English: You can speak and write English confidently and clearly.
Area: Data and BI
Location: Lisbon
Contract type: Full-time
Specialisation: Information technology
Industry: IT
Salary: Negotiable
Work type: Hybrid
Experience level: Manager
Job reference: 2366282/001
Date posted: 14 March 2025
Consultant: Cátia Carlos