What you'll doHelps teams (such as product teams, platform teams and application teams) successfully adopt cloud infrastructure and platform services.
It is involved in design and implementation activities that result in new or improved cloud-related capabilities, and it brings skills and expertise to such areas as cloud technical architecture (for a workload's use of infrastructure as a service [IaaS] and platform as a service [PaaS] components); automating cloud management tasks, provisioning and configuration management; and other aspects involved in preparing and optimizing cloud solutionsSuccessful outcomes are likely to embrace infrastructure-as-code (IaC), DevOps and Agile ways of working and associated automation approaches, all underpinned by the cloud infrastructure engineer's solid understanding of networking and security in the cloudWill directly engage with customer teams, but will also work on cloud infrastructure platform capabilities that span multiple teamsCollaborates closely with cloud architects, product/platform teams, software developers, site reliability engineers (SREs), security, and network specialists, as well as other roles, particularly those in the infrastructure and operations.
Being an approachable team-player is therefore crucial for success, and willingness to lead initiatives is important tooSupports colleagues with complex (escalated) operational concerns in areas such as deployment activities, event management, incident and problem management, availability, capacity and service-level management, as well as service continuityIs expected to demonstrate strong attention to detail and a customer-centric mindset.
Inquisitiveness, determination, creativity, communicative and collaboration skills are important qualities tooThis role is based remotely/on-site, with some travel requirements (up to 40%)Provide expert knowledge on cloud infrastructure and platforms, to ensure our organization achieves its goals for cloud adoption.
This involves translating cloud strategy and architecture into efficient, resilient, and secure technical implementationsOffer cloud-engineering thought leadership in areas such as Compute and DataSupport architects working on new cloud solutions including analyzing requirements, supporting technical architecture activities, prototyping, design and development of infrastructure artifacts, testing, implementation, and the preparation for ongoing supportWork on cloud migration projects, including analyzing requirements and backlogs, identifying migration techniques, developing migration artifacts, executing processes, and ensuring preparations for ongoing supportDesign, build, deliver, maintain and improve infrastructure solutions.
This includes automation strategies such as IaC, configuration-as-code, policy-as-code, release orchestration and continuous integration/continuous delivery (CI/CD) pipelines, and collaborative ways of working (, DevOps).
Participate in change and release management processes, carrying out complex provisioning and configuration tasks manually, where needed.Research and prototype new tools and technologies to enhance cloud platform capabilities.
Proactively identify innovative ways to reduce toil, and teach, coach or mentor others to improve cloud outcomes using automationImprove reliability, scalability and efficiency by working with product engineers and site reliability engineers to ensure well-architected and thoughtfully operationalized cloud infrastructures.
This includes assisting with nonfunctional requirements, such as data protection, high availability, disaster recovery, monitoring requirements and efficiency considerations in different environmentsProvide subject matter expertise for all approved IaaS and PaaS services, respond promptly to escalated incidents and requests, and build reusable artifacts ready for deployment to cloud environmentsExert influence that lifts cloud engineering competency by participating in (and, where applicable, leading) organizational learning practices, such as communities of practice, dojos, hackathons and centres of excellence (COEs).
Actively participate in mentoring.
Practice continuous improvement and knowledge sharing (, providing KB articles, training and white papers)Participate in planning and optimization activities, including capacity, reliability, cost management and performance engineeringWork closely with security specialists to design, implement and test security controls, and ensure engineering activities align to security configuration guidanceEstablish logging, monitoring and observability solutions, including identification of requirements, design, implementation and operationalizationOptimize infrastructure integration in all scenarios — single cloud, multicloud and hybridConvey the pros and cons of cloud services and other cloud engineering topics to others at differing levels of cloud maturity and experience, and in different roles (, developers and business technologists).
Be forthcoming and open when the cloud is not the best solutionWork closely with third-party suppliers, both as an individual contributor and as a project lead, when required.
Engage with vendor technical support as the customer lead role when appropriateParticipate/Lead problem management activities, including post-mortem incident analysis, providing technical insight, documented findings, outcomes and recommendations as part of a root cause analysisSupport resilience activities —, disaster recovery (DR) testing, performance testing and tabletop planning exercisesWho you areExperience with these cloud provider services is essential: AWS, Azure DevOps-tooling such as CI/CD (, Jenkins, Jira, Confluence, Azure DevOps/ADO, TeamCity, GitHub, GitLab)Infrastructure-as-code approaches, role-specific automation tools and associated programming languages Ansible, Terraform, Packer Knowledge of software development frameworks/Languages: PythonContainer management: Docker, Rancher, Kubernetes, EKSVirtualization platforms: VMwareOperating systems: Windows and Linux including scripting experienceDatabase technologies and caching: Postgres, Oracle, MSSQL, NoSQL, Redis, CDN, MongoIdentity and access management: Active Directory/Azure ADMonitoring tools: AWS CloudWatch, Elastic Stack (Elastic Search/Logstash/Kibana), Grafana, LokiCloud networking Subnetting, Route Tables, Security Groups, VPC, VPC Peering, NACLS, VPN, Transit Gateways, optimizing for egress costsCloud security key management services, encryption, other core security services/controls the organization usesScalable architectures, including APIs, microservices and PaaS.Analyzing cloud spending and optimizing resources Implementing resilience (, multi-AZ, multi-region, backup and recovery tools)Cloud provider frameworks (, Well-Architected)Working with architecture tools and associated artifacts