Mô tả công việc
• Manage and operate systems on a large-scale platform.
• Implement SRE automation by developing automation across the stack and optimizing operational hours by reducing manual tasks.
• Eliminate toil through automation across all layers, including infrastructure provisioning, configuration management, deployment, testing, and operations on both on-premise and public clouds (e.g., AWS).
• Collaborate with development teams to design, deploy, and maintain reliable systems.
• Secure the system infrastructure by applying compliance standards (e.g., PCI DSS, ISO 27001).
• Provide 24/7/365 monitoring and troubleshooting for system incidents.
• Address other aspects of the system, such as security, scalability, and performance.
Yêu cầu
• At least 3 years of working experience in relevant fields with a strong background in Linux systems.
• Experience with technologies such as Kafka, RabbitMQ, Redis, Elasticsearch, MySQL, Mongo, ETCD….
• Proficiency in CI/CD, DevOps automation, and monitoring tools (e.g., Jenkins, GitLab, Grafana, Prometheus, Terraform, ArgoCD, Tracing). Hands-on experience with Jenkins pipeline scripting to extend CI/CD pipelines.
• Expertise in operating systems/software architecture, monitoring, and network protocols, as well as familiarity with microservices architecture and container orchestration using Kubernetes.
• Intermediate knowledge of modern architecture-related tools, including service mesh, API gateways, reverse proxies, and service proxies for cloud-native applications.
• Ability to containerize applications, write optimized Dockerfiles, and troubleshoot containerized application issues in local environments or Kubernetes clusters.
• Ability to code or script (Bash, Python, Go, …).
Nice-to-Have:
• Experience working in the payment platform domain.
• Familiarity with cloud platforms, particularly AWS.
• Experience managing a large-scale platform.
• Basic knowledge of PCI DSS compliance.
• Strong problem-solving skills.
• English communication skills.
#J-18808-Ljbffr