Senior Cloud Platform Site Reliability Engineer (Wallet Focus)

Moledao

$5-15K [Monthly]
Remote | 5-10 years of experience | Bachelor's degree | Full-time

Remote Details

Open countries: Worldwide

Language requirements: English | Chinese

Job Description


We are hiring a Senior Site Reliability Engineer (Wallet Operations). The role is responsible for ensuring the stability, availability, and performance of core business infrastructure on AWS; managing global production environments; building scalable, highly available systems; advancing automation and observability platforms; and maintaining security and compliance standards.

Remote work, with optional bases in Singapore, Malaysia, or Abu Dhabi

Job Purpose

  • Responsible for deployment-related tasks
  • Ensure reliable and efficient system operation at scale
  • Develop tools to enhance availability, performance, and incident response capabilities

Responsibilities

  1. Ensure the stability, availability, and high performance of AWS global infrastructure, and own the production environment SLAs.
  2. Design, operate, and troubleshoot cloud-native components such as Kubernetes, Envoy, Service Mesh (Istio/Linkerd), and Ingress.
  3. Improve operational efficiency through automation and platform tools (IaC, CI/CD), building observability, self-healing, and rapid recovery capabilities.
  4. Implement and maintain operational security: access controls (AWS IAM/K8s RBAC), network security policies, vulnerability management, and incident response.
  5. Develop a global operations framework covering capacity planning, monitoring and alerting (Prometheus/ELK), CI/CD (GitLab/Jenkins), disaster recovery, and automated failover.
  6. Gain deep understanding of the business architecture, participate in designing and reviewing high availability/disaster recovery solutions, and continuously optimize costs.

Qualifications

  • More than 5 years of Linux operations/SRE/DevOps experience, including operating large-scale distributed systems
  • Proficient in core AWS services (EC2/S3/VPC/IAM/ELB/RDS, etc.), with experience in architecture, operations, and cost optimization
  • Deep understanding of Kubernetes, with experience in production operations, performance tuning, and troubleshooting of large-scale clusters
  • Familiar with Envoy, Istio/Linkerd, Nginx/Istio Ingress (L7 traffic management)
  • Strong security awareness, with knowledge of common system/network/application vulnerabilities and mitigation strategies
  • Proficient in at least one scripting/programming language (Go/Python/Shell) for automating and engineering complex operations tasks
  • Experience with observability platforms such as Prometheus and ELK, including capacity planning and performance testing

Preferred Qualifications

  • Experience managing or leading SRE/platform/tooling teams
  • Advanced hands-on experience with Prometheus, Grafana, and ELK
  • Certifications in AWS (SAA/SAP) or Kubernetes (CKA/CKS, etc.)

Dorothy Mole

HR Officer, Moledao


Posted on 25 December 2025

Moledao

<50 employees

DAOs
