DevOps / SRE Engineer - AI Platform
เต็มเวลา
Makro PRO
The DevOps / SRE Engineer owns the operational substrate of an AI-native retail decisioning platform — infrastructure, CI / CD, observability, cost meter, and incident response for a system that runs production agents taking real business actions. The role builds on the enterprise Terraform standard, CI / CD spine, and FinOps tagging policy rather than reinventing parallel infrastructure.
Remote candidates outside of Thailand are welcome to apply.
Key Responsibilities:
- Adopt the enterprise Terraform standard and module library for all platform infrastructure; author platform-specific modules where needed (agent runtime, vector DB, knowledge graph); run drift detection weekly.
- Build platform-specific CI / CD pipelines on the enterprise spine — service deploys, agent deploys, eval-gate enforcement; integrate eval gates so no agent reaches production without eval pass.
- Operate rollback orchestration with sub-15-minute recovery; quarterly game days.
- Own the platform observability stack — OpenTelemetry, Langfuse for LLM traces, custom dashboards for per-agent cost.
- Implement the per-agent cost meter end-to-end — token counts, vector queries, model inference, downstream LLM Gateway costs; surface cost data to the enterprise GenAI cost dashboard.
- Stand up the platform on-call rotation; author runbooks for every production agent and service; lead incident response with measurable corrective actions.
- Implement platform cost-tagging policy consistent with the enterprise standard (team, domain, environment, project, agent, suite, persona); report monthly to Cost Review.
- Drive cost optimisation — right-sizing, caching, model routing decisions, reserved compute.
Requirements
- Bachelor's or Master's degree in Computer Science, Engineering, or a related discipline.
- 5+ years SRE / DevOps with production ownership.
- Terraform at scale — modules, state, drift, environment promotion.
- CI / CD for data + ML / AI services (GitLab CI / CD or comparable).
- Cloud platform (Azure preferred; AWS / GCP transferable).
- Observability — OpenTelemetry, Langfuse (or comparable LLM traces), custom dashboards.
- FinOps — tagging policies, attribution, optimisation.
- Incident response — on-call, post-mortems, runbook authorship.
Preferred Qualifications
- AI / agent platform SRE experience; cost-meter / chargeback systems built or operated.
- Multi-cloud production experience; open-source contributions to IaC / observability tooling.
- AI / ML / agent system observability instrumentation (LLM cost, agent cost, eval scores).
- Vendor certifications such as HashiCorp Terraform Associate / Professional, Azure Solutions Architect Associate, or Databricks Data Engineer Professional.
ตำแหน่งว่าง 27 วันที่ผ่านมา
งานที่คล้ายกัน ที่อาจน่าสนใจสำหรับคุณอิงตามตำแหน่งว่าง DevOps / SRE Engineer - AI Platform ใน กรุงเทพมหานคร
- ...ftware Factory platform and empower ou... ...puter Science, Engineering, or related fi... ...xperience in a DevOps Engineer role ...
- The Tech Lead — AI Platform is the senior technical leader f... ...the platform runtime, AI engine, and agent-orchestration...
- ... a technology platform that will help... ...allenge The DevOps Engineer helps increase... ...as a DevOps or SRE Engineer on a ...
- ...looking to hire AI Automation Spec... ...d communication platforms into unified wo... ...mputer Science, Engineering, Information Sy...
- The Head of Retail AI Platform is the senior leader accoun... ...programme with a senior engineering team. Remote candi...
- ...nior cloud engineer to ensure ... ...ation with DevOps & Developm... ...r with ACP Platform teams, con... ...+ years in SRE role is ma...
- ...he forefront of AI technology, gui... ...keholders, Data Engineers (for training d... ...onships with AI platform providers and L...
- We are looking for an enthusiastic, driven Pre-sales Engineer to join our fantastic sales team. Responsibilities includ...
- The AI Engineer builds production agents end-to-end on an AI... ...ative retail decisioning platform — prompt design, tool de...
- ...ber of Technical Staff, Platform Engineering Location: Bangkok, ... ...data pipelines, and the AI-assisted engineering sy...
- ...enior MS Azure DevOps Engineer to join our de... ... be exposed to AI and AI develop... ... orchestration platforms (e.g., Kuberne...
- About the Role As an AI Engineer on our Enterprise Systems team, you will be embedded directly in the trenches of o...
- ...ovision, manage, enhance, maintain, Deploy, and support platform managed service. - Develop and apply efficient plat...
- ...nd Lifestyle e-Commerce platform services. We build tech... ...ead of Site Reliability Engineering (SRE) is a pivotal leadershi...
- ...d LegalTech and AI-driven SaaS com... ...nce cloud-based platforms. As part of its... ...ct Managers and Engineers to create next-...
- ...looking for an AI Solution Archit... ...y with hands-on engineering and enjoy worki... ...nments on cloud platforms like AWS Bedroc...
- ... With that growth comes the need for a Software Engineer, Platform to join our newly formed Platform team and help us...งานระยะไกล
- ...le, our internal engine must operate at ... ... operations with AI, you are doing m... ...s, or automation platforms to solve specifi...
- ... a Software Engineer in our IT d... ... or GitHub, DevOps principal) ... ... Technology platform (preferably... ...exposure to AI technologie...
- รายละเอียดงาน 1. วางแผนกลยุทธ์ด้านการตลาด การโฆษณา และธุรกิจออนไลน์ 2. วางแผน และเขียนแผนการตลาด เพื่อโปรโมทสินค้า แ...
125 $
Thai AI Product Tester Project Type: Contract... ... be managed via the Upwork platform. While participating in...- ... mentor: Lead a team of DevOps Engineers in an Agile framework, ... ...es for new features. Platform development & observabi...
- รายละเอียดงาน บริษัท อธีราห์ คอร์ปอเรชั่น เป็นบริษัทที่ทำธุรกิจอยู่ในหลายประเภท ซึ่งธุรกิจหลักของเราคือขายสินค้าเกี่ย...
- ...The Tech Lead — AI Applications is... ...ail decisioning platform — the commercia... ...~ Be the senior engineering peer for all su...
- ...-time Software Engineer in Bangkok we’... ...uct Manager and DevOps Engineer, inclu... ...n C/C++ and Qt platform. Experience ...
- ...-edge cloud and AI technologies? ... ...ns on the Azure platform. You will colla... ...osely with data engineers, data scientist...
- ... help keep our analytics platform sharp, reliable, and bus... ...ely with product owners, engineers, BI developers, and busi...
- ...ead of Product (AI) will lead the ... ...s stakeholders, engineering teams, data tea... ...tal products or platforms within technolo...
- รายละเอียดงาน 1. กำหนดงานที่ได้รับ และปฏิบัติตามเ ป้าหมาย 2. ติดตามข่าวสาร และอัพเดทเทรนด์ใหม่ ๆ อยู่เสมอ 3. รับผิ...
- รายละเอียดงาน ต้องการผู้ร่วมงาน ที่ทำงานจริงและมีประสบการณ์ : การ LIVE สด ผ่านโปรแกรม obs ผ่านช่องทาง TikTok/ Social...
คุณต้องการรับตำแหน่งงานว่างเพิ่มเติมหรือไม่?
สมัครแล้วรับตำแหน่งงานว่างที่คล้ายกับ DevOps / SRE Engineer - AI Platform สมัครเป็นคนแรกเลย!
