Senior DevOps Engineer
Restaurant365
Mexico CityMX$644,000 - MX$966,000 / yearDevOps & SysAdmin
About Restaurant365
Restaurant365 is a SaaS company disrupting the restaurant industry! Our cloud-based platform provides a unique, centralized solution for accounting and back-office operations for restaurants. Restaurant365’s culture is focused on empowering team members to produce top-notch results while elevating their skills. We’re constantly evolving and improving to make sure we are and always will be “Best in Class” ... and we want that for you too!
Role Overview
This role is based in Mexico City and follows a hybrid schedule (3 days per week in the office). As the position supports U.S. operations, team members may be required to work on certain Mexico holidays based on business and operational needs.
Responsibilities
- Architect, implement, and operate Azure infrastructure (App Services, Functions, Container Apps, networking, storage) to host Claude AI applications built by internal business users.
- Design, maintain, and improve CI/CD pipelines (Azure DevOps / GitHub Actions) for automated build, test, and deployment of user-developed applications.
- Establish and enforce a change management process including environment promotion gates, approval workflows, and rollback procedures.
- Serve as the primary technical owner of the hosted application platform, ensuring reliability, scalability, and cost optimization.
- Partner directly with internal application owners and business stakeholders to onboard new enterprise applications to the platform, advising on architecture, deployment, and support best practices.
- Define and enforce security policies for application sharing, including authentication, authorization, and network access controls.
- Build and monitoring observability solutions (Azure Monitor, Application Insights, Log Analytics) to provide proactive alerting and performance visibility.
- Author and maintain Infrastructure as Code (Terraform / Bicep) for all Azure resources, ensuring environments are reproducible and auditable.
- Drive incident response processes and conduct post-incident reviews to improve platform resilience.
- Mentor mid-level DevOps engineers and contribute to team standards, documentation, and runbooks.
- Develop and enforce governance policies defining standards for hosted Claude AI applications, including acceptable use policies, approval workflows for new deployments, and application lifecycle management.
- Own Azure cost management and FinOps practices, monitoring spend across hosted applications, identifying optimization opportunities, and providing regular cost reporting to leadership.
- Lead capacity planning and scaling strategy as internal adoption of the Claude AI hosting platform grows, ensuring infrastructure stays ahead of demand.
- Manage the vendor relationship with Anthropic, coordinating on API usage, rate limits, enterprise support, and roadmap alignment.
- Design and deliver training and enablement sessions for business users on how to properly build, package, and submit Claude AI applications for secure hosting.
- Collaborate closely with InfoSec and IT Security teams on Entra ID integration, Conditional Access policies, vulnerability remediation, and compliance-related infrastructure controls.
- Ensure platform operations meet compliance and audit requirements (SOC 2, SOX, or other applicable frameworks), maintaining evidence of controls and supporting audit activities.
- Participate in an on-call rotation providing after-hours production support for hosted applications, and define escalation procedures for the broader team.
Requirements
- 5+ years of hands-on DevOps / Cloud Infrastructure engineering experience.
- Deep expertise in Microsoft Azure (App Services, AKS, Functions, Networking, Entra ID, Key Vault).
- Strong experience designing and operating CI/CD pipelines (Azure DevOps, GitHub Actions, or similar).
- Proficiency with Infrastructure as Code tools (Terraform, Bicep, or ARM templates).
- Solid understanding of identity and access management, OAuth 2.0 flows, and secrets management patterns.
- Experience with container orchestration (Docker, Kubernetes / AKS).
- Demonstrated ability to implement change management frameworks in an enterprise environment.
- Experience with cloud cost management, FinOps principles, or Azure Cost Management tooling.
- Familiarity with compliance frameworks (SOC 2, SOX) and supporting audit processes in a cloud environment.
- Excellent communication skills with the ability to translate technical concepts for non-technical business users.
Preferred Qualifications
- Experience supporting AI/ML application workloads or LLM-powered applications.
- Familiarity with the Anthropic API, Claude AI platform, or similar LLM service integrations.
- Azure certifications (AZ-104, AZ-400, AZ-305).
- Experience with policy-as-code frameworks (Azure Policy, OPA/Rego).
- Background in restaurant technology, SaaS platforms, or multi-tenant application hosting.
- Experience managing vendor relationships and coordinating with third-party platform providers.
- Track record of building internal developer platforms or self-service infrastructure tooling.
Benefits and Compensation
Compensation for this position is 1,230,000 - 1,540,000 MXN annually ($102,500 - $128,333 monthly), depending on experience.
We also offer a comprehensive benefits package designed to support your health, well-being, and work-life balance. Benefit options include:
- Health Insurance
- Dental Insurance
- Vision Insurance
- Life Insurance
- Meal Allowance
- Monthly Internet & Electricity Stipend
- Mental Health Support Resources
- And more!
DYN365, Inc d/b/a Restaurant365 is an equal opportunity employer.