Now hiring

Platform Engineering Manager @ Tango

United StatesRemoteFull-timePosted today

Opens on the employer's site

About this role

*Applicants must be authorized to work in the U.S. for any employer. *We cannot sponsor employment-based visas at this time. Let’s Tango! Where Innovation Meets Impact. At Tango Analytics, we’re all about helping businesses make smarter decisions through powerful technology, insightful data, and a whole lot of collaboration. Whether you're a creative thinker, a strategic planner, a tech wizard, or a customer champion, there's a place for you on our team. We believe work should be meaningful and fun — so if you're ready to make a difference while enjoying the journey, come join us and let's Tango! We are looking for a Platform Engineering Manager to join our dynamic and growing Platform Engineering team. About the Role: We are seeking a Platform Engineering Manager to build and operate our AI - native Internal Developer Platform (IDP) - the foundational layer that powers engineering velocity across the organization. You will own multi - cloud infrastructure (AWS & Azure), define golden paths, drive cloud modernization aligned to Well - Architected Frameworks, and deliver the observability, sh ared services, and agentic infrastructure that give every team a production - ready foundation. A defining dimension of this role is partneri ng with peer engineering leaders to actively migrate teams onto the platform and positioning it as the organization's AI - first engineering foundation. Key Responsibilities : Platform Strategy & Architecture Own and execute the Platform roadmap: compute, networking, identity, observability, shared services, and AI/ML tooling across AWS and Azure Lead cloud modernization against the AWS and Azure Well - Architected Frameworks across all five pillars: operational excellence, security, reliability, performance efficiency, and cost optimization Define golden paths - standardized self - service workflows for service scaffolding, DB provisioning, environment spin - up, and AI workload deployment - with escape hatches for edge cases Own multi - cloud strategy; ensure consistent IAM, networking, and FinOps governance across providers IaC & CI/CD Automation Drive OpenTofu /Ansible as source of truth for all infrastructure; enforce GitOps and policy - as - code for governance, auditability, and security Build and mature CI/CD pipelines (GitHub Actions, ArgoCD) to maximize deployment frequency, reduce lead time, and enable zero - ticket self - service provisioning Observability Own org - wide observability: metrics, logs, traces, and alerting – extended to AI/LLM signals (token usage, model latency, inference cost, agent task completion rates) Operate a centralized observability platform (Datadog /Signoz , OpenTelemetry, Grafana/Prometheus/Loki, or equivalent) delivered via golden paths; define SLIs/SLOs as onboarding defaults for all services Ensure full - stack coverage across infrastructure, Kubernetes, APM, distributed tracing, AI pipelines, and cost anomaly detection Shared Services Build and operate a self - service shared services catalog: secrets management, API gateways, model registries, and LLM gateways Rationalize duplicative per - team infrastructure; maintain shared services to production SLA standards with clear ownership and consistent security controls AI Platform & Agentic Infrastructure Own GPU/accelerated compute, model serving, vector databases, RAG pipelines, and LLM API gateway management (AWS Bedrock, Azure OpenAI, Anthropic) Build AI golden paths for self - service model deployment and LLM integration; design agentic infrastructure including orchestration runtimes, tool registries, memory/state services, and human - in - the - loop workflows Establish governance, cost controls, prompt injection guardrails, and model access policies for AI API usage and inference spend Partner with data science and ML engineering to translate agentic workflow requirements into reusable platform primitives Platform Adoption & Team Migration Collaborate on migration program: partner with peer managers to plan and execute structured workload migrations onto the platform with hands - on support - not just documentation Define onboarding playbooks covering golden paths, shared services, observability setup, CI/CD cutover, and AI capability onboarding; track and report adoption metrics to leadership Identify and remove migration blockers - technical gaps, missing services, or organizational friction — and feed them into the platform roadmap Developer Experience, Leadership & Culture Build a self - service developer portal (Backstage , GitHub or equivalent) with service catalogs, golden paths, and AI/agentic workflow templates; track DORA metrics and developer experience KPIs Hire, develop, and retain high - performing platform engineers; build AI fluency across the team and foster a platform - as - a - product culture with feedback loops, OKRs, and iterative roadmapping Lead architecture reviews; make pragmatic build - vs - buy decisions; partner with security and compliance on governance priorities Security, Compliance & FinOps Embed secure - by - default guardrails: IaC scanning, RBAC, secrets management, container hardening, and AI - specific controls (prompt injection defense, model access governance, data residency) Own cloud cost optimization across AWS and Azure including AI inference spend; maintain SOC 2/ISO 27001 compliance posture About You: Required 8+ years in infrastructure, DevOps, or platform engineering; 2+ years in engineering management Cloud: Deep hands - on AWS and Azure expertise: multi - cloud architecture, IAM, networking, compute, and AI/ML services (SageMaker, Bedrock, Azure OpenAI, Azure ML) IaC & CI/CD: Terraform required; GitOps , policy - as - code; GitHub Actions / ArgoCD at scale DP: Proven track record building an IDP with self - service workflows, golden paths, and developer portal (Backstage, GitHub , or equivalent) Observability: OpenTelemetry, Datadog, Signoz, or Prometheus/Grafana at scale; SLI/SLO definition and enforcement Shared Services: Built and operated multi - team shared service catalogs with production - grade SLAs Adoption: Led structured platform migration and adoption programs in partnership with peer engineering leaders Kubernetes & WAF: Kubernetes cluster management, Helm, RBAC, service mesh; AWS and Azure Well - Architected Framework reviews Strong cross - functional influencing skills; comfortable as a peer to engineering managers and product leaders Nice to Have AWS SA Pro / Azure Expert / CKA/CKAD | Python, Go, or Bash What We Offer We’re committed to creating an environment where you can thrive—professionally and personally. Our offerings include: Competitive Compensation We recognize and reward your contributions with a salary package that reflects your value. Comprehensive Benefits Including health, dental, and vision insurance, a 401(k) plan with company match, and generous paid time off to support your well-being. Flexible Work Environment Whether remote, hybrid, or in-office, we support work arrangements that promote productivity and balance. Inclusive & Collaborative Culture We foster a workplace where diverse perspectives are valued, teamwork is encouraged, and everyone has a voice. Tango is proud to be an equal opportunity employer. We are committed to equal opportunity regardless of race, ethnicity, religion, parental status, sexual orientation, age, citizenship, disability, or veteran status. Base pay offered is contingent on qualifications and other operational considerations. Base pay is just one piece of the full compensation structure offered at Tango. If this pay range is outside of your expectations, we still encourage you to apply and have a conversation with us. Base pay offered for this position is: $160,000 - 200,000 (+10% annual bonus)

Ready to apply?

Install the ResuMinder extension and we'll auto-fill the application in seconds — no rewriting.

Get the extension →
See how your CV scores — free
Platform Engineering Manager at Tango | ResuMinder Jobs