Stockholm
AI RunOps Manager
Design, lead, and scale our central AI Agent operations.
About Algorithma
At Algorithma, we are focused on one thing: putting AI to work. We build AI Agents that deliver real impact in your business. Our agents tackle repetitive tasks, surface critical insights, and scale operations without scaling headcount. To enable this at scale, we are strengthening our central AI RunOps capability, the engine that ensures all AI Agents run reliably, securely, efficiently, and responsibly.
We are seeking an AI RunOps Manager to design, lead, and scale our central AI Agent operations. This role is pivotal in ensuring that our flexible build pods can focus on innovation, while the central RunOps function guarantees automation, governance, and seamless operations. You will be at the heart of making Agents stable, scalable, cost-efficient, and safe.
Key Responsibilities
- RunOps leadership: Own the day-to-day operations of LLM-based Agents in production, ensuring performance, reliability, and compliance.
- Automation first: Drive automation across deployment, monitoring, and lifecycle management to minimize manual intervention.
- Pod collaboration: Act as the bridge between build pods (development) and central operations (RunOps), ensuring smooth handovers and feedback loops.
- Governance and guardrails: Define and enforce best practices, playbooks, and responsible AI policies to ensure safe and compliant agent behavior.
- Monitoring and optimization: Implement observability frameworks to track prompts, responses, accuracy, latency, adoption, and cost, and optimize continuously.
- Incident and risk management: Lead incident response, root cause analysis, and proactive mitigation of risks like hallucinations or compliance breaches.
- Scalability: Build RunOps processes that can handle increasing adoption and complexity as agentic AI use cases grow.
Qualifications
- Proven experience in AI operations with a focus on LLM-based agents or intelligent automation.
- Strong skills in automation tooling, orchestration frameworks, and modern LLMOps/AgentOps practices.
- Cloud operations expertise (AWS, Azure, GCP) with experience in Kubernetes, serverless, and infrastructure-as-code for scalable deployments.
- Familiarity with agent orchestration tools (e.g., LangChain, LlamaIndex, vector databases, workflow engines).
- Experience in observability, monitoring, and optimization of AI agents, including cost, latency, and reliability.
- Strong collaboration skills to work across build pods and business stakeholders.
- Awareness of responsible AI guardrails and best practices in LLM safety.
- (Bonus) Knowledge of traditional MLOps practices, though this is not core to the role.
What Success Looks Like
- LLM Agent lifecycle managed with minimal manual effort, enabled by automation-first operations.
- RunOps standards, playbooks, and guardrails widely adopted across build pods.
- Reliable, cost-efficient, and scalable agent operations that enable faster innovation.
- Cloud and API usage optimized for both performance and cost.
- Algorithma recognized as a leader in responsible, scalable Agentic AI operations.
Why Join Us?
At Algorithma, you will help put Agentic AI to work where it matters most: in day-to-day operations that drive business outcomes. You'll join a dynamic, growing company where you can have a real impact, not only on our clients' transformation journeys, but also on how AI operations evolve as a discipline. If you thrive on building systems that combine innovation, automation, and governance, this is the role for you.