ARE YOU THINKING ABOUT
AI for SRE?
When engineers have questions, AI SRE platforms pull real time information from your environment to find answers at 1/100th the cost of asking for help.
What caused this incident? 70% faster MTTR. 1/6th the number of people in the war room figuring it out.
Why is this alert firing? 98%+ reduction in time spent on alerts.
My code or yours? Developer self service instead of cross-team escalations.
Thousands of AI SRE tools in minutes
Humans rely on dashboards. AI relies on scripts whose output is optimized for LLM context windows. We call them "AI SRE tools," and can integrate thousands of them with your environment in minutes. Start with our tools then add your own. You have control.
Getting started with RunWhen
FIRST DAY: Thousands Of Default Tools
Install your first few thousands (read-only, safe) tools in minutes. The default AI SRE "Assistant" will use them.
Start asking questions immediately and see how it answers with the tools it has. Out of the box, it should have a pretty good feel for your infrastructure, common OSS components and stacktraces in your logs.
Get started with a kubeconfig and/or cloud credentials to cover a wide range of cloud infrastructure and application troubleshooting. No other integration needed.
FIRST WEEK: AI Learning Period
Either integrate with your existing alerts ("why is this firing?") or let RunWhen Assistants run continuously in the background.
They read the output of their tools and commit insights about your environment into their long term memory. The continue to get smarter about the tools they need to answer your question.
After about a week with the default Tasks, they should be ready to roll out to the team across dev/test environments.

FIRST MONTH: "30 New Tools In 30 Days"
RunWhen or our partners' deploy forward-deployed engineers work with your team to build "30 tools in 30 days" to answer questions that unblock developers and reduce MTTR during incidents.
This integrates your AI SRE Assistant more deeply with your application's APIs, data and workflows. Typical tools query application APIs, query databases, automate common/safe remediation steps in non-prod environments.
After 30 days, your AI SRE Assistants should be demonstrating quantifiable reductions in MTTR in the environments where it has been deployed.
PRODUCTION: Thumbs Up?
Each time an an engineer chats with an AI SRE Assistant, they get the chance to give a "thumbs up" if the session materially reduced MTTR or a "thumbs down" so the team can see where new tools are needed.
This results in i) a highly quantifiable business case, ii) a data-driven go/no decision about rolling this out to production, and iii) a high precision feedback loop when additional tools are needed to extend the system's capabilities.
Most teams are production-ready for incident response at the 30 day mark, and self-sufficient for building new tools if needed. Subsequent "30 tool in 30 day" sprints are available as professional services projects.



Can my team deploy RunWhen?
We work in the strictest financial services, health care and government environments in the industry
Need help with a business case?
Our team can help you build a business case for production environments, non-production environments, or both.
We typically do this after a 30 day PoV so we can use real production data in your environment.
How are other teams using AI?
24/7 developer self service
This team is reducing developer escalations by 62%, giving dev teams their own specialized Engineering Assistants to troubleshoot CI/CD and infrastructure issues in shared environments.
Bring on-call back in-house
This team is reducing MTTR and saving cost, replacing an under-performing outsourced on-call service. They are giving Engineering Assistants to their expert SREs that respond to alerts by drafting tickets.
A (paid) community?
Interested in turning your hard-earned production experience into AI-ready automation? Expert authors in our community receive royalties and bounties when RunWhen customers use their automation. Note - expect rigorous human and AI code reviews and continuous testing requirements to join the program.
Reduce observability costs? Let us show you how.
Unlike AI SRE tools built exclusively on observability data, our system leverages automation that pulls LLM-ready insights directly from your environment.
This means less observability spend rather than more, and less token spend processing data that was not built with LLMs in mind.

























.png)

.png)

