Show HN: I made an AI agent that helps DevOps teams resolve incidents 60% faster
platops.aiHey HN,
We built PlatOps to help platform teams resolve production incidents faster using AI. Our AI agent either auto-resolves incidents using workflows learned from your past incidents or guides your on-call engineers through resolution steps.
A common problem we hear from SRE teams is that incident resolution is still largely manual and relies heavily on tribal knowledge. Engineers waste precious time gathering information from multiple sources during incidents, and learnings from past incidents often stay trapped in postmortem docs.
PlatOps solves this by:
1. Automatically gathering relevant information (logs, metrics, docs) when an incident occurs
2. Learning from your team's past incident resolutions to suggest or automate fixes
3. Providing AI-guided resolution steps when manual intervention is needed
Teams using PlatOps have seen:
- 60% reduction in Mean Time To Recovery (MTTR)
- Faster onboarding of new on-call engineers
- Better knowledge retention from past incidents
We're looking for feedback from the HN community, especially from folks who manage production systems. Would love to hear your thoughts on:
- Current pain points in incident management
- Features you'd like to see
- Integration suggestions
Try it out at https://docs.platops.ai
Neat! Would this have a complete audit trail of every CRUD action taken, AI or Human?
Thanks for asking! Yes, we maintain a comprehensive audit trail of all actions taken during incident resolution, whether they're performed by the AI agent or human engineers.