Incident Response Automation for DevOps

Existing monitoring tools just alert you to a problem in the middle of the night.
Now, Neptune lets you automatically act on that alert and fix your incident in minutes.

Neptune remediates alerts


Your incident MTTR goes down  


Your service availability improves


Best of all, no more midnight wake-up calls

Minimize downtime, avoid alert fatigue, and grow your business


Fix your simple alerts (disk, CPU, memory, etc.) automatically.
For complex alerts (high error rate, etc.), get the relevant context so that you can fix them in minutes.

COLLECT

Get relevant alert context and diagnostics in a single report, aggregated from all your tools, metrics, and systems

CORRELATE

Focus on the right alert with correlation analysis, relevancy factors, temporal analysis, and lessons learned from previous incidents

REMEDIATE

Fix simple alerts automatically and avoid midnight wake-up calls, so you can focus your time and effort on the right things

ANALYZE

Get reports and analytics on your top troublesome hosts, services and apps to facilitate permanent root-cause fixes

Powered by Event-driven automation


In response to an alert or an event, you can run any of the following actions. Your creativity is the only limit!

For example

When Pingdom alerts you that your web server is down, you can restart the Tomcat server on the host that triggered the alert


Similarly

You can run any script (shell, Ruby, Python, etc.) on a single host or a cluster of hosts in response to your alert or event
Execute script
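
To make the idea concrete, here is a minimal Python sketch of an event-driven rule: an incoming alert is matched against a rule table, and the matching command is run on the trigger host. This is not Neptune's implementation. The `Alert` fields, the `RULES` table, the use of `ssh`, and the `tomcat` service name are all assumptions made for illustration.

```python
import subprocess
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class Alert:
    source: str  # monitoring tool that sent the alert, e.g. "pingdom"
    check: str   # what failed, e.g. "webserver_down"
    host: str    # host that triggered the alert


# Hypothetical rule table: (source, check) -> command template.
# The ssh invocation and the "tomcat" service name are assumptions.
RULES: Dict[Tuple[str, str], List[str]] = {
    ("pingdom", "webserver_down"): [
        "ssh", "{host}", "sudo", "systemctl", "restart", "tomcat",
    ],
}


def build_command(alert: Alert) -> Optional[List[str]]:
    """Return the command for this alert, or None if no rule matches."""
    template = RULES.get((alert.source, alert.check))
    if template is None:
        return None
    return [part.format(host=alert.host) for part in template]


def handle_alert(alert: Alert, runner: Callable = subprocess.run) -> bool:
    """Run the matching action; return True if a rule fired."""
    command = build_command(alert)
    if command is None:
        return False
    runner(command, check=True)  # raises if the command exits non-zero
    return True
```

Passing the runner in as a parameter keeps the rule logic separate from command execution, so you can dry-run a rule by supplying a function that only records the command.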

For example

When you get a high error rate alert from New Relic, you can fetch the latency graph split by application tiers automatically


Similarly

You can get any graph from your existing monitoring or graphing tools in response to your alert
Graph snapshot

For example

When you get a transaction failure alert, you can fetch the last 500 lines of logs from Sumo Logic and search them for relevant errors automatically


Similarly

You can get your logs (and the errors in them) from your servers or from logging tools like Sumo Logic, Loggly, Logentries, the ELK stack, etc.
Get logs
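
As a rough sketch of what "fetch the last 500 lines and search for errors" can look like, the Python snippet below keeps only the tail of a log stream and filters it with a regular expression. The error pattern and the function name `recent_errors` are assumptions for illustration, and fetching the lines from a specific logging tool is deliberately left out.

```python
import re
from collections import deque
from typing import Iterable, List

# Assumed error markers; adjust the pattern to your own log format.
ERROR_PATTERN = re.compile(r"ERROR|FATAL|Exception")


def recent_errors(lines: Iterable[str], limit: int = 500) -> List[str]:
    """Return the error lines found in the last `limit` lines of a log."""
    # A bounded deque keeps only the newest `limit` lines in memory.
    tail = deque(lines, maxlen=limit)
    return [line.rstrip("\n") for line in tail if ERROR_PATTERN.search(line)]
```

Because `recent_errors` accepts any iterable of lines, the same function works on an open file handle or on lines returned by a logging tool's API.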

For example

When you get an API error rate alert, you can post test data to all your internal APIs to see their output and response latency, and quickly identify the failing API


Similarly

You can run health checks against all your applications or micro-services to identify the failing ones quickly
Run health checks
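
The sketch below shows one way to probe a set of services, record each one's latency, and list the failing ones. It is an illustration under assumptions, not Neptune's implementation: the endpoint names are hypothetical, and for brevity the default probe sends a plain GET rather than posting test data.

```python
import time
import urllib.request
from typing import Callable, Dict, List, Tuple


def http_probe(url: str, timeout: float = 5.0) -> int:
    """Send a GET request and return the HTTP status code."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.status


def run_health_checks(
    endpoints: Dict[str, str],
    probe: Callable[[str], int] = http_probe,
) -> List[Tuple[str, bool, float]]:
    """Probe every endpoint; return (name, healthy, latency_seconds) tuples."""
    results = []
    for name, url in endpoints.items():
        started = time.monotonic()
        try:
            healthy = 200 <= probe(url) < 300
        except Exception:
            # Timeouts, refused connections and HTTP errors all count as failures.
            healthy = False
        results.append((name, healthy, time.monotonic() - started))
    return results


def failing_services(results: List[Tuple[str, bool, float]]) -> List[str]:
    """Return the names of the services that failed their check."""
    return [name for name, healthy, _ in results if not healthy]
```

In use, you would pass a mapping such as `{"users": "http://users.internal/health"}` (a made-up URL) to `run_health_checks` and hand the results to `failing_services`.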

For example

When a high request throughput alert arrives, you can automatically scale up your Heroku dynos, your DigitalOcean droplets, or your AWS DynamoDB capacity


Similarly

You can run any cloud CLI command (Heroku, AWS, DigitalOcean, SoftLayer, Rackspace) in response to your alert or event

Run CLI commands

For example

Every day at 8 pm, or when CPU utilization stays below 5% for 2 hours, you can stop all instances in a testing cluster, and start them again at 6 am


Similarly

You can start, stop, reboot or terminate a single cloud instance or a cluster of tagged instances in response to your alert
Run cloud API actions
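
The schedule-or-idle rule from the example above can be expressed as a small decision function. The Python sketch below only decides what to do; calling your cloud provider's API to actually stop or start instances is left out. It assumes CPU samples arrive every 5 minutes (so 24 samples cover 2 hours) and that the rule is evaluated at least once during the 6 am hour. All names here are hypothetical.

```python
from datetime import datetime, time as clock_time
from typing import List, Optional

STOP_AT = clock_time(20, 0)   # 8 pm
START_AT = clock_time(6, 0)   # 6 am
IDLE_CPU_PERCENT = 5.0
IDLE_SAMPLES = 24             # 2 hours of samples at an assumed 5-minute interval


def is_idle(cpu_samples: List[float]) -> bool:
    """True if CPU stayed below the threshold for the full idle window."""
    recent = cpu_samples[-IDLE_SAMPLES:]
    return len(recent) == IDLE_SAMPLES and all(
        sample < IDLE_CPU_PERCENT for sample in recent
    )


def desired_action(
    now: datetime, cpu_samples: List[float], running: bool
) -> Optional[str]:
    """Return "stop", "start", or None for the testing cluster."""
    current = now.time()
    off_hours = current >= STOP_AT or current < START_AT
    if running and (off_hours or is_idle(cpu_samples)):
        return "stop"
    # Start only during the 6 am hour, so a cluster stopped for being
    # idle during the day is not restarted straight away.
    if not running and now.hour == START_AT.hour:
        return "start"
    return None
```

Keeping the decision separate from the cloud API call makes the rule easy to test with fixed timestamps and sample lists before it touches real instances.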

For example

When you get a high error rate alert, you can quickly check whether Heroku or any other third-party service you use has an outage or scheduled maintenance at that moment


Similarly

You can capture a snapshot of any webpage in response to your alert or event
Capture webpage snapshot

Trusted by


"Neptune has made it much easier for our engineers to gather all the data they need
to respond to site-up alerts quickly and effectively."

Eric Ogren, Head of DevOps

nerdwallet
nerdwallet, blueshift, joinhandshake, aptible, persistiq, way up, tenjin, locable, tapify, hiplead, paidlabs, imeos, shift

508,015

ALERTS PROCESSED

360,117

INCIDENTS REMEDIATED

60,019

DOWNTIME HOURS SAVED

$30.1 M

BUSINESS LOSSES SAVED

Integrations


Neptune works seamlessly with your existing tools & infrastructure across both cloud and on-premises

Monitoring, alerting & logging tools


Cloud or on-premises infrastructure


Easy to set up & use


It takes less than 5 minutes to automate your first alert



Add your monitoring tool API key to send your alerts to Neptune



Use our industry best-practice runbooks and templates to create your rule



Sleep peacefully knowing that Neptune is automating all the manual work for you

Our founders built an incident response automation platform for AWS. Now we are bringing it to everyone.

Start your free trial or book a demo slot