More Uptime, More Sleep

One-stop incident enrichment and automation for IT Operations and DevOps

TRY IT NOW
Click here to Watch Video  
Invision quixey blueshift tenjin chatwork nerdwallet

How it works?


Neptune doesn't replace your existing monitoring and alerting tools, instead it integrates with them

Neptune remediates alerts

Enrich your alerts automatically


Even before an oncall engineer is paged, collect all relevant context automatically and include it in the incident report
e.g. Collect relevant logs, graph snapshots, and show relevant events and incidents on the same host or app

Before enrichment

Neptune remediates alerts

After enrichment

Perfrom repetitive diagnostics and remediation tasks automatically


Capture context automatically for both false-positive and real alerts

Don't page engineers, fix repetitive alerts automatically

React quickly to alerts (short-term) and proactively root cause issues (long-term) with incident analytics

Powered by Event Driven Automation

For example

When pingdom alerts that your webserver is down, you can restart the tomcat server on the trigger host


Similarly

You can run any script (shell, ruby, python etc) on a single host or a cluster of hosts in response to your alert or event
Execute script

For example

When you get a high error rate alert from New Relic, you can fetch the latency graph split by application tiers automatically


Similarly

You can get any graph from your existing monitoring or graphing tools in response to your alert
Graph snapshot

For example

When you get transaction failure alert, you can fetch the last 500 lines of logs from sumo logic and search for relevant errors automatically


Similarly

You can get your logs (and errors in them) from your servers or logging tools like SumoLogic, Loggly, Logentries, ELK stack etc.
Get logs

For example

When you get an api error rate alert, you can post test data to all your internal apis to see the output, latency reponses, and quickly identify the failing api


Similarly

You can run health checks against all your applications or micro-services to identify the failing ones quickly
Run health checks

For example

When request throughput high alert comes, you can automatically scale up your heroku dynos or digitalocean droplets or your AWS dynamodb capacity


Similarly

You can run any cloud CLI command (heroku, AWS, digital ocean, softlayer, rackspace) in response to your alert or event

Run CLI commands

For example

Every day at 8pm, or when cpu utilization is less than 5% for 2 hours, you can stop all instances in a testing cluster, and restart them again at 6am in the morning


Similarly

You can start, stop, reboot or terminate a single cloud instance or a cluster of tagged instances in response to your alert
Run cloud api actions

For example

When you get an error rate high alert, you can quickly check whether heroku or any other third party service you use has an outage or a maintenance scheduled at that specific moment


Similarly

You can capture the snapshot of any webpage in response to your alert or event
Capture webpage snapshot

One-stop incident tracking and collaboration


collaborate with your team members
track all activity including manual and automated actions in a single incident report aggregated from all your existing tools and systems

Neptune remediates alerts

We take reliability and security very seriously


We built an incident response automation platform for AWS. Our team includes founding engineers from Amazon S3 and DynamoDB.
Our fault-tolerant architecture is designed to handle data center wide outages and region wide outages gracefully.

508,015

ALERTS PROCESSED

360,117

INCIDENTS REMEDIATED

60,019

DOWNTIME HOURS SAVED

$30.1 M

BUSINESS LOSSES SAVED

Integrations


Neptune works seamlessly with your existing tools & infrastructure across both cloud and on-premises

Monitoring, alerting & logging tools


Cloud or On-premise infrastructure


Trusted by


"Neptune has made it much easier for our engineers to gather all the data they need
to respond to site-up alerts quickly and effectively."

Eric Ogren, Head of DevOps

nerdwallet
quixey blueshift Invision nerdwallet joinhandshake persistiq way up locable tapify hiplead tenjin chatwork chatwork paidlabs chatwork

Easy to setup & use


It takes less than 5 min to get started
Just add your monitoring and alerting tool API key to send your alerts to Neptune