A comprehensive set of turnkey infrastructure integrations

Including dozens of AWS and Azure services, web, database, network, containers, orchestrations like Docker and Kubernetes, and more.


Complete visibility into the health and performance of applications and their underlying infrastructure

Quickly pinpoint the root cause of performance issues across the stack, down to a poor-performing line of code


Custom metrics and analytics

Analyze custom infrastructure, application, and business metrics

View Custom Metrics Monitoring Info
Powerful API that makes it easy to collect and create any custom metric

Achieve ultimate visibility and enhanced troubleshooting with synthetic and real user monitoring


Free APM Software

Catch bugs early on, and gain full visibility and insights into the applications you’re developing

View Product Info
Free, full-function APM tool for testing and troubleshooting application performance before moving into production

Dev Edition includes five traces per minute, 100 metrics, three hosts, and six containers


Log Management and Analytics powered by SolarWinds Loggly

Integrated, cost-effective, hosted, and scalable full-stack, multi-source log management

View Log Management and Analytics Info
Collect, search, and analyze log data in addition to your metrics and traces to quickly pinpoint application performance problems

Reduce mean time to resolution (MTTR) by quickly jumping from a trace or host view into the relevant logs to accelerate troubleshooting


Digital Experience Monitoring Powered by SolarWinds Pingdom

Make your websites faster and more reliable with easy-to-use web performance and digital experience monitoring

View Digital Experience Monitoring Info
Add client-side web application performance monitoring. Provide maximum observability by adding the user’s perspective.

Achieve ultimate visibility and enhanced troubleshooting with synthetic and real user monitoring


Note: This article was originally published on Librato, which has since been merged with SolarWinds®AppOptics. Explore custom metrics you can track and analyze with AppOptics.

One of the things that makes collectd such an effective collector is all of the metrics it gives you with a minimal amount of effort. As I touched on in a previous blog post, we took extra steps by adding Service-Side Filtering to help our customers filter the collectd metrics we store on their behalf. However, there are still a number of “noisy” metrics where it might be desirable to condense them before publishing them to Librato.

If you’re capturing system-level metrics on a host with plenty of CPU cores, you’ve probably noticed that collectd breaks out each CPU metric (e.g., idle, steal, and user) for each core. For many users, these individual streams offer little benefit on their own. Fortunately, collectd includes some plugins that will help us aggregate each of these metrics into a single stream on the source host.

First, you’ll need to edit your collectd configuration to enable the aggregation plugin and add the following Plugin block. This directs collectd to aggregate each metric across all cores on a single host.

LoadPlugin "aggregation"

<Plugin "aggregation">
    Plugin "cpu"
    Type "cpu"
    GroupBy "Host"
    GroupBy "TypeInstance"
    CalculateAverage true

Optionally, you can also enable “CalculateSum true” if you’d prefer to aggregate by sum instead of (or in addition to) average.

Next, you’ll want to enable the match_regex plugin. This configuration will filter out all of the per-core metrics from being sent to the output plugin. You must perform this step if you want to block these metrics from reaching Librato.

LoadPlugin "match_regex"

<Chain "PostCache">
    <Match regex>
      Plugin "^cpu$"
      PluginInstance "^[0-9]+$"
    <Target write>
      Plugin "aggregation"
    Target stop
  Target "write"

Note that the aggregation and match_regex plugins were introduced in collectd versions 5.2 and 4.6 respectively. Ensure that your installed package is at least version 5.2 in order to use both of these plugins.

These aggregated metrics will arrive in the new format “collectd.aggregation.cpu-average.cpu.wait” so you’ll need to add a whitelist for them in the Other Plugins field. A good wildcard would be “collectd.aggregation.cpu-*.cpu.*“. Make sure to click Update to save your changes.

After restarting your collectd process and updating your whitelist you should begin seeing the new aggregate metrics in your Librato account. Once these have arrived you can safely delete the per-core metrics.

Finally, you’ll want to update the composite function of the CPU instrument in your collectd dashboard to take this change into account:

divide( [sum(derive(series("collectd.aggregation.cpu-average.cpu.idle", "%"))), sum(derive(series("collectd.aggregation.cpu-average.cpu.*", "%")))] )

For more tips on using collectd with Librato, check out our Knowledge Base articles. And as always, if you still have questions about this or any of our services, you can find us online in our support chat or email us at support@librato.com.

Related articles

© 2024 SolarWinds Worldwide, LLC. All rights reserved.