Other Tutorials And HOWTOs

Design, Development and Improvement of Nagios System Monitoring for Large Clusters

Description:

This document describes the work of design, development and improvement of the Nagios monitoring system done in Cineca and used for the Tier-1 systems participating in the PRACE projects. Starting from the issues arisen by the complexity of the HPC systems and the related monitoring activities, the targeted solutions and their implementation are explained. The most important aspects of the implementation and the specific issues related to HPC will be described with a specific attention to the exascale clusters.

Current Version

Last Release Date

September 17, 2012

Compatible With

Owner


Project Notes
Reviews (0) Add a Review
Add a Review

You must be logged in to submit a review.

Thank you for your review!

Your review has been submitted and is pending approval.

Recommend

To:


From:


Thank you for your recommendation!

Your recommendation has been sent.

Project Stats
Rating
0 (0)
Favorites
0
Views
54,254