TOP LEVEL CATEGORIES
EXPLORE
Description:
This document describes the work of design, development and improvement of the Nagios monitoring system done in Cineca and used for the Tier-1 systems participating in the PRACE projects. Starting from the issues arisen by the complexity of the HPC systems and the related monitoring activities, the targeted solutions and their implementation are explained. The most important aspects of the implementation and the specific issues related to HPC will be described with a specific attention to the exascale clusters.
Current Version
Last Release Date
September 17, 2012
Compatible With
Owner
lmiltchev
Website
http://www.prace-ri.eu/
Download URL
http://www.prace-project.eu/IMG/pdf/Design_Development_and_Improvement_of_Nagios_System_Monitoring_for_Large_Clusters.pdf
You must be logged in to submit a review.
Your review has been submitted and is pending approval.
To:
From:
Your recommendation has been sent.