Platform RTM provides the most comprehensive workload monitoring and reporting available for Platform LSF environments. It offers easy and effective monitoring of all workload scheduling and license usage facets through a single interface, allowing administrators to quickly resolve issues without service interruptions. A flexible, automated alert system also flags issues quickly to keep cluster resources up and running as needed. With its broad set of capabilities, Platform RTM can replace multiple tools in typical Platform LSF environments with a single easy-to-use, monitoring tool. This results in improved productivity for administrators and users alike, as well as reduced cost and complexity.
A tool for analyzing, correlating and visualizing large amounts of Platform LSF workload data, Platform Analytics enables data-driven decision making based on job, resource and license data collected from one or more Platform LSF clusters. The tool includes an innovative interface that is built on top of a powerful analytics engine, providing fast and easy results. Users can choose from a variety of pre-configured dashboards or choose to build their own for obtaining quick answers about the status of their HPC infrastructure and applications for optimal resource planning and utilization.
"The ability to monitor cluster availability and performance is imperative when we're running millions of design simulations to test our latest software releases", stated Steve MacQuiddy, IT Director Engineering Infrastructure, Cadence Design Systems. "Having the single Platform RTM dashboard allows us to simultaneously observe the entire cluster environment and it has not only made it easier for us to better balance our workloads, but it's also helped us optimize throughput for our critical jobs during peak usage."
"Keeping our HPC data centre on-line is critical for us when were running frequent tests of designs for our race cars. Even slight design tweaks need to be tested scrupulously before we can put the design into production for the race track", stated Matt Cadieux, IT Director, Red Bull Racing. "Platform Analytics allows us to both track our cluster usage, as well as identify any potential problems that might interfere with running design tests. It also allows the design team to plan peak usage around heavy test times so that the design process runs smoothly every time."
"Platform RTM 8 is built on the powerful and extensible open-source Cacti graphing framework and offers some powerful new features like Grid alarms, which allow us to quickly build alerts without resorting to Cacti graphs", stated Kevin Rota, CIO, Simulia, Dassault Systèmes. "RTM has allowed Simulia to easily access and visualize extensive amounts of data providing much better insight into how our LSF resources are being used and by whom. These new features will provide improvements in quality of service."
"Cluster administrators need to be able to monitor and analyze their cluster performance in order to troubleshoot potential issues and analyze usage patterns for better efficiency and use across their Platform LSF infrastructures", stated Louise Westoby, Senior Product Marketing Manager, Platform Computing. "With IT staff strapped for time these days, building homegrown monitoring, reporting and alerting systems is not a viable option. Platform RTM and Platform Analytics offers the full visibility users need to get the most out of their Platform LSF clusters, queues and jobs so administrators and users can maintain productivity and contain costs."
Unlike tools that only monitor infrastructure at a basic level, Platform RTM includes workload and resource-aware monitoring across all workload facets, including global clusters, hosts, licenses queues, users and log files. New features include:
Platform Analytics uses an innovative visualization tool to translate raw business data into usable information quickly and easily. Key features include: