Taking the best out of your HPC infrastructure is complex, and while you try to do your best as an administrator to optimize it, your efforts can be hampered by end-users’ behaviors.
From the prediction of individual jobs’ resource needs based on their characteristics, to the projection of the cluster load or its energy consumption, Predict-IT is the framework to forecast the behavior of your clusters. Predict-IT embeds a series of machine learning algorithms that feed on the cluster’s logs: job scheduler accounting database, energy readings, applications’ logs… it learns from these historical data and continuously improves.
Integrated with your job scheduler, Predict-IT can warn users about potential issues with their submission parameters, or even directly act and update the jobs’ requirements to optimize the use of your HPC resources. Integrated with Analyze-IT, Predict-IT can help you plan future maintenance periods while limiting the impact on the production.