"Measure model performance on test datasets. Use when assessing accuracy, precision, recall, and other metrics."
Rating: 4.9
Installs: 0
Category: Machine Learning
The skill provides a well-structured overview of model evaluation with clear workflow steps and appropriate metric coverage. The description adequately conveys the skill's purpose: assessing model performance. Task knowledge is good and covers classification and regression evaluation patterns with specific metrics. The structure is clean and logical, with distinct sections. However, novelty is limited: model evaluation is a common ML task that CLI agents with standard libraries can already handle reasonably well. The skill would offer more value beyond standard ML libraries if it added advanced features such as automated metric selection, statistical significance testing, or cross-validation automation.