Interpreting Machine Learning Models with SHAP Values: Application to Crude Protein Prediction in Tamani Grass Pastures uri icon

resumo

  • Machine learning models such as XGBoost show strong potential for predicting pasture quality metrics like crude protein (CP) content in tamani grass (Panicum maximum). However, their 'black box' nature hinders practical adoption. To address this limitation, this study applied SHapley Additive exPlanations (SHAP) to interpret an XGBoost model and uncover how management practices (grazing interval, nitrogen fertilization, and pre- and post-grazing heights) and environmental factors (precipitation, temperature, and solar radiation) jointly influence CP predictions. Data were divided into 80% for training/validation and 20% for testing. Model performance was assessed with stratified 5-fold cross-validation, and hyperparameters were tuned via grid search. The XGBoost model yielded a Pearson correlation coefficient (r) of 0.78, a mean absolute error (MAE) of 1.45, and a coefficient of determination (R2) of 0.57. The results showed that precipitation in the range of 100-180 mm increased the predicted CP content. Application of 240 kg N ha-1 year-1 positively affected predicted CP, whereas a lower dose of 80 kg N ha-1 year-1 had a negative impact, reducing predicted levels of CP. These findings highlight the importance of integrated management strategies that combine grazing height, nitrogen fertilization, and grazing intervals to optimize crude protein levels in tamani grass pastures.

autores

  • Marina Maria Pedrosa Méca Ferreira de Castro
  • Gabriela Oliveira de Aquino Monteiro
  • Gelson dos Santos Difante
  • Denise Baptaglin Montagner
  • Valéria Pacheco Batista Euclides
  • Castro, Marina Maria Pedrosa Meca Ferreira
  • Jéssica Gomes Rodrigues
  • Marislayne de Gusmão Pereira
  • Pinto, M.A.
  • Ítavo, Luís Carlos Vinhas
  • Campos, Jecelen Adriane
  • Costa, Anderson Bessa da
  • Matsubara, Edson Takashi

data de publicação

  • dezembro 2025