This analysis is based on scores and stats from individual rounds in the last ten PGA Tour events at Colonial Country Club: 3,922 rounds in total.
Random Forest Regressor is an ensemble learning method that constructs multiple decision trees during training and outputs the average prediction. It combines the predictions of several models to improve accuracy and robustness.
Feature importance is a technique used to interpret a machine learning model. It refers to the score that quantifies the contribution of each feature to the prediction made by the model.
In a Random Forest, the importance of a feature is computed by looking at how much the feature decreases the impurity (e.g., variance for regression tasks) across all the trees in the forest. The more a feature decreases the impurity, the more important it is considered.
The calculated importance scores for all features are then normalized to give relative importance as a percentage. This shows the relative contribution of each feature to the prediction task.
Features with high relative importance percentages have a strong impact on the model's predictions. They are crucial for accurate predictions and indicate key areas where performance matters most.
Features with low relative importance have a minimal impact on the model's predictions. While they can still contribute, they are less critical.
The analysis reveals a distinctive performance profile for success at Colonial Country Club compared to typical PGA Tour venues:
Critical Success Factors:
Overall, Colonial Country Club appears to place a premium on second-shot precision and putting performance, while being relatively more forgiving off the tee. Players with strong iron play and putting skills will have a significant advantage, while those who rely primarily on distance may find their usual advantage diminished. This aligns with Colonial's reputation as a classic, strategic course that doesn't necessarily favour the longest hitters, a characteristic highlighted by its nickname "Hogan's Alley" (after Ben Hogan, who won the tournament five times) and its famous "Horrible Horseshoe" (holes 3-5), which requires precision over power.
The table below shows the top-5 ranked players and their average estimated scores from the different Random Forest models above.
Rank | Player | Avg Score |
---|---|---|
1 | Scottie Scheffler | 68.45 |
2 | Harry Hall | 69.39 |
3 | Ryan Gerard | 69.55 |
4 | Jacob Bridgeman | 69.93 |
5 | Tommy Fleetwood | 69.97 |
Estimated scores for all players can be found here.