I’ve decided to first run all the rules-based players by themselves, then do LLMs after that (time-permitting).
Here are the official rules-based-players-only results (I ran 50 simulations for 500 iterations each, and averaged all the results):
| Player | Avg winnings | Avg node degree |
|---|---|---|
| Aᵀ | 626.181 | 11.61 |
| Smooth Criminal | 625.379 | 8.02 |
| suspiciouslyOdd | 618.934 | 2.28 |
| Pocoyo | 552.462 | 7.50 |
| Sucker | 545.710 | 8.00 |
| CaffieneAddict | 511.135 | 7.63 |
| ColmTheGOAT | 504.759 | 7.96 |
| MyNameIsRetep | 485.058 | 7.57 |
| Goblin | 480.656 | 7.98 |
| Browser | 470.466 | 6.51 |
| Mean | 440.548 | 7.99 |
| Stephen Davies (not the real one) | 429.481 | 5.87 |
| TitForTat | 392.847 | 8.00 |
| GrilledCheeseSandwich | 338.137 | 7.98 |
| Evenstar | 313.667 | 2.65 |
| Peachy Pearl | 200.039 | 3.85 |
| Al-Khwarizmi_ | -1840.430 | 16.13 |
| StrawberryFinch | -2228.890 | 23.97 |
That last column is the average degree of the player’s node in the network at simulation’s end. I find that a fascinating column. One might think: “having too many friends is a recipe for failure,” given the large degree averages of the players at the bottom end of the scale. However, the big housefly in that ointment, of course, is that the winner of the competition (Aᵀ) had quite a high average degree.
Here’s a scatterplot of the results (omitting the outliers) if you’re interested. There are apparently multiple very different paths to victory: just look at the top three finishers!
Here’s one sample run (37 of 50, I believe) which is interesting to watch. (MartyMauser does particularly well in this one, but of course it’s the overall performance over many seeds that takes the cake.)
At any rate, I award the following points:
- +40XP to the nine players who beat TitForTat
- +30XP to the five players who didn’t
- +100XP to Aᵀ‘s final exam score (capped at 100XP)
- +50XP to Smooth Criminal‘s final exam score (capped at 100XP)
- +20XP to suspiciouslyOdd‘s final exam score (capped at 100XP)
- +10XP to Pocoyo‘s final exam score (capped at 100XP)
Congratulations to all our winners! And now on to the LLM competition…*(pant, pant)*







