|
|
Table of contents
1. Correlation of game stats and mmr
- 1.1 Motivation
- 1.2 Some comments about the method
- i) Data and sampling
- ii) mmr distribution of players
- 1.3 Wards and mmr correlation
- 1.4 Duration of the game and mmr correlation
- 1.5 Deny count and mmr correlation
- 1.6 APM and mmr correlation
- 1.7 GPM, XPM and mmr correlation
- 1.8 Games played and mmr correlation
- 1.9 Win ratio and mmr correlation
- 1.10 (K+A)/D and mmr correlation
- 1.11 Are the calculations consistent despite using sample or Is the sample big enough?
2. Correlation of time when game is played and mmr
- 2.1 Motivation
- 2.2. Some notes about the method
- 2.3. When is tmm played
- 2.4 What is average mmr of these matches; conclusion
3. Duration distribution of the matches
- 3.1. Motivation
- 3.2. Short comment about method and general distribution of matched duration
- 3.3. Decrease of average match time with mmr
- 3.4. Average duration distribution in different mmr brackets; conclusion
4. Account age and mmr correlation
- 4.1. Motivation
- 4.2. Method and how to deduce when was account created
- 4.3. mmr distribution of players with respect to their account age; conclusion
5. Heroes picked e and mmr correlation
- 4.1. Motivation
- 4.2. Comment about method
- 4.3. Distribution of picking rate of agility heroes
- 5.4. Distribution of picking rate of intelligence heroes
- 5.5. Distribution of picking rate of strength heroes
- 5.6. Distribution of picking rate of ranged/melee heroes
- 5.7. Distribution of picking rate for different heroes (3 examples)
- 5.8. Distribution of picking rate for different heroes (all heroes - download)
6. Future directions - some additional questions that might be interesting to analyze?
7. Other threads that might interest you
8. Contact
1. Correlation of game stats and mmr
1.1 Motivation
As a regular forum user I have spent a lot of time lurking and reading different topics and post such as relation of APM with MMR, importance of denies, KDA ratio etc. Often these topic finish in flame and users defend their opinion based on personal or anecdotal evidence at best. This thread may shed just a bit of light on these topics as the correlation between of mmr and other game statistics is explored.
1.2 Some comments about the method
i) Data and sampling
ii) mmr distribution of players
1.3 Wards and mmr correlation
1.4 Duration of the game and mmr correlation
1.5 Deny count and mmr correlation
1.6 APM and mmr correlation
1.7 GPM, XPM and mmr correlation
1.8 Games played and mmr correlation
1.9 Win ratio and mmr correlation
1.10 (K+A)/D and mmr correlation
1.11 Are the calculations consistent despite using sample or Is the sample big enough?
2. Correlation of time when game is played and mmr
2.1 Motivation
What is the distribution of players throughout the day and week? Do better players really play during the night and weekday, while evenings and and weekends are reserved for scrubs? An attempt to answer these questions is given below.
2.2. Some notes about the method
2.3. When is tmm played
2.4 What is average mmr of these matches; conclusion
3. Duration distribution of matches
3.1. Motivation
In section 1.4, one can observe that better players finish their matches faster. Why is that? Do they recognize earlier that the game is lost and concede? Or is there less very long games? What is the duration distribution of matches anyway? If matched tend to last on 35 minutes on average are the most of them in that range?
3.2. Short comment about method and general distribution of matches duration
3.2. Decrease of average match time with mmr
3.3. Average duration distribution in different mmr brackets; conclusion
Last edited by OsianII; 12-04-2012 at 08:31 AM.
Big thread of statistical goodies (update - Dec 4) - http://bit.ly/U7Z9Hi
Alt avatars price change http://bit.ly/V2wXIz ł AltAvatars price change mod http://bit.ly/NwOp7n
BangNinja mod fix http://bit.ly/U2VgYU ł Breaky & Zyori in love http://bit.ly/PytPAi
Quickfix for linux and mac 2.6.11 http://bit.ly/Udqwmw ł Highest gpm in first 20 min calculated http://bit.ly/Qf4Q5P
4. Account age and MMR correlation
4.1. Motivation
In section 1.8., I concluded that players with more matches, in very general terms, tend to play better. Perhaps it is also important not just how many matches you have played, but when did you start to play? In early days of HoN people would advertise their skills with "I have played x years of DotA and ...". Without reliable performance measurement, especially for more casual players, time spent with game was often one of the main indicators of how good the player was. Is it really so? When did people create their accounts and what is their mmr today?
4.2. Method and how to deduce when was account created
4.3. mmr distribution of players with respect to their account age; conclusion
5. Heroes picked and mmr correlation
5.1. Motivation
Is is common knowledge that better and/or competitive players tend to pick differently then typical "pub" players. In general, it is thought that new strategies are created on highest level and then slowly trickle down the bracket as time goes by. Is it really so? Are there really some "pubstomp" heroes?
5.2. Comments about method
5.3. Distribution of picking rate of agility heroes
5.4. Distribution of picking rate of intelligence heroes
5.5. Distribution of picking rate of strength heroes
5.6. Distribution of picking rate of ranged/melee heroes
5.7. Distribution of picking rate for different heroes (3 examples)
5.8. Distribution of picking rate for different heroes (all heroes - download)
6. Future directions - some additional questions that might be interesting to analyze?
7. Other threads that might interest you
8. Contact
Feel free to comment and ask questions. Suggestions in general, as well as possible ideas for future project are also warmly welcomed.
Last edited by OsianII; 01-24-2013 at 01:16 PM.
Big thread of statistical goodies (update - Dec 4) - http://bit.ly/U7Z9Hi
Alt avatars price change http://bit.ly/V2wXIz ł AltAvatars price change mod http://bit.ly/NwOp7n
BangNinja mod fix http://bit.ly/U2VgYU ł Breaky & Zyori in love http://bit.ly/PytPAi
Quickfix for linux and mac 2.6.11 http://bit.ly/Udqwmw ł Highest gpm in first 20 min calculated http://bit.ly/Qf4Q5P
Nicely laid out and informative. Thank you for this; shame it does go out of date in relative speed to other mechanics posts.
Very interesting actually.
Though your first graph doesn't appear to be very accurate.
It makes it look like the number of 1800 mmr players are about the same as the number of 1500 mmr players, which is definitely not the case.
Good job overall mate.
Forum Moderators are not S2 Games employees. My posts in no way represent the view of S2 Games or any of its staff.
Please use the report post functionto have me review a post that you believe is breaking the Forum Rules.
Check the Sticky Threads for additional information on this sub-forum and the Announcement Threads for more information about Heroes of Newerth as a whole!
-----------------------------
HoNored | Super Beta Tester | Mechanics Moderator | Pre-purchased HoN | Skype: Necrothica | DeviantArt: Necrothic
Some really nice data you got there. How did you acquire it? Is it available somewere? Are you publishing the results? And if yes which conf/journal?
I lol HrvatNisam ni skontao odmah!
Last edited by psvrisak; 10-29-2012 at 05:25 AM.
Ok, so the first graph shows what was the sample for the analysis. It is has been specially made so that it is approximately uniform across all mmr and as such it does not shows what is the distribution of player base according to their mmr. For instance, image below what would be the sample, if I uniformly picked up 3000 players, absolutely randomly, not caring about their mmr.
That is not what I want. If I did that, I would have mass of data points in the 1500 range, and no data points on the edges of mmr range. For instance, in the above graph, one can see random number generator did not select a single player above 1900 range. What I want is to have sample that is uniform in the mmr, so that each mmr range gives approximately same (large) number of points.
One possible solution would be to test much larger number of points; well pretty much all players, or a large subsection of the player base (my first estimate that one would have to test at least 1/3 of player base to get satisfactory results, which is around 100 000 players). That would be computationally hard and again would miss large number of players with high mmr and low mmr (the ones in which I am interested) and give me bunch of points around 1500 range.
That is why the random seed is modified, that it picks players in a way that it has larger probability to pick players with high or low mmr. I have used one set of paramters of box distribution, which is seen below.
In this way, sample is pretty uniform across mmr range which allows us to test the correlation relations across the whole mmr range and that is what is shown in the first figure of the first post. I hope that this clears this misconfusion? Fire away if something is left unclear!
I am sending you pm about data. I do not think that scientific journals are quite interested in a analysis like thisOriginally Posted by psvrisaak
To be honest, I did this because I have some free time now (unemployed), but I guess this could make interesting talk on some gamers conference or something like that but I am not sure that it has some value in the "real" world.
At least for the moment I hope to update this thread in future. For instance, it would be interesting to see if there is change in game duration after the introduction of the new voting system and will the shape of the game duration curve change.Originally Posted by SmurfinBird
Last edited by OsianII; 10-29-2012 at 06:34 AM.
Big thread of statistical goodies (update - Dec 4) - http://bit.ly/U7Z9Hi
Alt avatars price change http://bit.ly/V2wXIz ł AltAvatars price change mod http://bit.ly/NwOp7n
BangNinja mod fix http://bit.ly/U2VgYU ł Breaky & Zyori in love http://bit.ly/PytPAi
Quickfix for linux and mac 2.6.11 http://bit.ly/Udqwmw ł Highest gpm in first 20 min calculated http://bit.ly/Qf4Q5P
Aaah, that makes sense.
Job well done, good sir.
Forum Moderators are not S2 Games employees. My posts in no way represent the view of S2 Games or any of its staff.
Please use the report post functionto have me review a post that you believe is breaking the Forum Rules.
Check the Sticky Threads for additional information on this sub-forum and the Announcement Threads for more information about Heroes of Newerth as a whole!
-----------------------------
HoNored | Super Beta Tester | Mechanics Moderator | Pre-purchased HoN | Skype: Necrothica | DeviantArt: Necrothic
The most interesting graph (for me at least) is the one showing wards related to mmr, it seems that there's no significant change over 1500, which is not my experience at all. Maybe it's the quality of wards put (positioning, awareness of anti-wards and knowing whether to defend or attack, etc.), rather than number that distinguishes 1500's and 1800's for instance.
Another interesting graph is the duration of the game. The change seems cosmetic, but your analysis seems to be that better players execute strategies better, thus, ending games faster. From what I've experienced higher mmr players opt rather to be on the safe side and secure an in-game advantage by farming and giving the killing blow when they know it will be 100% succesfull. The shorter duration, in my opinion, comes from experience and knowing that some games are unsalvagebale and the concede vote gets thru much more often.
Just my thoughts. Really great work. Things that you feel are right, now suddenly have some statistical evidence.
Ward number does not really change much at all. It's the quality that counts. You normally don't have more than 1-2 active wards up until 1650 at least from my experience, and for other higher tier games I've watched I never see the map full of wards either. The main difference is that people know they need wards up on pull camps or rune spots early on after a certain rank unless they want to lose the game. Those crucial early-game wards don't affect the total outcome that much and are the main difference from the 1300-1400 hell I've been to months ago.
If you notice, there's a significant dispersion of wards per game after 1600, which is hard to take counclusions. I'm guessing it's probably due to game lenght variation cause games tend to be either one sided stomps or better executed strategies/pushes. If you're getting stomped supports don't really have much gold or opportunity to roam to place wards. On another hand, if you're winning they will most likely secure your farming spots as well as set aggressive/kong wards as the games goes on and that is maybe why the ward number does not increase as much.
The APM correlation I found quite funny tbh. However, correlation =/= casuation. Game lenght is decreased as mmr goes up because there are probably less trolls and griefers thumbing down concede votes as you progress. The CC15 zone is very populated around 1400-1600, though. Lol
Anyway, great work here. How did you record these results anyway? And would it be possible to compare results between regions (say us vs eu?).
Last edited by zstarkey42; 10-29-2012 at 03:04 PM.
wait,where did you got all this data?how do you know it's true?looking forward for those 3 "future directions".Seem pretty interesting topics.
Last edited by Clytemnestra; 10-31-2012 at 07:33 AM.
Of course, as it happens, I have noticed some errors in my method for the 2nd exercise, so you should disregard it. First exercise, which provoked much more interest, should be correct. I will change wrong results it with new, hopefully correct, results "soon". To be honest, I should have put tag [WIP] when first making a thread.
@zstarkey42 I have sent you a pm, hopefully you have seen in it.
Big thread of statistical goodies (update - Dec 4) - http://bit.ly/U7Z9Hi
Alt avatars price change http://bit.ly/V2wXIz ł AltAvatars price change mod http://bit.ly/NwOp7n
BangNinja mod fix http://bit.ly/U2VgYU ł Breaky & Zyori in love http://bit.ly/PytPAi
Quickfix for linux and mac 2.6.11 http://bit.ly/Udqwmw ł Highest gpm in first 20 min calculated http://bit.ly/Qf4Q5P
Any chance for you to add confidence intervals to your graphs in the future?
Last edited by Therkel; 11-08-2012 at 04:01 AM. Reason: I misspelled confidence oO
Big thread of statistical goodies (update - Dec 4) - http://bit.ly/U7Z9Hi
Alt avatars price change http://bit.ly/V2wXIz ł AltAvatars price change mod http://bit.ly/NwOp7n
BangNinja mod fix http://bit.ly/U2VgYU ł Breaky & Zyori in love http://bit.ly/PytPAi
Quickfix for linux and mac 2.6.11 http://bit.ly/Udqwmw ł Highest gpm in first 20 min calculated http://bit.ly/Qf4Q5P
Hy, guys I did one pretty massive update - eliminated mistakes, added sections and clarifications to the problems that were discussed in first version, added some new things, added more clarifications about data and method itd... I have thought about putting some tag to new stuff, like [new], but I concluded that it would be too distracting, especially as only few part in first section have been left totally unchanged; I would suggest treating the whole thing as absolutely new.
Again, I would welcome comments, suggestions etc.. Hopefully there will be some more healthy discussion like the one that followed first version!
First post link
http://forums.heroesofnewerth.com/sh...stical-goodies
Big thread of statistical goodies (update - Dec 4) - http://bit.ly/U7Z9Hi
Alt avatars price change http://bit.ly/V2wXIz ł AltAvatars price change mod http://bit.ly/NwOp7n
BangNinja mod fix http://bit.ly/U2VgYU ł Breaky & Zyori in love http://bit.ly/PytPAi
Quickfix for linux and mac 2.6.11 http://bit.ly/Udqwmw ł Highest gpm in first 20 min calculated http://bit.ly/Qf4Q5P
Posting in a fabulous thread. Silhouette graph does not surprise me and neither does the bubbles graph.
Ophelia graph made me smile a little.
Thank you so much OsianII for going through all the trouble and sharing this with us. This is amazing work.
Last edited by Rayniac; 12-04-2012 at 03:46 PM.