Website: http://buildingreputation.com/
Original page: http://buildingreputation.com/writings/2009/08/ratings_bias_effects.html
The Cake is a Lie: Reputation, Facebook Apps, and "Consent" User Interfaces
Unlike the custom auto community, slams aren't judged by other poets but by members of the audience who are not poets and ideally have never attended a slam before. They do this to avoid bias and to focus on poetry that appeals "to the people." But the result is a scoring mechanism that rewards novelty over subtlety, and poets often feel frustrated by the system. It seems to me that one of the reasons the custom auto ratings were so healthy is that users were rating as active participants, based on their own relational sense of what is good, rather than on a novice or absolutist sense of what is good.
Books are interesting because within any genre the variety and number published is fairly large. I might be a mystery fan, but that covers a wide range. SF&F, romance, and mystery are all large, amorphous sets. So I might buy a top-rated or new book which, in retrospect, I consider a complete waste of money. Also, "fans" are notoriously brutal when their expectations aren't met. And it's in genre fiction (and movies) that you find some of the most intensely loyal fans. I'm curious as to how that affects ratings.
When your choices within a set are numerous and ill-defined, how does that impact your willingness to select a member within it? And if I never select it, I'm not going to rate it. Also, what kinds of personalities are more likely to rate a product? Many more people will purchase a product than will rate it. What does that tell us about ratings? How does the intensity of my feelings affect my willingness to express an opinion? My guess is there are far more 2 and 3 opinions which are not showing up as ratings. How likely am I to give a rating to something on which I'm lukewarm?
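To make the "missing 2s and 3s" guess concrete, here is a rough sketch of the arithmetic. Every number in it is made up for illustration (the `true_share` and `rate_prob` values are assumptions, not data from the post): it only shows how a per-star willingness to rate would reshape the distribution we actually see.

```python
def observed_distribution(true_share, rate_prob):
    """Share of each star value among the ratings that actually get submitted.

    true_share: fraction of all purchasers holding each opinion (1-5 stars).
    rate_prob:  probability that someone holding that opinion bothers to rate.
    """
    submitted = {star: true_share[star] * rate_prob[star] for star in true_share}
    total = sum(submitted.values())
    return {star: submitted[star] / total for star in submitted}


# Hypothetical numbers, for illustration only.
# Most purchasers are lukewarm (2-3 stars)...
true_share = {1: 0.10, 2: 0.25, 3: 0.30, 4: 0.20, 5: 0.15}
# ...but people with strong feelings are assumed far more likely to rate.
rate_prob = {1: 0.20, 2: 0.02, 3: 0.02, 4: 0.05, 5: 0.25}

observed = observed_distribution(true_share, rate_prob)
for star in sorted(observed):
    print(star, round(true_share[star], 3), round(observed[star], 3))
```

Under these assumed numbers the lukewarm opinions shrink to a small slice of the visible ratings even though they are the majority of purchasers, which is the effect I'm guessing at above.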
Just some thoughts.
I am not convinced that, looking at this aggregate distribution alone, we can ask whether a 5-point rating scale is needless and whether we could just use fav-or-not instead. The rating volume will always lean towards the best items (the Harry Potter books, for example), and those alone can skew this curve.
For other entities, the distribution might be a lot better and might make the 5-star scale meaningful.
You'd be surprised how many product designers just immediately assume that a 5-point scale will generate a straight-line distribution of scores. Data clearly show that's faulty. Actually, I have never, ever seen such a distribution.
I would think the ideal rating system would have maximum dispersion: 20% of each star rating, for instance: a line, rather than a curve. Wouldn't that be preferable?
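One way to put a number on "dispersion" is Shannon entropy, which is highest when every star value gets an equal share. This is just a sketch of that idea: the `j_shaped` counts are hypothetical, not taken from the post's data, and entropy is only one of several possible dispersion measures.

```python
import math


def entropy_bits(counts):
    """Shannon entropy (in bits) of a rating distribution given raw counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


flat = [20, 20, 20, 20, 20]    # 20% of each star rating
j_shaped = [15, 5, 5, 15, 60]  # hypothetical counts piled up at 5 stars

max_bits = math.log2(5)        # upper bound for a 5-point scale
print("flat:    ", round(entropy_bits(flat) / max_bits, 3))
print("j-shaped:", round(entropy_bits(j_shaped) / max_bits, 3))
```

The flat line reaches the maximum by construction. Whether that maximum is actually preferable is a separate question from how to measure it.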
1) The star ratings have self-selection bias and are not randomly sampled. This is correct as far as it goes. As to the statement that almost all reputation systems use randomness, I must disagree. By definition the internet introduces self-selection bias: each system is limited to people who have computers connected to the internet, who use a particular application on a particular site, and who opt in to participation. Randomness does play several useful roles, even in this context, as detailed in our book at http://buildingreputation.com/doku.php?id=chapt...
2) I don't see any reason to conclude that a flat distribution is always the most desirable. People are not random. People have tastes and opinions. There is no data to suggest that taste or opinion is evenly distributed. Don't polls (that are properly randomly sampled) usually have an uneven distribution of results? Otherwise we wouldn't have so many polls. :-)
Randy