Weighting of scores

Talk Discussion in 'BeerAdvocate Talk' started by drtth, Oct 18, 2015.

Thread Status:
Not open for further replies.
  1. drtth

    drtth Initiate (0) Nov 25, 2007 Pennsylvania
    In Memoriam

    A recent discussion of Pilsner Urquell triggered this thought. It is my understanding that currently all review scores count with the same weight towards the overall average for each beer, regardless of when those reviews were done. Using PU as the example, this means for PU that ratings from back in the day before PU switched to brown bottles and their cool shipment initiative began will have generally been on samples that were less than fresh and potentially not well protected from temperature and light before purchase. Many newer ratings will be on samples that were in closed packages and brown bottles that were shipped across the Atlantic in chilled condition.

    This suggests the idea that it might be desirable to have a differential weighting over time on the contribution of ratings to the overall rating score. If a beer has declined or improved in quality there could then be an additional gradual change in the overall average over time that reflects things such as better or worse quality control, etc. As a different example, I know of at least one locally brewed Pale Ale that did a hop change in the their recipe and many folks think it is now a much better beer than reflected by its overall score.

    This "moving average" would not require any new information beyond what already exists and would seem to provide a possible "smoothing" effect reflecting changes over time. It would also seem consistent with having the n small enough for new beers to be able to break into the Top 250 or into the year's best new beers list, etc.
     
    #1 drtth, Oct 18, 2015
    Last edited: Oct 18, 2015
    Ranbot, AlcahueteJ and Bitterbill like this.
  2. dbrauneis

    dbrauneis Grand High Pooh-Bah (8,071) Dec 8, 2007 North Carolina
    Mod Team BA4LYFE Society Pooh-Bah Trader

    I don't think that I'm a fan of this suggestion because it discounts (or reduces the impact of) reviews by longer term users because of things that you "assume" were issues such as type of bottle used but we cannot use serving type to make those decisions since that information was removed (or at least no longer taken in reviews a couple of years ago) - what if some or many of those users had their PU while in Europe or even Prague (I think I reviewed my PU at the PU Pub in Düsseldorf from draft).

    I think one effect of this change would be to remove better reviews from beers that have been around longer that maybe were pushing the envelope back in the day but are pretty tame now - I'm thinking Victory Storm King as an example here.
     
    yemenmocha likes this.
  3. PapaGoose03

    PapaGoose03 Grand High Pooh-Bah (6,057) May 30, 2005 Michigan
    BA4LYFE Society Pooh-Bah

    With almost 1,400 reviews for PU on record I don't think any new reviews have any significant effect on the overall score for this beer. But over time, if the newer reviews stay positive because of changes in the beer, packaging, etc. then the score will begin to reflect these better reviews. It's just a statistical oddity that comes with the law of large numbers at work here.
     
  4. drtth

    drtth Initiate (0) Nov 25, 2007 Pennsylvania
    In Memoriam

    Yes, good points both and with regard to the effect of old reviews, both better and worse would be equally affected so that the effects of something we might call the "Storm King envelope pushing" would be gradually reduced somewhat over time creating a more realistic view of its present status in the field. (Such dialog and exploration of the consequences of something like this weighting is part of the reason the OP was originally created as a comment rather than a feature request.)
     
    #4 drtth, Oct 19, 2015
    Last edited: Oct 19, 2015
  5. drtth

    drtth Initiate (0) Nov 25, 2007 Pennsylvania
    In Memoriam

    True enough, and if the older a review gets the less weight it has, then the more the law of large numbers would push the average towards a more current representation of the its status relative to others in the style. Especially when it is being compared/contrasted with other, newer versions of the style that do not have an extended history.
     
  6. Ranbot

    Ranbot Pooh-Bah (2,463) Nov 27, 2006 Pennsylvania
    Pooh-Bah

    Alternatively making it known that older reviews will be weighed less might encourage older, trusted users to revisit beers and keep their thoughts in the database more current. I know in many discussions about a specific beer users will contribute something based on their tasting/review notes, but then disclaim the statement saying it's been so long since they reviewed the beer that they should revisit it.
     
  7. yemenmocha

    yemenmocha Grand Pooh-Bah (4,116) Jun 18, 2002 Arizona
    Pooh-Bah

    A substantial percentage of new people to craft beer are younger, new to beer folks. May I suggest there's already disproportionate influence from newer drinkers who have only been doing this hobby for a few years. Maybe think carefully before giving them even more influence in this manner with scoring, and at the same time diminish the influence of reviews from those who have been doing this before BA ever existed. I can only imagine the how skewed the final score would be in @dbrauneis ' example of Victory Storm King or others. Or their reviews for British IPAs, bitters, milds, etc. Someone who got their exposure to craft beer first through American IPAs won't and maybe can't have the same appreciation as those who were serious enthusiasts and liked British IPAs before American IPA ever existed.

    I'm reminded of some thread posts over the years about how Fuller's ESB isn't really true to style, and so someone scored it low. Or some wondering why Pilsner Urquell is historically significant. It isn't uncommon to hear more "veteran" drinkers talk about how newer drinkers don't, or maybe can't, appreciate some of the more subtle styles. We learned about beer from books, visiting Europe, much of it pre-internet. That set of experiences and such make for a different reviewer who turned 21, discovered an "app" on the iphone to tick beers and then occasionally labors through a few sentences to do a review on BA. I'm oversimplifying but I wanted to note that these are debatable views, there are differences in those who just came lately to craft beer and the older folks, and to skew the scoring system in this way is decidedly taking one side in a major way.

    Heck, the PU thread is also a worthwhile example because it is somewhat disturbing to me and a few others regarding how many people mentioned examples that aren't even close to Pilsener Urquell in taste, or within style.
     
    #7 yemenmocha, Oct 20, 2015
    Last edited: Oct 20, 2015
    jmdrpi and dbrauneis like this.
  8. Scott17Taylor

    Scott17Taylor Initiate (0) Oct 28, 2013 Iowa
    Trader

    Agreed, and I'm one of the new young BA members. Devaluing an older review makes no sense, especially because many of the older members have a higher appreciation for the more subtle styles as mentioned above I want their opinion to come through.
     
    yemenmocha likes this.
  9. drtth

    drtth Initiate (0) Nov 25, 2007 Pennsylvania
    In Memoriam

    It's an interesting assumption that the old guard, many of whom are no longer around, used to write better, more perceptive, more knowledgable reviews when they were new to the site. It's also a testable one which may be worth exploring. My personal experience tells me that as my experience with writing reviews has grown, so has my ability to do a sensory evaluation and communicate it.

    It also tells me that newer site members have no monopoly on not knowing certain distinctions that are crucial to others. I've seen old reviews that obviously confused hop bitterness and skunking, and a fair number that equated hop bitterness with hop flavors when they are separate and separable things both on the palate and in the brewing process. I also recall being assured by old timers early on in my first year or so on the site that that there was a formula to tell us how bitter any beer would be. Indeed I was once venomously told how ignorant I was when trying to explain/understand how and why perceived bitterness seemed to be different from the number the formula gave.

    So I just keep thinking, just as some older quite accurate scores would begin to gradually lose weight over time so to would some older scores that reflect misunderstanding and inaccuracy and since those numbers, in theory, mostly cancel each other out now there wouldn't seem to be much lost except an effect that clouds rhe current numbers with the weight of the way things used to be compared to the way they are now. The world of flavorful beer has changed even in a time frame as short as five years and we can argue we have more and better beers to chose from than we did as little as 10 years ago and that has a changing effect over time.

    Two things a gradual decline with age in the weight of review scores in figuring the overall average would not do is to change the score we see when looking at the individual reviews or effect in any way the words we read and the value, learning and perceptiveness we find when reading those reviews.
     
    #9 drtth, Oct 20, 2015
    Last edited: Oct 20, 2015
Thread Status:
Not open for further replies.