-
Notifications
You must be signed in to change notification settings - Fork 40
How do we express state revenue and production rankings? #1365
Comments
Link to a notes doc after reviewing the California page. I don't actually get to your question about negative numbers until #6: https://docs.google.com/document/d/1Yju7CYTc90SFg6KaE0vJIgJ-84qWJIZsXRVnellXRW0/edit |
This is still underway as part of the state profile updates. |
My thoughts on this have evolved. From what I've read around the internets, it sounds like the best practice for negative contributions to a positive total (or vice-versa) are not typically expressed as a numeric percentage, because they don't really make sense. I can dig into my history a bit to find some references, but I think the thing to do here is to update our state rankings and percentages to be |
While investigating #1397, I queried the db for state revenue rankings with a % > 100 and found some other offenders (note: these values are truncated for legibility, so the ones with % = 100 are actually between 100 and 101):
There are some values that have different signs than the annual total (for instance, in the last row). Then there are some others that have both negative numbers, but the state number is larger (proportionally) than the annual total. This makes me think that we just shouldn't show percentages for either negative state or totals. But what about when the state total is greater than the nationwide total (and, presumably, there were other states with negative revenues to bring down the nationwide total)? Thoughts, @RyanSibley, @coreycaitlin, @meiqimichelle, @mentastc? Note: these numbers are from the new revenue data that Nathan sent us, so they're provisional; but there will certainly be instances of each of these in the final data. |
Here are the edge cases we need to decide how to represent, if the percentages are even worth listing:
|
Nathan notes that we should not say "percent of US total revenue" for a given commodity, because we're only working with federal revenue, which accounts for about 25% of all revenue generated by the extractives industry (including private). |
Is this issue mostly resolved now that we have Nathan's categories, or is this something different? And/or, it seems like we're moving away from showing percentages along with many of our revenue/production values, right? |
Nathan's commodities groups solve most of the weird-data issues here. Also, I think we're moving away from showing year-over-year percentages, and we may not wind up showing percentages at all. If we do keep some form of percentages (i.e. "In 2013, federal revenue from coal extraction in Utah accounted for 8 percent of all federal revenue from coal extraction on federal lands"...which is a mouthful), I think there will at least be fewer edge cases. |
Closing this out in favor of #1465, since we'll mostly only be using percentages or rankings in intro sections. |
While building out state prototypes (#1355) and our new data "infrastructure" (#1353) I noticed one issue with state rankings: they might be a little complicated to describe if they're derived from state and offshore region data. For instance, in 2013 California was the "top" state producer of Fuel Oil at a whopping $68, but the national total in that year was actually negative $73k, with Western Gulf of Mexico contributing the overwhelming majority.
So there are a couple of questions here:
For 2. I was thinking that we would derive a percentage by taking the absolute values of the regional and national totals, and dividing those. So
abs(68) / abs(-73725) * 100
yields .09%. This accounts for both the edge cases of states with negative revenue in a positive national total and vice-versa.The text was updated successfully, but these errors were encountered: