Decide on core principles against which to judge breaking changes #12
Comments
I strongly support this. I will have my own personal opinion about the |
Yes, exactly - currently we have well-meaning people with thoughtful but differing opinions, and those disagreements cannot be resolved. That is exactly the kind of situation guidelines aim to prevent! |
The
I'm sorry, but I strongly believe that this is a rabbit hole of getting nothing done. "Well-meaning people will have thoughtful but differing opinions" about proposed guidelines (and their application) as well, asking for higher-level principles, and so on ad infinitum. It's untrue that differing opinions about … cannot be resolved. @santiweight @tomjaguarpaw, you are most welcome to come up with a set of guidelines and their implications for |
I'm sorry, but I don't buy this at all... Understand, of course, that I value your work and contribution! What if I want to, say, rewrite the …? But worse yet, I get the impression from the way you're discussing this that there is no plan for … What about namespace changes? Can they happen at any time? This is crucial information not just for a proposal but for
Yes, but they are voted on once. If there are no voted-on guidelines, then we will persistently rehash the same backwards-compatibility/extreme-breakage discussions and never be able to dismiss arguments as "not within …". I'll cook up some guidelines to discuss, but I think the most important thing here is to have some values and process, even if they are unilaterally constructed by the CLC. A decision of almost any kind would be better than a lack of principles. Btw, for related work, see the "GHC evolution principles" proposal that Richard Eisenberg posted recently. The specific intention being:
|
Usually each release of GHC is accompanied by a major release of base.
I'm not sure what you mean by "namespace changes". Potentially they can happen in every major release of base.
Please refer to https://github.com/haskell/core-libraries-committee/blob/main/PROPOSALS.md for guidance on how to raise a proposal and what is in scope. |
Thank you for the response :) I am not clear that there is any place where any of this information is outlined. I am only aware of the fact that … You say that each release of GHC (which I believe to be every ~6 months) is accompanied by a release of base. I am confused - the proposals readme doesn't say anything about what is in scope, according to my reading (I've read it through a few times to be sure). I'm not trying to be facetious, but how do I know whether, for example, exporting
Is your point that any change related to the core libraries is a reasonable proposal? |
Major releases of
Such a proposal would be in scope for the CLC.
The paragraph you just quoted gives an exact description of what is in scope: pretty much any non-trivial change to base. |
How about a documentation principle, to include counter-examples for type classes: https://blog.functorial.com/posts/2015-12-06-Counterexamples.html |
@Icelandjack documentation is generally out of scope for CLC. |
(Not a CLC member) I'd like to share my view on the subject as a maintainer of multiple open-source Haskell packages. I don't represent any group, this is just my vision and personal opinion. Maybe some can empathise. I understand the existing discussion as follows (correct me if I'm wrong):
I think Scope is fine. It makes sense to me to have a separate group of experienced people consider all changes to the standard library, as they affect all Haskell developers. However, I have some concerns regarding Outcome. I think this approach is flawed. It means that the result depends not on some objective guiding principles but on the personal preferences of the currently elected members. It's not clear what the preferences of those people are (you can deduce some logic from previous decisions or from knowing such people, but it's a lot of work and different people can infer different views). In other words, if you open a proposal at a different time with different members, you might have better luck getting your proposal accepted. Another point that concerns me is that the existing approach puts CLC members in a position of uncontrolled power. They (in theory) can accept or reject changes based on their mood, personal preferences, or maybe even some personal qualities of the person who opens a proposal. I'm not at any moment saying that the CLC has people with such harmful views. But why allow such a possibly dangerous situation to occur at all if it's extremely easy to prevent? Isn't "making illegal states unrepresentable" one of the Haskell mottos? Therefore, I believe, it's extremely important to write down explicitly the goals behind the … On a different note, I would like to express my personal views on breaking changes and breaking backwards compatibility. I think that with the current state of the Haskell ecosystem, breaking changes must not be allowed unless:
or
The main goal of the Haskell Foundation is to "broaden the adoption of Haskell". CLC is affiliated with HF. From the website:
In my personal view, introducing breaking changes decreases adoption, so I don't understand how much this affiliation means in practice. More importantly to me, almost all people maintain Haskell libraries for free in their free time. By accepting breaking changes, you ask volunteers to do even more work, on top of their generous contribution to the ecosystem, if they want to use newer GHC versions. I would like to emphasise the following: when maintainers constantly need to fix tedious issues for no apparent reason, they can easily burn out, leave the community and stop improving the ecosystem. Again, if |
Would it be possible to require that a breaking change includes a migration strategy and patches for the main projects that are affected? |
This is not true. It has been the case for the past few years, yes, but mostly due to happenstance. Every time I bring up tying the base version to GHC in the way you propose in the #ghc channel, I've been informed this is not desirable, because base (in principle) should evolve separately from GHC, and has done so in the past. |
But |
There is at least one guiding policy for breaking changes, which is the three-release policy: https://gitlab.haskell.org/haskell/prime/-/wikis/libraries/3-release-policy "Changes to basic libraries are planned and implemented so that, at any time, it is possible to write code that works with the latest three releases of GHC and base, without resorting to CPP, and without causing warnings even when compiled with -Wall. Such code may not necessarily be idiomatic, though." This effectively captures what has been requested here: that breaking changes come with a migration strategy. The three-release policy has been a key part of just about all CLC decisions since the committee's formation, and it may be worth placing it directly in this repo, or highlighting it more explicitly. |
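To make the constraint concrete, here is a small illustration (the module name is invented, and the version is from memory): Data.List.singleton only appeared in base 4.15, so a package honouring the three-release policy at that time could not import it yet and would instead carry a local definition, staying CPP-free and warning-free on the two older compiler series.

```haskell
-- Illustration only: a local stand-in for a function that is newer than the
-- three-release compatibility window allows. Once the third release shipping
-- Data.List.singleton is out, this copy can be deleted in favour of the
-- base export.
module ListCompat (singleton) where

singleton :: a -> [a]
singleton x = [x]
```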
Context setting: I am not a member of the CLC. Though I am a member of the Haskell Foundation board, I write purely in my personal (and professional) capacity as an interested Haskeller. I continue to support the proposal suggesting that broad guidelines be articulated for the evolution of base. I think such guidelines serve many goals:
However, I do not think that such guidelines will remove the (in my opinion) essential human element: the CLC is composed of individuals with strengths and weaknesses, and these individuals will vote according to inscrutable internal processes (as we all do in all of our decision making). This means that @chshersh's very valid concern -- that the results of a proposal may depend on, say, the timing of the proposal -- will not be negated by the introduction of guidelines. The guidelines will help, to be sure -- and I support writing them! -- but they will not "fix" this problem. Instead, I think the best we can do is to make the selection process of these individuals as transparent as possible. The repo currently has no guidelines about how the CLC membership is selected and how it is refreshed over time. I think this is an oversight that should be corrected. About Haskell Foundation affiliation: I see that the CLC is affiliated (see https://haskell.foundation/affiliates/), but I do not see that the CLC has met all the requirements for affiliation, as listed at https://haskell.foundation/affiliates/about/. In particular, I see no code of conduct or information about refreshing CLC membership. (Maybe there are other gaps, too -- I have not checked closely.) It looks like there is some work to do here. Timing: The CLC reboot is still very fresh. The housekeeping details I'm advocating for here (guidelines for evolution, rules for membership, code of conduct, etc.) are important, but perhaps not as exciting as actually improving base. Thanks, all, for a great conversation here -- it's wonderful to have a place to discuss these important issues, and I am thus very grateful for the work in rebooting the CLC. |
As long as the committee's membership is not too volatile, I see no problem in that. A CLC decision, once taken, can (and should be able to) be overturned later. There should be no ban on reconsidering a proposal, say, 3 years later, as the basis -- the state of base -- will have changed by then. There might be merit to the proposal then, even if it was found unconvincing when first proposed. Of course, reopening a proposal, say, 6 months after it was rejected should likely not be considered, as it's unlikely that the fundamental basis has changed enough to merit a different outcome. |
@chshersh As a fellow maintainer I do empathise with your perspective. I suspect that, as a maintainer of an alternative prelude, you are particularly exposed to breaking changes in base. While I do feel that breaking changes need to be made carefully, I'm already quite happy that proposals involving them require an impact analysis:
I remember one proposal a few years ago that was approved without an impact analysis, which suggested removing the … Apart from that, my experience with breaking changes in … I'm very optimistic that the new CLC and the new proposal process will lead to these necessary improvements, and that the CLC and the Haskell ecosystem can manage the breaking changes involved. |
The thing is, however, that the CLC is responsible for a minority of breakages. Almost every release of GHC breaks type checking in intricate ways. Almost every version of GHC changes … I'm happy to embrace a principle saying that if there are no breaking changes in GHC,
At the risk of being a broken record, the essential step is to decouple base from GHC. The total amount of breakage is not the issue; it's that it comes in lockstep that makes it more annoying than it needs to be, and prevents us from making
What about |
@Ericson2314 Are there any concrete proposals on how to decouple base from GHC releases?
This ^ seems to be the only practical proposal so far. Beyond that and documenting thoroughly all breaking changes, I think finding overarching principles for the evolution of base is mostly a distraction. The research half of Haskell will always push for research-oriented changes to GHC and base will follow in a piecemeal fashion. |
(Sorry for piecemeal answers, I'm doing too many things at once.)
That's not quite so, there are escape hatches in place. As outlined in |
@ocramz Yes. You can do it today with enough stomach for CPP -- cf. how glibc supports many different syscall ABIs, though hopefully not that bad. I think it could well be worth it to start with that for anything that doesn't cleanly separate into a GHC-specific package for GHC to depend on. See what @Kleidukos wrote in https://gitlab.haskell.org/ghc/ghc/-/issues/20647 for what such a split might look like. Longer term, we would want to use Backpack, and perhaps my idea in https://gitlab.haskell.org/ghc/ghc/-/wikis/Rehabilitating-Orphans-with-Order-theory, to make this less annoying. |
I propose: Reasons for breaking changes to
|
@hasufell: Do I understand your criteria right, that AMP, FTP and SMP all fall under "trying out alternative API approaches" (I guess), and therefore wouldn't be possible to do? EDIT: I think that some voices are still not settled with these changes and are still frustrated. I'd like to see changes of AMP scale still be possible in the future (not easy, just possible). |
AMP would probably fall under 2. correctness improvements, where I explicitly include math/CT correctness topics. |
I can only give half-baked thoughts currently (job search) - though the big guns of the Haskell world are now here and my opinion will carry less weight (a good thing)! I think something that is sorely missed in this discussion is an acknowledgment that the current severe uproar is really an indication that Haskell looks like a nightmare in production. I agree with many comments that "Eq of No Neq is not a big change". However, the overarching issue is the one that @chshersh raised: a prospective business faces the very serious and severe problem that they cannot predict whether their code will be easily updatable to modern GHC versions. The issue with Eq of no Neq is not that the change itself causes churn or no churn - it's that no one can say how much churn maintainers and businesspeople will face in, say, the next year. It is seriously completely unpredictable, and subject to the whims of 6 people, even if those people are awesome! While there might be an answer, it is not in writing in a public location, and that simply won't cut it when millions of dollars are on the line. I for one want to use Haskell at my day job, and while it is frustrating, the Bay Area and many other cultures are safety-first, because product stability and predictable productivity is the name of the game. I have personally faced the "use Haskell at work" conversation with an experienced Haskeller and self-exiled contributor who was pretty much terrified of using Haskell in production. The reality of the matter, whether you think it's reasonable or not, is that many Haskellers and non-Haskellers alike have no confidence that the decision-making process is global or reasonable. While that is a statement of fear, economic recessions have occurred over similar amounts of uncertainty... I personally would advocate for:
|
The barrier for changes also needs to be raised a lot more. I believe a migration "strategy" won't cut it if we want people to have confidence in Haskell. The issue is not whether or not there is a strategy, but whether "I will have to spend my weekend updating my 50 libraries", which is quite an awful user experience. If you advocate for a change to … In other words - there is a very interesting technical PL problem here of how to avoid having downstream users pay the cost of trivial (non-bugfixing) migrations, such as … A good way, imo, to think about this is to think of
I love Haskell, and will stay because I love it, but if I were on the fence or outside of Haskell - I would delete the Haskell app extremely quickly if I felt that way! |
For the purposes of looking at Simon's comment with "scientific control" I think it would also be beneficial to gather evidence of the cost of upgrading in other ecosystems. How many person years are required to upgrade Java, Go, C++, Python, for example? |
Upgrading the C++ toolchain probably takes a lot more effort overall, but there's a lot more C++ code of course. I don't have any actual data I can share unfortunately. I realised I should explain a little more why release notes and warnings don't end up being as useful as you might think. The larger the codebase, the more you want to centralise the job of maintaining and upgrading the toolchain and libraries. So it'll be one person or a small number of people doing most of the work. Not only are there a lot of release notes and announcements to read during an upgrade, but since the person doing the upgrade isn't the person who wrote the application code, they are likely to miss important things or not realise the implications of changes mentioned in the release notes anyway. Therefore instead of trying to proactively fix things, we rely heavily on automation: turn on -Wall -Werror and use HLint extensively, fix compile errors, and hope that batteries of tests and benchmarks will catch anything else that gets through. Project owners would normally get a chance to check that an upgrade is OK, but typically they'll just look at the test/benchmark results. If your tests and benchmarks aren't catching problems, then they're not good enough! The changes that worry me the most are things like this: https://www.haskell.org/ghc/blog/20210607-the-keepAlive-story.html The compiler, warnings, and HLint are not going to catch that. If we're lucky the benchmarks will catch the performance regression before it gets into production, but even if the benchmarks catch it, tracking down the cause of the regression will be an adventure for somebody. The only way around that is for someone to be reading the release notes carefully and proactively acting on it - fortunately I know about this one, so when we do the upgrade to 9.0+ I'm going to have to go around and change all the withForeignPtrs to unsafeWithForeignPtrs. Here's hoping I remember :) |
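For a sense of what that migration looks like in code (the module and function names below are invented, and this assumes a Cabal-style build where the MIN_VERSION_base macro is defined): a small shim can select unsafeWithForeignPtr, added in base 4.15, when it is available, and fall back to withForeignPtr otherwise. This is only a sketch - unsafeWithForeignPtr is only sound when the wrapped action is guaranteed not to diverge, so each call site still needs the kind of manual review described above.

```haskell
{-# LANGUAGE CPP #-}
-- Sketch of a compatibility shim; callers use withFP and never touch CPP.
module ForeignPtrCompat (withFP) where

import Foreign.ForeignPtr (ForeignPtr)
import Foreign.Ptr (Ptr)
#if MIN_VERSION_base(4,15,0)
import GHC.ForeignPtr (unsafeWithForeignPtr)
#else
import Foreign.ForeignPtr (withForeignPtr)
#endif

-- Only for actions known to terminate promptly; otherwise the plain
-- withForeignPtr is the right choice on base >= 4.15 despite the cost.
withFP :: ForeignPtr a -> (Ptr a -> IO b) -> IO b
#if MIN_VERSION_base(4,15,0)
withFP = unsafeWithForeignPtr
#else
withFP = withForeignPtr
#endif
```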
Yes it's the semantic changes that scare me - and I do read the release notes. These are the ones that need careful consideration and planning. They're also often the ones - like the noted one - we can't really avoid. We should focus our planning and solutions more on those.
|
I would recommend @simonmar's team and others take advantage of things like … At Obsidian, we were in a bit of a special position in that GHCJS is usually the limiting factor preventing upgrades, blocking us after the ecosystem has by and large resolved most regular platform-agnostic issues. But we try to take an analogous approach, separating our Nixpkgs and GHC upgrades so we are never upgrading both at the same time. We sometimes get stuck on weirder errors than the intended breakage, and this "tick tock" alternation of what we are upgrading also helps troubleshooting, and the "upgrade experiment" is better controlled. In particular, the worst part about upgrades is not the total time investment, but that the work is so nonparallel: when fixing failures, one typically only gets a few failures at a time. With … Similarly, we should:
Here's the most important thing to remember: the closer large organizations are to the latest release, the better incentivized they are to help out with upstream FOSS development, as any contributions they make will trickle back to them and benefit them sooner. We want to get that upgrade latency to an absolute minimum, independent of the throughput of breaking changes. |
I've worked at three Haskell startups. I more or less agree with Simon's overall sentiment, and the observation that warnings don't save much time (the exception being when they can contain more information than an error would); certainly we won't notice warnings in dependencies. But the particulars sound very different from my experience at much smaller companies, even though e.g. we've been running a fork of ghc with patches. At hasura we have 400-500 modules primarily in a single mono repo. We're in the process of upgrading to 9.2 from 8.10; upgrading our own code is probably a day's work, while fixing dependencies and (in particular) getting those changes upstreamed, temporarily vendoring and tracking PRs, etc... is a soul-crushing and thankless (like, being actually scolded) slog. When we're done we'll need to have a discussion about whether our upgrade will be blocked on hls, and then maybe try to help there (9.0 is still not quite fully supported iirc). I think in general the breaking changes to ghc and core libs are really thoughtful, well-considered, and reasonable. But I think the principle I'd propose is: if you break it, fix hackage. If that's too difficult to do by hand, then write a script (if that's too difficult, write a tool). If maintainers don't want to merge a change to support an unreleased ghc version, figure out how to automate staging PRs from head.hackage. I don't think that's too much to aspire to (at least for "small" breaking changes, and for the popular/active subset of hackage). EDIT: actually, one benefit of changes that come with a warning the version prior is that it does allow you to open a PR ahead of time without maintainers yelling at you. But again, we're only going to start fixing other people's libraries when we go to upgrade and find they break. |
That's ... a bit troubling. If you don't read release notes until you notice something "doesn't work", that means you could miss important but subtle changes in behavior. Ouch! |
Personally I like to address future issues on an ongoing basis, as they come up, rather than big-bang. So warnings are helpful. And build and maintain compatibility with future releases to which I'm not necessarily immediately ready to switch. Approaches vary... |
@vdukhovni, for reference and context, what kind of projects and code bases do you work on? |
It usually takes us 6 months to upgrade to a newer GHC version at Standard Chartered. We start building our codebase from the first released version of each new GHC (e.g. …). We are using GHC 8.10.7 at the moment. We started the process of upgrading to GHC 9.0 in Feb 2021, and now we're migrating to 9.2.1 instead. However, we're waiting for the dependencies to catch up before finishing the migration, so it'll probably take about a year in total to move from 8.10 to 9.2 for us. This migration work is mostly done by a single person, and they don't work only on this 🙂 It's usually "make some progress -> hit a problem -> work on other stuff while waiting for the solution -> repeat". A few details about our project (in a team that uses GHC/Haskell and not Mu/Haskell):
So, usually, the main problems that delay our GHC upgrades are:
I'm all for contributing patches upstream ⬆️ However, the bank doesn't allow us to contribute to OSS projects during work time, and patching almost 500 dependencies by ourselves each time is not a sustainable option. Hence the waiting. But if GHC and other libraries had fewer breaking changes, this could significantly speed up our upgrades and the process of testing GHC versions earlier. |
I understand that this is a business/management decision, likely made at a much higher level than your unit, but to be fair, it sounds like a particularly uneconomic one, given that the lack of upstreaming makes you accumulate more and more patches to maintain yourselves down the line. |
@szabi Yeah, I agree that it's a suboptimal position to be in. There were several attempts to change the situation. But they went nowhere and this is not something I can change, unfortunately 😞 |
I'm hearing here that several companies are doing real work to patch dependencies in the process of upgrading. This work sounds duplicated across the companies. Would it make sense to somehow work together to avoid this duplication? One model for this is to have, say, a pool of money that gets spilled out at every GHC release to get some pre-defined portion of our ecosystem up-to-date. Would that be of general interest? Or maybe the overhead of coordinating that drowns out the improvement to efficiency -- not sure. |
In the OCaml community, we had a problem of "wait for dependencies to upgrade", and we have improved the issue a lot thanks to work that happens during the "new compiler release process" (from branching to release):
On most releases we are in a good state where most of the ecosystem is compatible with the new release essentially on the day that the release is officially shipped -- instead of individual packages having to wait for weeks or months as we did in the past. |
[niche OSS project with limited resources] The latter, after the original code author / Haskell guru moved on. We use a GHC that is 5+ years old for a Xen toolstack with ongoing minor enhancements. No GHC patches. Source: https://github.com/OpenXT/manager/tree/master/xenmgr As the cost of a GHC uprev has remained stable at infinity for a small team without a Haskell expert, alternatives like moving to Rust have slowly become more practical. |
GHC Haskell has it too:
Kind of. See https://gitlab.haskell.org/ghc/head.hackage/-/graphs/master However, as the … How that work is funded: I don't know. |
We ask for mergeable patches that are actually submitted to the various upstream projects. I believe that most maintainers are happy to merge the patches proposed by @kit-ty-kate. (I forgot to mention it, but this work (in particular, enabling people to test proposed patches/changes or debug issues) requires a specific opam-repository overlay, just like head.hackage.) |
I would add that for library authors and maintainers, compat warnings are a very good thing. In particular, if there's a full release cycle between warning and breakage, there are high odds that either you or another library consumer will notice the warnings and attend to them early, rather than waiting until a confirmed breakage. |
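For illustration, the library-side mechanics of that pattern are simple (the module and names below are made up): the old binding keeps compiling for a full release cycle, while a DEPRECATED pragma tells every use site what is coming.

```haskell
module Data.Example
  ( newName
  , oldName  -- kept around for one more major release
  ) where

-- The replacement API.
newName :: Int -> Int
newName = (+ 1)

-- The old binding still works, but every use site now gets a compile-time
-- warning, giving maintainers a full cycle to migrate before the removal.
{-# DEPRECATED oldName "Use newName instead; oldName will be removed in the next major release" #-}
oldName :: Int -> Int
oldName = newName
```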
@goldfirere I don't think duplication is a big issue; once a ghc is released we can open PRs. And the impression I get from this thread and my own experience is everyone waits for someone else to fix dependencies :) The duplication is probably between head.hackage and the patches that actually make it to upstream. I'm very keen to learn more about the system OCaml has, that @gasche describes. It seems to address the two pain points:
And I like the idea of breaking changes being decided in coordination with a team that can be consulted and say "this would break things in a way that would require X, Y and Z" or "it would be easy for us to migrate the ecosystem for this change". It's a mechanism of accountability. And at the end of the day, if libraries (and as a consequence, applications) are lagging way behind ghc or core libs then presumably ghc itself suffers, unable to judge whether changes actually benefit users (wrt performance, compile time, that kind of thing). |
I bet a lot of people just bump the stackage version and run CI without reading all the release notes for all the packages that have changed in the upgrade. That's all we're doing here. I'll also say that if your CI isn't detecting changes in behavior then you already have problems waiting to happen. One thing I've learned doing this over the years is to rely on humans as little as possible! |
For a start (and probably the most important), OCaml has been very conservative in avoiding breaking changes, unless deemed concerning an internal detail. One unfortunate part of the ecosystem that suffers from breakage are the PPX preprocessors, since they rely on internal representations of the compiler. For this, they have chosen an “upgrade the world” approach (see https://github.com/ocaml-ppx/ppxlib/wiki/The-State-of-the-PPX-Transition, https://discuss.ocaml.org/t/ppxlib-0-22-an-update-on-the-state-of-ppx/7296/9). |
The last message here is six months old. The topic of stability has been picked up by the Haskell Stability Working Group. The raging storm around … I'd like to close this issue soon; it has accumulated too many different threads to be actionable. If there are any outstanding proposals left which you'd like to pursue, feel free to open new discussions. |
I support closing this issue also. Thanks @Bodigrim |
Let's decide on some principles that guide the design of base and other CLC-maintained libraries, and let's write them down in a central location.

As it stands, well-thought-out and considered proposals such as Joachim Breitner's "no /= in Eq" proposal get accidentally hijacked by high-level discussion about what level of breakage is sufficient to make a proposal untenable. For example, within that thread alone, the proposal author and I disagree on the cost of such a change. Joachim argues that the change will cause minimal issues, whereas I argue that while the breakage is relatively small, the change will cause confusion and frustration, especially for less experienced Haskellers and time-limited library maintainers.

Similar discussion occurred in #3 around what a typeclass should contain. In particular, I see discussion around which methods belong in a typeclass: for example, many of the typeclasses contain default functions that are included for the sake of runtime efficiency. However, that principle seems nebulous and not clearly defined. For example, elem and maximum, which require an Eq and an Ord constraint respectively, are user-definable in the Foldable class. However, ^^ and ^, which I would imagine could also have their own efficient implementations, are not defined on any of their required classes, and therefore cannot be given speedier type-specific definitions...

Just to reiterate, this is not specific to any singular function or class (perhaps ^^ and ^ are actually consistent!). The point of this proposal is that, as far as I know, we don't have a document or central discussion that outlines the parameters that make a proposal worth or not worth accepting. If we don't have a central statement of such simple principles, work is very hard to get done, because everyone has a slightly different perspective on what the right balance of breakage-for-improvement vs backwards-compatibility is.
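To illustrate the asymmetry described above with a made-up container (a sketch, not a proposal): because minimum is a method of Foldable, an instance can override the default linear-time definition, whereas ^ is a plain function in base, so no type can supply a faster version of it.

```haskell
module MinBagExample where

-- Sketch only: a container that keeps its least element at the front
-- (an invariant its constructors would maintain; they are omitted here).
data MinBag a = MinBag a [a]

instance Foldable MinBag where
  foldr f z (MinBag x xs) = f x (foldr f z xs)
  minimum (MinBag m _) = m   -- overrides the default linear scan

-- There is no analogous hook for (^): it is an ordinary function, not a
-- class method, so this kind of per-type specialisation is impossible.
```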