Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

No definition of what constitutes an A, AA, or AAA conformance Level #3889

Open
Helixopp opened this issue Jun 2, 2024 · 115 comments
Open

No definition of what constitutes an A, AA, or AAA conformance Level #3889

Helixopp opened this issue Jun 2, 2024 · 115 comments
Labels
a11y-tracker Group bringing to attention of a11y, or tracked by the a11y Group but not needing response. Conformance WCAG 2.x Understanding

Comments

@Helixopp
Copy link

Helixopp commented Jun 2, 2024

There does not appear to be any clear definition of what parameters a success criteria must meet in order to qualify for any particular conformance level.

In order to be considered Conformance Level A does a success criteria have meet a certain level of severity, as in if not met it will severely adversely impact the user.

Likewise is an AA Conformance Level less severe/impactful? And so on.

Or is it that Level A Conformance is attributed to easy fixes, slightly more complicated issues are deemed AA Level, etc.?

Accessibility consulting companies are assigning severity levels to WCAG Success Criteria (critical, moderate, serious, etc.)which seem to be largely based on conformance level.

This is extremely subjective, and dangerously misleading. Lack of transparency of what criteria are required to be met for each conformance level is partly to blame.

If such information does exist it is too difficult to find.

@Helixopp Helixopp changed the title No definition of what constitutes an A, AA, or AAA conformance criteria No definition of what constitutes an A, AA, or AAA conformance Level Jun 2, 2024
@patrickhlauke
Copy link
Member

patrickhlauke commented Jun 2, 2024

From memory, the decision of which level was assigned to each SC relates back to the old concept of Priority from WCAG 1.0 https://www.w3.org/TR/WAI-WEBCONTENT/#priorities but it does indeed seem that this has not been ported as such to the WCAG 2.x documents.

I do think that your question partly mixes up different concepts though: the level for an SC has been set, it is what is is. and WCAG is binary ... you either pass or fail an SC, and in order to claim conformance to a particular level, you as developer must address all SCs at that level (and any level underneath that chosen level, i.e. for AA conformance need to fully address all A and AA). there's no vagueness/wiggle room there.

However, it's clear that beyond the binary pass/fail, there are always nuances in how bad a failure actually is in the context of a page/site as a whole. That is something that can't be absolutely and unambiguously defined in the spec, as it's so heavily dependent on context. And THAT is usually what agencies/testers try to indicate through their own additional severity rankings. It still doesn't change anything in terms of the binary pass/fail and the need to satisfy all SCs for your chosen conformance level, but is a subjective extra categorisation that testers give solely to help developers prioritise remediation. They still have to remediate everything that fails, but it acknowledges that some fixes will likely be easier or more impactful or important. I don't think that part can ever be made unambiguous and purely objective.

Accessibility consulting companies are assigning severity levels to WCAG Success Criteria (critical, moderate, serious, etc.)which seem to be largely based on conformance level.

Can't speak to what other shops do, but we (TetraLogical) base our severity levels primarily on the real-world impact a fail has on actual users, regardless of SC level.

Having said all that, agree it would be nice to include a bit of high-level rationale about A/AA/AAA in the non-normative explanatory documentation to just give a feel for why SCs were slotted into the different levels.

@mraccess77
Copy link

It was my understanding that Level A was for items that can't be worked around even with accessibility features or assistive technology and that they are barriers even with other common technologies being present. Some items such as contrast were slotted AA because at the time there were ways to use custom stylesheets for instance to change the text color or contrast and for SC 1.4.4 use of screen magnification software to enlarge the text. For other SC like 1.2.3 and 1.2.5 (and others) the SC at A and AA built on each other to provide layers of access to different degrees.

AAA items are ones that may not be able to be met in all situations so they had to be moved to AAA.

@alastc
Copy link
Contributor

alastc commented Jun 2, 2024

There is an explanation in the understanding docs here: https://www.w3.org/WAI/WCAG22/Understanding/conformance#levels

As Patrick mentioned, it doesn't (and cannot) take into account the context of the issue, so people assigning severity should use other factors.

I appreciate is is hard to find, but where would you have looked for it in the first place? Perhaps we can use that information to add a link in a useful place.

@Helixopp
Copy link
Author

Helixopp commented Jun 2, 2024 via email

@Helixopp
Copy link
Author

Helixopp commented Jun 2, 2024 via email

@Helixopp
Copy link
Author

Helixopp commented Jun 2, 2024 via email

@mraccess77
Copy link

From the understanding conformance document

The success criteria were assigned to one of the three levels of conformance by the working group after taking into consideration a wide range of interacting issues. Some of the common factors evaluated when setting the level included:

  • whether the success criterion is essential (in other words, if the success criterion isn't met, then even assistive technology can't make content accessible)
  • whether it is possible to satisfy the success criterion for all Web sites and types of content that the success criterion would apply to (e.g., different topics, types of content, types of Web technology)
  • whether the success criterion requires skills that could reasonably be achieved by the content creators (that is, the knowledge and skill to meet the success criterion could be acquired in a week's training or less)
  • whether the success criterion would impose limits on the "look & feel" and/or function of the Web page. (limits on function, presentation, freedom of expression, design or aesthetic that the success criterion might place on authors)
  • whether there are no workarounds if the success criterion is not met

@Helixopp
Copy link
Author

Helixopp commented Jun 2, 2024 via email

@patrickhlauke
Copy link
Member

FWIW, I agree with @Helixopp that there should at least be a very broad definition for what constitutes a Level A, AA, AAA SC - along similar lines as the broad WCAG 1.0 priority definitions, as currently that's not properly spelled out.

(also, hope you don't mind @Helixopp, but I edited your email follow-up comments to remove the previous messages that they were replying to)

@Helixopp
Copy link
Author

Helixopp commented Jun 2, 2024 via email

@GreggVan
Copy link

GreggVan commented Jun 3, 2024

There is a very clear definition of what goes in each level.
The level assigned to each provision is the level that the working group reached consensus on for that SC.
That is the only definition of the level of each SC.

If you are looking for a formula - or set of checkboxes -- you won't find them. There were many different factors that went into each decision.

I can't speak definitively for 2.1 and 2.2 but for 2.0 -- in each case (almost every case) there were those that wanted to put a provision into another level than where it ended up. In the end - it was the level that everyone could reach a consensus on. That everyone could accept after sometime short and sometimes long debate and discussion(s).

There was even debate about having two or three levels. At one point it looked like a majority wanted to go to 2 levels. But when asked it turned out half wanted to put level AA into level A and half wanted to put Level AA into level AAA. In the end -- what people could agree on was the three levels with the SC distributed as they were.

I won't repeat all the different considerations -- since many of the are listed already. But those are just a subset of the 30 or 40 different things that people brought up in arguing for an SC to be in one level vs another.

@Helixopp
Copy link
Author

Helixopp commented Jun 3, 2024 via email

@patrickhlauke
Copy link
Member

From the understanding conformance document

The success criteria were assigned to one of the three levels of conformance by the working group after taking into consideration a wide range of interacting issues. Some of the common factors evaluated when setting the level included:

  • whether the success criterion is essential (in other words, if the success criterion isn't met, then even assistive technology can't make content accessible)
  • whether it is possible to satisfy the success criterion for all Web sites and types of content that the success criterion would apply to (e.g., different topics, types of content, types of Web technology)
  • whether the success criterion requires skills that could reasonably be achieved by the content creators (that is, the knowledge and skill to meet the success criterion could be acquired in a week's training or less)
  • whether the success criterion would impose limits on the "look & feel" and/or function of the Web page. (limits on function, presentation, freedom of expression, design or aesthetic that the success criterion might place on authors)
  • whether there are no workarounds if the success criterion is not met

Something like the above, combined with an explanation that - in naive terms - "Based on these considerations, Level A SCs are more critical, Level AA slightly less so, and Level AAA tend to not apply to all situations and impose limits..." or something. No doubt, that would be contentious when spelled out like that, though...

Maybe at the very least having something like the above bullet points though that give an insight into the rationale for having different levels.

@Helixopp
Copy link
Author

Helixopp commented Jun 3, 2024 via email

@mbgower
Copy link
Contributor

mbgower commented Jun 3, 2024

There does not appear to be any clear definition of what parameters a success criteria must meet in order to qualify for any particular conformance level.

I'm going to suggest that the only time this is relevant is when the working group is coming up with new success criteria (and I've asked similar questions at that point). From an author perspective, one chooses the conformance level that meets the needs of one's client (or the regulations that are in effect), and one completes all the criteria that are indicated for that level.

Accessibility consulting companies are assigning severity levels to WCAG Success Criteria (critical, moderate, serious, etc.)which seem to be largely based on conformance level.

Do you have examples of this? That is not my experience. I understand there was intention by some of the participants in 2.0 that level A criteria would be considered more critical to meet (and therefore have a higher level of severity, or at least priority, when not met); however, there is not any normative language that supports this. Further, in reality, the primary measure of severity tends to be a subjective understanding of its impact on users, either in regard to the degree it renders content unusable by specific users (such as those reliant on the keyboard API), or in regard to the number and types of users affected. Finally, the context of a specific page has a great deal to do with the relative severity cause by failing any one criterion's requirements.

@Helixopp
Copy link
Author

Helixopp commented Jun 3, 2024 via email

@patrickhlauke
Copy link
Member

I understand there was intention by some of the participants in 2.0 that level A criteria would be considered more critical to meet (and therefore have a higher level of severity when not met); however, there is not any normative language that supports this.

The whole idea that there's Level A, AA, AAA, and that they are additive (to meet A, you just do As; to meet AA, you have to pass A and AA; for AAA conformance, you must meet A, AA, AAA) does set up a hierarchy, showing that A is more "foundational" than AA so more important to meet, etc.

@mraccess77
Copy link

Most global standards and regulations call out A and AA which indicates to me that both are needed to bring a sufficient level of accessibility to disabled people.

At the same time, folks always want to prioritize issues - clearly some things such as flashing content are highest priority because of the impact - but other requirements may also be priority based on the factors previously discussed. However, the standard itself can't really prioritize one persons need above another - so the standard can't really get into that - but others can try to make determinations based on impact and biggest benefit to user needs.

@Helixopp
Copy link
Author

Helixopp commented Jun 4, 2024 via email

@Helixopp
Copy link
Author

Helixopp commented Jun 4, 2024 via email

@patrickhlauke
Copy link
Member

Most global standards and regulations call out A and AA which indicates to me that both are needed to bring a sufficient level of accessibility to disabled people.

but then even with that, it shows that AAA SCs are considered "less of a priority" / "less important", so again this is evidence that there is a hierarchy (whether intentional or not)

@mbgower
Copy link
Contributor

mbgower commented Jun 4, 2024

@Helixopp I'm going to dodge all the commentary so far and try to answer your original statement and questions.

Proposed draft response

There does not appear to be any clear definition of what parameters a success criteria must meet in order to qualify for any particular conformance level.

Correct, there is nothing published in normative text in 2.x. The Understanding levels of Conformance subsection previously referenced provides a framework, but no clear delineation. Rationales that went into level assignment between A and AA are not included in the specification and can only be inferred.

In order to be considered Conformance Level A does a success criteria have meet a certain level of severity, as in if not met it will severely adversely impact the user[?]

No, level A criteria do not have to meet a certain level of severity. There is no clear relationship between a criterion's stated level and the severity of an issue resulting from that criterion not being met. This would be impossible to provide, given the broad range of scenarios in which any criterion may be considered, or the degree to which a criterion's requirements are not met.

As well, there appear to be no regulatory frameworks that assign increased penalties to non-conformance based on the level of a failing success criterion. If such existed, that could potentially be one way in which conformance level could affect someone's assessment. But even in such a hypothetical, that would arguably affect the priority the issue was given, not its severity. Without the context of the page, it is not feasible to assign severity.

Likewise is an AA Conformance Level less severe/impactful? And so on.

Just as there is no correlation at Level A, there is none at level AA. Given that level AAA is not part of a regulatory framework in any known jurisdiction, failures of AAA are less severe strictly from a compliance perspective. The relative impact on any user (one way of measuring severity) of an unmet success criterion is again going to be influenced by any number of factors that are independent of the level classification.

@mbgower
Copy link
Contributor

mbgower commented Jun 4, 2024

I understand the desire to vehemently defend WCAG

Given pointed past comments elsewhere about various failings of the standard by pretty much everyone involved in this thread, I don't think anyone involved could be accused of vehemently defending it. I think everyone involved has a fairly mature attitude towards a useful but imperfect standard, so I suggest we set aside such language.

@Helixopp
Copy link
Author

Helixopp commented Jun 4, 2024 via email

@Helixopp
Copy link
Author

Helixopp commented Jun 4, 2024 via email

@Helixopp
Copy link
Author

Helixopp commented Jun 4, 2024 via email

@Helixopp
Copy link
Author

Helixopp commented Jun 4, 2024 via email

@kiara-stewart
Copy link

Whew! Lots of context here! Getting back up to speed after the weekend! I'll provide a brief recap for those who, like me, might need it.

Ask:

Clarify the parameters that were used to determine conformance levels.

Concerns:

  • Misinterpretation: Some consulting agencies rank accessibility issues based solely on WCAG conformance levels (e.g., Level A = high, Level AA = medium, Level AAA = low), which misrepresents their real-world impact.
  • Defendability: The lack of transparent criteria undermines the credibility and effectiveness of WCAG.
  • Future Guidance: There is no standardized method for assigning conformance levels to new success criteria.

References:

Main Argument:

  • Consensus-Driven Decision Making:
    • Challenge: Relies on qualitative judgments, which can be subjective and not easily replicable.
    • Defense: Reflects real-world complexities. Disability and, subsequently, accessibility exist on a spectrum. Additionally, it mirrors the process of how most government policies are created globally; they are not solely based on data.

Consensus:

  • Short term: We’ll update the non-normative documentation to clarify the relationship between severity, priority, and conformance levels.
  • Long-term: We recognize that creating a binary framework for leveling retroactively would be extremely challenging. WCAG 3.x? Or, it may be so complex that we revert back to our current state.

Copy Considerations:

  • Instead of stating what conformance levels are not good for, explain what would be a better measure.
  • Ensure the language used does not imply that conformance levels are irrelevant.

My Updated Copy:

The severity and prioritization of accessibility issues should not be determined solely by WCAG conformance levels, but rather by the criticality of the content and how real users perceive and operate the given site.

Note: This is still open for revision, and I welcome others to contribute their versions.

@kiara-stewart
Copy link

RE: Separating priority and severity

Aren't content, perceivability, and operability crucial for accurately assessing both severity and priority? If not, it might be more effective to identify a common denominator rather than prescribing individual assessment methods for practitioners. (E.g. assign severity by taking these steps, but determine priority by following these steps)

Or, we can just address the one most likely to be misunderstood - severity (as stated in the original ask).

@Helixopp
Copy link
Author

Helixopp commented Jun 26, 2024 via email

@kiara-stewart
Copy link

kiara-stewart commented Jun 26, 2024

Thank you for clarifying your request @Helixopp.

I understand and respect that associating a business name with an issue like this could be harmful, even if the issue is widespread.

I hope you can also respect that addressing problems is easier when we can discuss specific examples.

Since you're familiar with the businesses and initiated the original issue, could you share how you might write the disclaimer?

Side note: Whenever possible, I would appreciate it if we used specific numbers instead of quantifiers like "many," "few," or "some" to clarify our points. While this may not always be feasible, it really helps eliminate vagueness and ensure accuracy/accountability.

@Helixopp
Copy link
Author

Please review the thread. Examples have been provided. Suggested wording has been provided. I have responded that I would like to horizontally review the suggested wording with the Accessible Platform Architectures Working Group before we commit to the text. It will take a few weeks before we can calendar this discussion on APA's agenda.

It doesn't matter how many businesses apply conformance levels this way. It matters that the text of WCAG makes this possible. The W3C doesn't hold anyone accountable for how they use our publications.

@kiara-stewart
Copy link

Suggested wording has been provided.

I know because I have authored several of the iterations myself.

I have responded that I would like to horizontally review the suggested wording with the Accessible Platform Architectures Working Group before we commit to the text. It will take a few weeks before we can calendar this discussion on APA's agenda.

I specifically requested your version of the disclaimer copy because I genuinely valued your unique context and perspective @Helixopp .

However, based on your response, I'm assuming that any comments made after your request to send this copy to the APA team have not been and will not be considered.

If my assumption is incorrect, could you clarify which version of the text we are planning to present to them?

It doesn't matter how many businesses apply conformance levels this way. It matters that the text of WCAG makes this possible. The W3C doesn't hold anyone accountable for how they use our publications.

I'm not entirely sure what you meant in this part of your message. To clarify my earlier "side note," I was requesting specific figures for use in our working group discussions.

@mbgower
Copy link
Contributor

mbgower commented Jun 28, 2024

@Helixopp

I believe the Working Group has already agreed to the errata/disclaimer resolution. We are just formalizing the wording.

Do you mean that you believe there is general consensus in the comments of this issue? I think it would be more accurate to say that several suggestions have been offered by different people as ways to try to address various comments you have made. But I haven't actually seen suggested wording from you, which could really help bring this issue to some kind of completion.

Please note that the WCAG 2 Task Force has a process of vetting PRs and responses, before sending those on to the Working Group. The Working Group has not addressed this in the past to best of my knowledge. If it has, and you mean there is already a resolution to address this, please point to minutes of that resolution.

It would be inappropriate to list specific business names. Many participants in the Working Group already agree that this misconception has been observed throughout their careers.

One can provide real world examples of situations without 'naming names.' I'd welcome those. That said, if someone has done this in a public VPAT or ACR, then I think it is appropriate to cite those.

I haven't seen anyone in this thread agree that a misconception about levels by a page owner has led to significant problems with severity and priority ranking, as you describe. That's why I keep asking for details and examples. Repeatedly responding that it happens does not actually make it any more true.

@electronicwoft
Copy link

My point is clearly defining what conformance levels mean, not how people use them to prioritize issues. The fact that people use conformance levels to prioritize issues reinforces the importance of clearly articulating uniform parameters for each conformance level and consistently applying them

priority or the order in which defect A is resolved relative to defect B is meaningless without another metric to hash it with something suchb as frequency or the number of people affected - it is no less or more subjective than the assignment of severity.

Similarly, the notion that developers are somehow swayed by higher or lower measurements of severity are without evidence.

In the vast majority of situations, severity is defined for functional/systems testing, and priority is determined by product owners in a defect triage meeting or sprint ritual.

The proposed wording is an addendum rather than an erratum as it seeks to clarify rather than correct, but either way I don't believe it will prevent people from drawing comparisons.

A preferred approach might be to include an addendum that:

  1. explains why traditional severity and/or priority definitions are completely unsuitable for accessibility defects, and;
  2. encourages the use of levels of conformance as the basis for a meaningful resolution order that ensures desired conformance objectives can be achieved

FWIW, a normative update to WCAG 2.3 tabling the parameters used and describing how consensus was reached for each success criteria would likely assist practitioners explain the W3's thinking.

IMHO, it's the pushback from various stakeholders that is driving the perceived need for decisions about definitions to justify the relative importance of a given defect.

It is not too late.

It is very easy, for example, for a manaager to say that there is no such thing as a S1 or P1 accessibility defect because of the disconnect between functional/systems testing definitions of severity and priority and levels of conformance.

By providing an explanation in black and white, at least practitioners have something to point to when they're being railroaded by indifferent management.

From where I sit, the early draft of the WCAG 3.0 conformance model is only going to make this mismatch between specification and the expectations of so-called business stakeholders worse - if the relatively straightforward WCAG 2.x conformance model is being translated into what is more familiar, then this all-new conformance model witll fail utterly.

@electronicwoft
Copy link

FWIW, I agree with @Helixopp that there should at least be a very broad definition for what constitutes a Level A, AA, AAA SC - along similar lines as the broad WCAG 1.0 priority definitions, as currently that's not properly spelled out.

an explanation is always better than no explanation ... there's never going to be numerically verifiable threshholds or objective measurements of user impact largely because of the nature of the subject matter, but it wouldn't be too hard to slap together a table using factors like populations of affected users in terms of broadest, broader, and broad, for example, or is assistive technology required to work around a failure of this success criterion: yes, sometimes; never etc. Even an overlay of Schedule B in EN501.349 would provide practitioners with something in black and white to back-in a priority decision about a given failure. As it stands, the Easter-island statue silence in WCAG 2.x serves no-one.

@GreggVan
Copy link

GreggVan commented Aug 22, 2024

It is very straight forward

Definition of a Level A (or AA or AAA ) provision

  • a provision that was placed in Level A by the working group

If you want to know how they decide which level

  • the decide the level by consensus.

If you want to know what criteria they used

  • the nature of consensus is that everyone does not need to agree on their rationale - they just have to agree on the outcome. There are many lists (see the Understanding WCAG 2.0 for one) of SOME of the factors that were considered by the members of the working group. But there is no formula for determining it. The working group for WCAG 2.0 tried very hard to capture exactly what the rationales were and we found that there were no hard and fast rules - and the different people had different rationales. So don't look for a formula. Every time we came up with one -- we found it did not fit the results -- and the group did not want to change the their judgement on where something would go to fit any formula.

So don't look for a formula or critical factor.

Oh I can say that there was never a weighing of importance of one disability over another (except in effort -- we spent much more effort working on provisions for groups like cognitive, language, and learning disabilities since they were the hardest to find solutions for - since mostly we solved accessibility using AT (there are very few provisions for people who are blind -- but they had AT (Screen readers) so they could make materials accessible themselves as long as the information was compatible with AT. But for cognitive, language, and learning disabilities there was essentially no AT so that group could not make thing accessible for themselves like blind people could -- but just using the AT compatibility provisions. There are therefore many more provisions for direct access (no AT) for people with cognitive, language, and learning disabilities than for any other disability including blindness. In fact - we tried to get everything we could think of that would qualify (be testable) into WCAG 2.0. There never was a time when we excluded or downleveled a provision because of disability.

Best g

@electronicwoft
Copy link

electronicwoft commented Aug 23, 2024

There is no question as to the extraordinary efforts of those involved in formulating what has pioneered an entire discipline - this is not about saying the original working group did something fundamentally wrong. Nor is it to say that WCAG 2.0 is no longer entirely fit for purpose.

But, in my experience, just restating that this is the consensus a group of people arrived at for a given success criterion two decades earlier when being questioned by an indifferent and likely ignorant stakeholder is just not very persuasive when prosecuting a case for why this or that carries greater or lesser weight than something else.

Noone could have predicted the way that WCAG 2.0 has been desseminated, applied, paraphrased, interpreted, or challenged no more than the scribblers of various holy scriptures could be thousands of years ago.

I am sure you can see the clumsy parallel ...

So, while there may not be a 'formula or a critical factor', a working group note fleshing out the explanation in the latter part of your comment with further explanation of what levels of conformance are not equivalent to or should not be correlated with, or addressing the 'churches' which have evolved with the specification would be - in my view, at least - more helpful than what is published now about how success criteria are assigned to levels of conformance. > It is very straight forward

@GreggVan
Copy link

The explanations are already documented. We took great care to do that and mention all the major factors that were involved. There is not any more to say. There were a myriad of factors. But the strongest ones are already listed.

when people ask for more -- they ask for determinants -- which were the "criteria" for being A or AA -- as if there was a formula or weighted list. Or that everyone used the same criteria for their decision as to which level. They didn't. We had advocates for every disability and sub-disability, from industry (and they had different views and priorities and thoughts ) and academics and developers and government and evaluators and just volunteers. Together they took each provision -- discussed at great length - and decided which level it would go in. Then they debated back and forth until they reached consensus on a level for each one.

So that is why I said that the only definition of exactly why each one ended at a particular level is -- that they were put at the level that the working group reached consensus that it should be put at. Lots of people with lots of views on lots of factors -- and probably few if any people that put the same weight on the same factors in making their decision.

So there were no rules or defining factors or formulas for deciding. So the best that can be done is to describe the key factors that people said they used (which is done in WCAG/Understanding WCAG). And to say that the final decision was the decision made by the group.

Hopefully that makes it clearer.
The reply is not meant to be dismissive of the desire to know more. But that the question has been answered as best as it can be and still be accurate.

best

@electronicwoft
Copy link

thank you for taking the time elaborating how it went down back then, but, for me and presumably others, what is in the understanding documents just doesn't cut the mustard anymore for practitioners at the coalface so I guess I'll leave it there ...

@patrickhlauke
Copy link
Member

patrickhlauke commented Aug 23, 2024

"why is this SC in level FooBar?"
"because the group decided that it should go in FooBar"
"so what is the meaning of level FooBar?"
"it's the level that the group decided SCs like this one should be in"

it keep reminding me of: “When I use a word,’ Humpty Dumpty said in rather a scornful tone, ‘it means just what I choose it to mean — neither more nor less.’"

and to be clear, we're not asking for a definitive formula here, but clearly, the parameters/axes that led the group to a particular decision have been laid out. things like (but not limited to) "how severe the problem is for certain user groups", "how many user groups are affected", "how easy/hard it is to achieve a better outcome/implement a solution", "are there workarounds that make the problem circumventable".

so clearly the axes have been defined, just that nowhere it has stated, in a simple way, how those axes influenced a decision: when an SC tended to affect more users, was easier to implement, and had no workarounds, it generally ended up in Level A; on the other end of the spectrum, if an SC tended to not affect as many users, or was harder to implement, or there were workaround (albeit laborious/convoluted) that made them less of a showstopper, then an SC tended to be lumped more towards Level AA or AAA. this seems fairly non-controversial statement (though of course not elegant, but far from a "definitive formula" which nobody has actually been asking for), but somehow there's a strong reluctance to acknowledge this for some reason...

@JAWS-test
Copy link

I've read all the comments, but there are so many now that I'm not sure if I've missed something:

However, a better definition is not possible because the SC levels are already assigned and even if we were to come up with a new definition, the existing levels would no longer fit. But they can't be changed (so easily). Therefore: Why not live with the fact that there is a definition that may not be perfect, but is still comprehensible. @patrickhlauke: The 3 levels are not just Humpty Dumpty ...

@patrickhlauke
Copy link
Member

patrickhlauke commented Aug 23, 2024

@JAWS-test because that definition you link to defines the considerations/axes, but then does not actually define what A, AA, AAA are. it explains the considerations, but then misses the last bit that says how those considerations then map (even at a very broad/handwavy way) to A, AA, AAA. and i'm still surprised that some people are seemingly trying to gaslight folks about it

I mean, even just a simple statement at the end of that definition that says "consider the Levels to be rough groupings of similar SCs (similar based on these various axes/considerations). these levels don't necessarily imply importance or severity or any kind of hierarchy, on their own"

that would still not be satisfying (and arguably still not quite true), but at least it would provide that one missing bit from the definition.

@electronicwoft
Copy link

"why is this SC in level FooBar?" "because the group decided that it should go in FooBar" "so what is the meaning of level FooBar?" "it's the level that the group decided SCs like this one should be in"

it keep reminding me of: “When I use a word,’ Humpty Dumpty said in rather a scornful tone, ‘it means just what I choose it to mean — neither more nor less.’"

and to be clear, we're not asking for a definitive formula here, but clearly, the parameters/axes that led the group to a particular decision have been laid out. things like (but not limited to) "how severe the problem is for certain user groups", "how many user groups are affected", "how easy/hard it is to achieve a better outcome/implement a solution", "are there workarounds that make the problem circumventable".

so clearly the axes have been defined, just that nowhere it has stated, in a simple way, how those axes influenced a decision: when an SC tended to affect more users, was easier to implement, and had no workarounds, it generally ended up in Level A; on the other end of the spectrum, if an SC tended to not affect as many users, or was harder to implement, or there were workaround (albeit laborious/convoluted) that made them less of a showstopper, then an SC tended to be lumped more towards Level AA or AAA. this seems fairly non-controversial statement (though of course not elegant, but far from a "definitive formula" which nobody has actually been asking for), but somehow there's a strong reluctance to acknowledge this for some reason...

love the humpty-dumpty ... I was going to say 'fundamentalism' hence the holy scriptture analogy, but thought that they may be considered inappropriate, and I don't intend to be offensive.

@electronicwoft
Copy link

I've read all the comments, but there are so many now that I'm not sure if I've missed something:

However, a better definition is not possible because the SC levels are already assigned and even if we were to come up with a new definition, the existing levels would no longer fit. But they can't be changed (so easily). Therefore: Why not live with the fact that there is a definition that may not be perfect, but is still comprehensible. @patrickhlauke: The 3 levels are not just Humpty Dumpty ...

agreed @JAWS-test that it is what it is and we have to live with it, but this does not preclude providing an addendum or additional information rather than formulae.

My central concern is the various 'churches' that have filled the void in lieu of a satisfactory explanation thereby making a nonsense of conformance altogether .... there is nothing preventing an explanation of the original process or directly addressing the ways in which - in my view - these levels are being misused in these growing orthodoxies in a working group note or whatever ...

@JAWS-test
Copy link

JAWS-test commented Aug 23, 2024

@patrickhlauke and @electronicwoft: ok, thank you very much for the answers. I can understand that. I.e. the point is that A should defined as essential, no workaround possible, easy to fix, without restrictions on the design

AAA would be defined as: not important, there could be workarounds, difficult to fix, may affect the design or content ...

AA is somewhere in between. It won't be so easy to find a good formulation for this. But I understand that the description of the process for determining the levels is not identical to the definition of the levels. However, in terms of content, the definition will not provide much more than the explanation at https://www.w3.org/WAI/WCAG22/Understanding/conformance#levels

@chaals
Copy link
Contributor

chaals commented Aug 23, 2024

I'd dispute calling AAA "not important" in particular. It's there because it makes a material difference to some people with disabilities.

My very rough mental model, that's now two-and-a-half decades old (going back to formulating the levels for WCAG 1.0), is:

Level A are things that are outright showstoppers - if you don't do them, saying you've created an accessible resource is in the region of a bare-faced lie. Not having a tall narrow set of steps for a wheelchair user to get to the bathroom might be an analogy.

Level AA are the things that can be worked around, but at great effort - some people with disabilities will be unable to effectively use a resource that fails one of these. Removing a couple of low steps that some people could help most wheelchairs over, adding a rough curb-cut, not having a ramp at a 35º angle.

Level AAA are the things that make the difference between "more or less accessible" and "equality of opportunity" - putting the ramp front and centre of the entrance, a light gradient, broad enough to go in company, no corner so tight that many wheelchairs can't actually use it.

When it comes to agreeing exactly where a given requirement fits, given that in different cases it will manifest in different ways and at different levels of severity, means there is no magic formula.

Think about describing images: Sometimes it's crucial, like the map explaining where a shop is or the menu icon. Sometimes it's a picture of the sponsor's CEO, who actually looks like a stock photo, or an actual stock photo - probably a "conventionally pretty" woman because that's what the media still does - added to a newspaper article for no obvious reason.

@cstrobbe
Copy link

cstrobbe commented Aug 23, 2024

When we were working on WCAG 2.0, there was a document titled Requirements for WCAG 2.0. I think criteria for conformance levels were brought up in discussions about that document but they were not included in it. As Greg Vanderheiden pointed out in his 3 June comment, the conformance levels assigned to new criteria were based on what we could reach consensus on.

One undocumented rule that we used is that anything that affected visual design would not be at Level A, which is why the most basic contrast requirements (WCAG 2.0's SC 1.4.3 and WCAG 2.1's SC 1.4.11) are at Level AA.

There is also Requirements for WCAG 3.0; based on a brief look, this document does not discuss criteria for assigning conformance levels to success criteria.

@alastc
Copy link
Contributor

alastc commented Aug 23, 2024

I think the main point of the thread is (or should be) to encourage people to apply severity appropriately. I'd suggest a little change to the last suggestion I could find:

Conformance levels (A, AA, or AAA) are not indicative of issue severity for a particular instance of an issue. The severity or impact of any one issue on a given site should be determined by the criticality or importance of the content, as well as how easy the site is to operate and understand.


Sidenote: When we report issues the severity is completely separate from WCAG level. I don't know about other agencies, but we've employed people who have worked at other agencies and none were surprised by this approach or asked why we didn't use WCAG levels.

The only time I've come across people asking about whether to use the WCAG level for prioritization is development teams who are new to accessibility. In a scenario where they will have to work through all the A/AA issues anyway, it isn't the end of the world if they do the level A ones first. However, we do encourage them to use the severity and t-shirt-size columns to determine priority (so easy-to-fix blockers are first etc).

@patrickhlauke
Copy link
Member

patrickhlauke commented Aug 23, 2024

@alastc that'd be a good start, yes. then, if it truly is a case of "SCs have been grouped into levels based on similarity (on the various axes of consideration)" it'd be good to mention that too, to make it even clearer that it's not a hierarchy of importance, but just a very rough set of buckets (which still doesn't explain why this was done - if #allSCsMatter - what each bucket actually represents, but it'd be better than nothing/conjecture)

@patrickhlauke
Copy link
Member

perhaps relevant, as the missing piece for me at least has been "yes, we understand that SCs were placed in categories/levels because the WG decided they should go there, but surely the WG must have had a working understanding or definition of what each category/level actually represents in order to make that judgement call" ... this seems a good starting point https://lists.w3.org/Archives/Public/w3c-wai-gl/2002AprJun/0266.html

@GreggVan
Copy link

Let me start by first asking -- why do people want to know? Usually it is:

  1. "so I know which ones are most important to do --
  2. or to do first"

The answer to the first (which are more/most important) is

  • they are all important - or the time to put them in and create the conformance document and the Understanding WCAG 2.0 and the techniques documents (which all took an ENORMOUS amount of time to do) would not have been spent by people who were all volunteering and had jobs and families as well.

The answer to the second (which should I do first) is

  • that is difficult to answer quickly - and different people have put forward different strategies and all are good. here are some

    • well first there are always low hanging fruit. Things that you could do quickly that you could get a whole bunch done. Just do them rather than having them all stand in line behind some one item that is really hard and takes a long time to fix. That doesn't mean you should just do them in order of ease. See the next point about importance. But don't let one single important item block you from doing 30 other easier ones that are important for someone.
    • do them in order of importance if you can figure out what that is. Doing a lot of little things that don't really matter much and only make the page marginally different is not as important as doing the important ones that really create real problems.
    • there are no little ones. They are little for one individual, but they may be showstoppers for other people. This particularly happens in the area of cognitive language and learning disabilities where people think that these are just "usability" issues when they are in fact issues that prevent people in these groups from using things.
    • do the showstoppers first. Those things that basically cause a page to be completely unusable. A couple examples are pages at flash and can put someone into a seizure before they even know there is a problem on the page and pages that auto play audio (that does not stop after a few seconds) so that an individual using a screen reader suddenly cannot hear their screen reader. But also see the comment above about the fact that everything can be a showstopper for somebody so don't have to narrow a definition of showstopper.
    • etc

OK that now raises the question:\
Why are there three levels if it isnt importance. What do the levels mean?

That is a great question - that is somewhat lost to time. Here is the answer in short - followed by more detail.

  • all of the provisions were important (including AAA) so the levels were not created to indicate importance.
    • items are in AAA only because we could not require them for one reason or another such as
      1. they could not be applied to all pages and we could not define which pages they should apply to. So they could not be required (for all pages - which is what WCAG does)
      2. we could not find any way to objectively test whether they were done or not (so could not do them)
      3. It would be impossible to do them on every page (e.g. requiring sign language translation of every page - since there were not enough people who could do the translations over the course of a year for all the pages that were generated in a minute on the web. )
      4. etc. [Some of them are captured here](https://www.w3.org/WAI/WCAG22/Understanding/conformance#levels
      5. )
      - so items were in AAA not because they were less important but for some other reason where the working group did not feel they could be required for all pages.
      - level AAA was created to keep things into the guidelines that could not be applied everywhere -- but should be done whenever possible.

OK that explains level AAA (sort of). So why are there two more levels (A and AA) instead of just one more (other than AAA)?

That is the one that is most misunderstood I think.

  • And all sorts of rationale are provided mostly in retrospect.

  • @patrickhlauke found notes from back when we were working on it (Thank you Patrick for finding that).

    • This gives you and idea of what the 2.0 working group was trying to do. But it turns out that when we tried to apply these rules to sort the provisions -- we kept finding exceptions.
    • If you look at these and the final sort of WCAG 2.0 you will see they don't quite fit. Some fit but others don't.
    • There is also different interpretation by working group members as to what the words in each of these "sorting rules" meant.
    • So this document is very useful in getting insight into what the working group members were thinking about - but much of this was done AFTER many/most SC were sorted and we were trying to figure out what we used as rules for doing it to shorten discussions for each new one.
  • And most of the writeups done since -- are rationalizations for how it was done and not a list of the rules that were used in actually doing it. At least not rules that were strict sorting rules. They were more guidelines and starting points for where to put things.

** So that leaves -- Why were there 3 levels instead of 2?**

The bottom line is that -- we could not get any consensus when we tried to have just two levels

  • Required
  • Recommended

At one point - there was a concerted push to have WCAG look more like other standards - where there were only two levels: required and recommended (equivalent to SHALLs and SHOULDs.) There was even a majority of the group that wanted to have two levels.

The problem was that half of the group that wanted two levels wanted to create it by collapsing everything in AA into AAA, and the other half wanted to collapse AA into A.

So in the end - the one thing that everyone could agree on was the three levels.

And the items in the three levels were determined in the same (consensus) manner. Each item was placed in the location where the working group finally put them.-- often after long discussions on multiple meetings -- and always using feedback from the many public-releases-and-requests-for-comments that we received.

All the lists of ideas for how to sort them from the past and the lists that have been made retrospectively to explain them are all good indications of the factors that went into deciding which went where -- but they are not absolute determinants -- and you will find things that seem to not quite fit or not fit that set of criteria. That is because they were factors that were considered - not criteria.

If you are looking for the absolute accurate description of why each item ended up in the level it did -- here it is.

The SC ended up at the level that the working group could reach a consensus on putting them after sometimes long discussions and multiple releases for public comment.

That is the only accurate and detailed answer.

  • all the other retrospective lists of reasons (including the one from the original working group) are good lists of factors that were considered in the process. But not deterministic items.

Looking back is only so helpful. And mostly just for the similar task that we will have for WCAG 3.0

As to using levels for the two questions at the top (i.e.,. "Which are more important ?" and "Where should I start?") the answer is DON'T.

  • They are bad indications of both.
  • There are other much better strategies and better writeups and documentation of both.

But NEVER take levels as levels of importance to people with disabilities. They were NEVER that.

Hope this is helpful.

Best

Gregg
Co-Chair WCAG 2.0 working group

@patrickhlauke
Copy link
Member

patrickhlauke commented Aug 24, 2024

[edit: actually, I give up, this is just going further in circles and there's no real point here anymore]

@electronicwoft
Copy link

I think the main point of the thread is (or should be) to encourage people to apply severity appropriately. I'd suggest a little change to the last suggestion I could find:

Conformance levels (A, AA, or AAA) are not indicative of issue severity for a particular instance of an issue. The severity or impact of any one issue on a given site should be determined by the criticality or importance of the content, as well as how easy the site is to operate and understand.

Sidenote: When we report issues the severity is completely separate from WCAG level. I don't know about other agencies, but we've employed people who have worked at other agencies and none were surprised by this approach or asked why we didn't use WCAG levels.

The only time I've come across people asking about whether to use the WCAG level for prioritization is development teams who are new to accessibility. In a scenario where they will have to work through all the A/AA issues anyway, it isn't the end of the world if they do the level A ones first. However, we do encourage them to use the severity and t-shirt-size columns to determine priority (so easy-to-fix blockers are first etc).

this is precisely what is wrong with the current situation in my view.

if severity is determined by criticality then it is tautological and superfluous.

As I have said before, as a training aid for developers it is at best vague and at worst misleading because it is a value plucked out of the sky by testers who - in my experience - wouldn't know their ass from a hole in the ground when assessing user impact ...

So let's abandon severity once and for all and look at priority.

"the importance of the content as well as how easy the site is to operate and understand" is neither readily testable nor especially practicable given that it is likely only confirmed by extensive usability testing which is a nice to have most organisations place behind accessibility in their list of priorities.

What's more, 'the importance of the content' is, if you break it down' just another way of saying 'the chances of this or that preventing a user from completing a given user task is 100%' which is basically saying it's a Sev1 issue.

And, again, not something I'd leave up to the average tester.

So let's put aside priority as defined above also and concentrate on why a tester needs to assign a value to a defect in the first place or at all.

The reason is so that whomever is resolving said defects then has what is effectively a chronology to help structure their workload.

As soon as levels of conformance - whatever these might mean or how they are defined or by what process they were conceived - is removed from the calculation of this chronology, then the ultimate organisational goal of achieving a given level of conformance for a defined scope of web content is jeopardised.

tester X says that a single failure of 1.1.1 is a S4 and hey presto it isn't resolved before production and, given the awkward nature of the WCAG 2.x conformance model, may mean that conformance requirements 1, 2, and 3 are not satisfied and any claim of conformance for that scope or part of it is no longer valid.

It simply doesn't matter what each level of conformance means or how a given success criterion was assign to it - as soon as levels of conformance are obliviated by arbitrary assignments of severity or priority, or importance, or untested perceptions about ease-of-use, then the specifcation becomes valueless.

This is why there must be another factor or factors that can be assigned by testers objectively such as frequency or scope or some other categorisation that don't rely on a moistened finger in the air or experience or sixth sense or divine guidance to formulate a chronology based on levels of conformance.

Any proposed addendum must make it clear that traditional conceptions of severity-as-user-impact or criticality as just another name for severity or any other similar utterley subjective or untraceable or untestable factor is inappropriate for setting a chronology.

@electronicwoft
Copy link

[edit: actually, I give up, this is just going further in circles and there's no real point here anymore]

that's sad to hear, Patrick - there is a point and it's to make WCAG 2.x more usable for the people who are stuck with it.

@JAWS-test
Copy link

if all levels are important and, above all, if there is no difference between A and AA, then WCAG 5.2.1 makes no sense. There, a precise distinction is made between

  • A (“minimum level”)
  • A+AA
  • A+AA+AAA

If they are all equally important, WCAG could also offer that someone achieves conformance with

  • AA
  • AAA
  • A+AAA
  • AA+AAA

@electronicwoft
Copy link

electronicwoft commented Aug 24, 2024

the key there is ''equally' ... some are more equal than others! No-one has said that all success criteria are EQUALLY important, only that they are important ... which is about as useful as saying that they all are success criteria.

this is part of the problem with this thread - terms such as severity, priority, criticality, importance, etc. are undefined ...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
a11y-tracker Group bringing to attention of a11y, or tracked by the a11y Group but not needing response. Conformance WCAG 2.x Understanding
Projects
None yet
Development

No branches or pull requests