Code is clean if it can be understood easily – by everyone on the team. With understandability comes readability, changeability, extensibility and maintainability. All the things needed to keep a project going over a long time without accumulating up a large amount of technical debt. Writing clean code from the start in a project is an investment in keeping the cost of change as constant as possible throughout the lifecycle of a software product. Therefore, the initial cost of change is a bit higher when writing clean code than quick and dirty programming, but is paid back quite soon. Especially if you keep in mind that most of the cost has to be paid during maintenance of the software. Unclean code results in technical debt that increases over time if not refactored into clean code. There are other reasons leading to Technical Debt such as bad processes and lack of documentation, but unclean code is a major driver. As a result, your ability to respond to changes is reduced.
Legend: DO + DON’T –
Most software defects are introduced when changing existing code. The reason behind this is that the developer changing the code cannot fully grasp the effects of the changes made. Clean code minimises the risk of introducing defects by making the code as easy to understand as possible. #Principles
Two classes, components or modules are coupled when at least one of them uses the other. The less these items know about each other, the looser they are coupled. A component that is only loosely coupled to its environment can be more easily changed or replaced than a strongly coupled component.
Cohesion is the degree to which elements of a whole belong together. Methods and fields in a single class and classes of a component should have high cohesion. High cohesion in classes and components results in simpler, more easily understandable code structure and design.
When a software system has to be maintained, extended and changed for a long time, keeping change local reduces involved costs and risks. Keeping change local means that there are boundaries in the design which changes do not cross.
We normally build software by adding, extending or changing features. However, removing elements is important so that the overall design can be kept as simple as possible. When a block gets too complicated, it has to be removed and replaced with one or more simpler blocks.
The software is difficult to change. A small change causes a cascade of subsequent changes.
The software breaks in many places due to a single change.
You cannot reuse parts of the code in other projects because of involved risks and high effort.
Taking a shortcut and introducing technical debt requires less effort than doing it right.
Building, testing and other tasks take a long time. Therefore, these activities are not executed properly by everyone and technical debt is introduced.
The design contains elements that are currently not useful. The added complexity makes the code harder to comprehend. Therefore, extending and changing the code results in higher effort than necessary.
Code contains lots of code duplication: exact code duplications or design duplicates (doing the same thing in a different way). Making a change to a duplicated piece of code is more expensive and more error-prone because the change has to be made in several places with the risk that one place is not changed accordingly.
The code is hard to understand. Therefore, any change takes additional time to first reengineer the code and is more likely to result in defects due to not understanding the side effects.
A class should have one, and only one, reason to change.
You should be able to extend a classes behaviour without modifying it.
Derived classes must be substitutable for their base classes.
Depend on abstractions, not on concretions.
Make fine grained interfaces that are client-specific.
Smaller classes are easier to grasp. Classes should be smaller than about 100 lines of code. Otherwise, it is hard to spot how the class does its job and it probably does more than a single job.
The granule of reuse is the granule of release.
Classes that change together are packaged together.
Classes that are used together are packaged together.
The dependency graph of packages must have no cycles.
Depend in the direction of stability.
Abstractness increases with stability
Coding-, architecture-, design guidelines (check them with tools)
Simpler is always better. Reduce complexity as much as possible.
Leave the campground cleaner than you found it.
Always look for the root cause of a problem. Otherwise, it will get you again and again.
C#, Java, JavaScript, XML, HTML, XAML, English, German …
Check out and then build with a single command.
Run all unit tests with a single command.
Always use a source control system.
Assure integrity with Continuous Integration
Do not override warnings, errors, exception handling – they will catch you.
Decoupling the construction phase completely from the runtime helps to simplify the runtime behaviour.
If you have a constant such as default or configuration value that is known and expected at a high level of abstraction, do not bury it in a low-level function. Expose it as an argument to the low-level function called from the high-level function.
Have a reason for the way you structure your code, and make sure that reason is communicated by the structure of the code. If a structure appears arbitrary, others will feel empowered to change it.
When you make a decision in your code, make sure you make it precisely. Know why you have made it and how you will deal with any exceptions. Structure over Convention + Enforce design decisions with structure over convention. Naming conventions are good, but they are inferior to structures that force compliance.
“ONE SWITCH”: There may be no more than one switch statement for a given type of selection. The cases in that switch statement must create polymorphic objects that take the place of other such switch statements in the rest of the system.
Favour symmetric designs (e.g. Load – Save) and designs that follow analogies (e.g. same design as found in .NET framework).
Do not mix code that handles multi-threading aspects with the rest of the code. Separate them into different classes.
Something put in the wrong place.
Functionality is at wrong level of abstraction, e.g. a PercentageFull property on a generic IStack.
Fields holding data that does not belong to the state of the instance but are used to hold temporary data. Use local variables or extract to a class abstracting the performed action.
Prevent configuration just for the sake of it – or because nobody can decide how it should be. Otherwise, this will result in overly complex, unstable systems.
Do not add functionality on top, but simplify overall.
If one module depends upon another, that dependency should be physical, not just logical. Don’t make assumptions.
Use dependency injection. Singletons hide dependencies. Base Classes Depending On Their Derivatives – Base classes should work with any derived class.
Minimise interface to minimise coupling
The methods of a class should be interested in the variables and functions of the class they belong to, and not the variables and functions of other classes. When a method uses accessors and mutators of some other object to manipulate the data within that object, then it envies the scope of the class of that other object. It wishes that it were inside that other class so that it could have direct access to the variables it is manipulating.
Things that don’t depend upon each other should not be artificially coupled.
Hidden Temporal Coupling –
If, for example, the order of some method calls is important, then make sure that they cannot be called in the wrong order.
Aka Law of Demeter, writing shy code. A module should know only its direct dependencies.
Names have to reflect what a variable, field, property stands for. Names have to be precise.
Choose names that reflect the level of abstraction of the class or method you are working in.
The name of an interface should be derived from its usage by the client, such as IStream.
The name of a class should reflect how it fulfils the functionality provided by its interface(s), such as MemoryStream : IStream
The name of a method should describe what is done, not how it is done.
fields -> parameters -> locals -> loop variables long -> short
Names have to reflect the entire functionality.
Don’t invent your own language when there is a standard.
No prefixes, no type/scope information
If you do something a certain way, do all similar things in the same way: same variable name for same concepts, same naming pattern for corresponding concepts.
Use locals to give steps in algorithms names.
Boundary conditions are hard to keep track of. Put the processing for them in one place, e.g. nextLevel = level + 1;
Instead of passing primitive types like strings and integers, use dedicated primitive types: e.g. AbsolutePath instead of string.
Comment does not add any value (redundant to code), is not well formed, not correct grammar/spelling.
Too dense algorithms that lose all expressiveness.
Violations of “the Principle of Least Astonishment”. What you expect is what you get.
Hidden Logical Dependency –
A method can only work when invoked correctly depending on something else in the same class, e.g. a DeleteItem method must only be called if a CanDeleteItem method returned true, otherwise it will fail.
Loops, exception handling, …encapsulate in sub-methods.
The statements within a method should all be written at the same level of abstraction, which should be one level below the operation described by the name of the function.
Prefer fewer arguments. Maybe functionality can be outsourced to a dedicated class that holds the information in fields.
Prevent usage. Return complex object holding all values, split into several methods. If your method must change the state of something, have it change the state of the object it is called on.
public int Foo(bool flag) Split method into several independent methods that can be called from the client without the flag.
Static method that should be an instance method Source Code Structure
Variables and methods should be defined close to where they are used. Local variables should be declared just above their first usage and should have a small vertical scope.
Nested code should be more specific or handle less probable scenarios than unnested code.
Keep everything belonging to the same feature together. Don't use namespaces communicating layers. A feature may use another feature; a business feature may use a core feature like logging.
if (this.ShouldBeDeleted(timer)) is preferable to if (timer.HasExpired && !timer.IsRecurrent).
Positive conditionals are easier to read than negative conditionals.
Delete unused things. You can find them in your version control system.
Code that is not dead but does not add any functionality
Comment holding information better held in a different kind of system: product backlog, source control. Use code comments for technical notes only.
Eliminate duplication. Violation of the “Don’t repeat yourself” (DRY) principle.
Replace Magic Numbers and Strings with named constants to give them a meaningful name when meaning cannot be derived from the value itself.
Use reference codes instead of enums if they have to be persisted. Use polymorphism instead of enums if they define behaviour.
Catch exceptions as specific as possible. Catch only the exceptions for which you can react in a meaningful manner.
Only catch exceptions when you can react in a meaningful way. Otherwise, let someone up in the call stack react to it.
In an exceptional case, throw an exception when your method cannot do its job. Don't accept or return null. Don't return error codes.
Exceptions should be thrown as early as possible after detecting an exceptional case. This helps to pinpoint the exact location of the problem by looking at the stack trace of the exception.
Using exceptions for control flow: has bad performance, is hard to understand and results in very hard handling of real exceptional cases.
Exceptions can be swallowed only if the exceptional case is completely resolved after leaving the catch block. Otherwise, the system is left in an inconsistent state.
Change your system in small steps, from a running state to a running state.
Identify the existing features in your code and prioritise them according to how relevant they are for future development (likelihood and risk of change).
Refactor the boundaries of your system to interfaces so that you can simulate the environment with test doubles (fakes, mocks, stubs, simulators).
Cover a feature with Acceptance Tests to establish a safety net for refactoring.
Within a feature, identify the components used to provide the feature. Prioritise components according to relevance for future development (likelihood and risk of change).
Refactor (or introduce) interfaces between components so that each component can be tested in isolation of its environment.
Cover the features provided by a component with Acceptance Tests.
Refactor, Reengineer, Keep + Decide for each component whether to refactor, reengineer or keep it.
Redesign classes within the component and refactor step by step (see Refactoring Patters). Add unit tests for each newly designed class.
Use ATDD and TDD (see Clean ATDD/TDD cheat sheet) to re-implement the component.
If you anticipate only few future changes to a component and the component had few defects in the past, consider keeping it as it is.
Change both pieces of code stepwise until they are identical.
First, isolate the code to be refactored from the rest. Then refactor. Finally, undo isolation.
Move from one representation to another by temporary duplication of data structures.
Refactor by introducing a temporary parallel implementation of an algorithm. Switch one caller after the other. Remove old solution when no longer needed.
Introduce an internal component boundary and push everything unwanted outside of the internal boundary into the demilitarized zone between component interface and internal boundary. Then refactor the component interface to match the internal boundary and eliminate the demilitarized zone.
Two developers solving a problem together at a single workstation. One is the driver, the other is the navigator. The driver is responsible for writing the code. The navigator is responsible for keeping the solution aligned with the architecture, the coding guidelines and looks at where to go next (e.g. which test to write next). Both challenge their ideas and approaches to solutions.
A developer walks a peer developer through all code changes prior to committing (or pushing) the changes to the version control system. The peer developer checks the code against clean code guidelines and design guidelines.
In a Coding Dojo, a group of developers come together to exercise their skills. Two developers solve a problem (kata) in pair programming. The rest observe. After 10 minutes, the group rotates to build a new pair. The observers may critique the current solution, but only when all tests are green.
Specify a feature first with a test, then implement.
Red – green – refactor. Test a little – code a little.
Write a unit test that reproduces the defect – Fix code – Test will succeed – Defect will never return.
Aka test after. Write unit tests to check existing code. You cannot and probably do not want to test drive everything. Use POUT to increase sanity. Use to add additional tests after TDDing (e.g. boundary cases).
Objects have to be easily creatable. Otherwise, easy and fast testing is not possible.
Pass dependencies and configuration/parameters into the constructor that have a lifetime equal to or longer than the created object. For other values use methods or properties.
Use abstraction layers at system boundaries (database, file system, web services, COM interfaces ...) that simplify unit testing by enabling the usage of mocks.
Structure the tests always by AAA. Never mix these three blocks.
Create a test assembly for each production assembly and name it as the production assembly + “.Test”.
Put the tests in the same namespace as their associated testee.
Unit test methods show all parts needed for the test. Do not use SetUp method or base classes to perform actions on testee or dependencies.
Use the SetUp / TearDown methods only for infrastructure that your unit test needs. Do not use it for anything that is under test.
Names reflect what is tested, e.g. FeatureWhenScenarioThenBehaviour.
One test checks one scenario only.
Test and resource are together: FooTest.cs, FooTest.resx
Give the variable holding the System Under Test always the same name (e.g. testee or sut). Clearly identifies the SUT, robust against refactoring.
Give the variable holding the result of the tested method always the same name (e.g. result).
Always use the same name for variables holding uninteresting arguments to tested methods (e.g. anonymousText).
Just working is not enough, make sure you understand why it works.
Always unit test boundaries. Do not assume behaviour.
Use fakes to simulate all dependencies of the testee.
Use a dynamic fake framework for fakes that show different behaviour in different test scenarios (little behaviour reuse).
Use manually written fakes when they can be used in several tests and they have only little changed behaviour in these scenarios (behaviour reuse).
Make sure that you follow the AAA (arrange, act, assert) syntax when using mocks. Don’t mix setting up stubs (so that the testee can run) with expectations (on what the testee should do) in the same code block.
Tests that do not check the testee but values returned by fakes. Normally due to excessive fake usage.
If your test needs a lot of mocks or mock setup, then consider splitting the testee into several classes or provide an additional abstraction between your testee and its dependencies.
Unit tests have to be fast in order to be executed often. Fast means much smaller than seconds.
Clear where the failure happened. No dependency between tests (random order).
No assumed initial state, nothing left behind, no dependency on external services that might be unavailable (databases, file system …).
No manual test interpretation or intervention. Red or green!
Tests are written at the right time (TDD, DDT, POUTing) Unit Test Smells
Passing test that at first sight appears valid but does not test the testee.
A test that needs dozens of lines of code to set up its environment. This noise makes it difficult to see what is really tested. Too Large Test / Assertions for Multiple Scenarios – A valid test that is, however, too large. Reasons can be that this test checks for more than one feature or the testee does more than one thing (violation of Single Responsibility Principle).
A test that accesses internals (private/protected members) of the testee directly (Reflection). This is a refactoring killer.
A test that is dependent on the development environment and fails elsewhere. Use continuous integration to catch them as soon as possible.
A test that checks more than it is dedicated to. The test fails whenever something changes that it checks unnecessarily. Especially probable when fakes are involved or checking for item order in unordered collections.
Test contains information that is not relevant to understand it.
A test that fills the console with text – probably used once to manually check for something.
A test that catches exceptions and lets the test pass.
A test that tests a completely different testee than all other tests in the fixture.
A test that checks something no longer required in the system. May even prevent clean-up of production code because it is still referenced.
Hidden Test Functionality –
Test functionality hidden in either the SetUp method, base class or helper class. The test should be clear by looking at the test method only – no initialisation or asserts somewhere else.
The construction of dependencies and arguments used in calls to testee makes test hardly readable. Extract to helper methods that can be reused.
Split test or use assertion messages.
Tests should not have any conditional test logic because it’s hard to read.
Tests depend on special logic in production code.
Sometimes passes, sometimes fails due to left overs or environment.
A test checks exactly one feature of the testee. That means that it tests all things included in this feature but not more. This includes probably more than one call to the testee. This way, the tests serve as samples and documentation of the usage of the testee.
Make tiny little steps. Add only a little code in test before writing the required production code. Then repeat. Add only one Assert per step.
Whenever a test gets complicated, check whether you can split the testee into several classes (Single Responsibility Principle)
Use behaviour verification only if there is no state to verify.
Use test DSLs to simplify reading tests: helper methods, classes.
Use code coverage to find missing tests but don’t use it as a driving tool. Otherwise, the result could be tests that increase code coverage but not certainty.
Make small steps to get feedback as fast and frequent as possible. Not Running Test Before Writing Production Code – Only if the test fails, then new code is required. Additionally, if the test, surprisingly, does not, fail then make sure the test is correct.
Refactoring is an investment in the future. Readability, changeability and extensibility will pay back.
Don’t assume, check it. If it is easy, then the test is even easier.
Make it simpler, otherwise bugs will hide in there and maintainability will suffer.
These tests are brittle and refactoring killers. Test complete “mini” use cases in a way which reflects how the feature will be used in the real world. Do not test setters and getters in isolation, test the scenario they are used in. Red Bar Patterns
Pick a test you are confident you can implement and which maximises learning effect (e.g. impact on design).
Write a test that does not fully check the required behaviour, but brings you a step closer to it. Then use Extend Test below.
Extend an existing test to better match real-world scenarios.
If you think of new tests, then write them on the TO DO list and don’t lose focus on current test.
Write tests against external components to make sure they behave as expected. Green Bar Patterns Fake It (‘Til You Make It) + Return a constant to get first test running. Refactor later. Triangulate – Drive Abstraction + Write test with at least two sets of sample data. Abstract implementation on these.
If the implementation is obvious then just implement it and see if test runs. If not, then step back and just get test running and refactor then.
First, implement operation for a single element. Then, step to several elements.
Acceptance tests check for the required functionality. Let them guide your TDD.
An acceptance test is a test for a complete user feature from top to bottom that provides business value.
Use automated Acceptance Test Driven Development for regression testing and executable specifications.
Write acceptance tests for individual components or subsystems so that these parts can be combined freely without losing test coverage.
Simulate system boundaries like the user interface, databases, file system and external services to speed up your acceptance tests and to be able to check exceptional cases (e.g. a full hard disk). Use system tests to check the boundaries.
Do not write acceptance tests for every possibility. Write acceptance tests only for real scenarios. The exceptional and theoretical cases can be covered more easily with unit tests. Clean ATDD/TDD Cheat Sheet Continuous Integration
Run all unit and acceptance tests covering currently worked on code prior to committing to the source code repository.
Run all unit and acceptance tests on every commit to the version control system on the continuous integration server. Communicate Failed Integration to Whole Team + Whenever a stage on the continuous integration server fails, notify whole team in order to get blocking situation resolved as soon as possible.
Split the complete continuous integration workflow into individual stages to reduce feedback time.
Automatically build an installer as often as possible to test software on a test system (for manual tests, or tests with real hardware).
Install the system to a test environment on every commit or manual request. Deployment to production environment is automated, too. Test Pyramid
Constraint Test = Test for non-functional requirements.
Markers from http://www.planetgeek.ch/wp-content/uploads/2013/06/Clean-Code-V2.1.pdf https://www.amazon.com/Clean-Code-Handbook-Software-Craftsmanship/dp/0132350882 https://cleancoders.com/
Test Driven Development: By Example by Kent Beck ATDD by Example: A Practical Guide to Acceptance Test-Driven Development by Markus Gärtner The Art of Unit testing by Roy Osherove xUnit Test Patterns: Refactoring Test Code by Gerard Meszaros