Python Cyclomatic Complexity
This post presents several scenarios with high CC and the ways to decrease it in Python.
Here’s a short introduction about how the cyclomatic complexity (CC) in computed: Introduction to Code Metrics
Keeping low CC is important for other developers to understand and maintain your code. Generally, we try to write short functions with single responsibility to avoid high CC. The following scenarios describe functions with high CC and the ways to decrease the complexity.
Homogeneous if
switch
As there’s no switch ... case ...
in Python as in other languages,
you might want to implement similar code by using many if ... elif ... else ...
statements. This can make the code heavy. Here is an example
if value == "a":
action_a()
elif value == "b":
action_b()
elif value == "c":
action_c()
else:
action_default()
This code works. But if you submit it in a large team, this code can be fragile. Other developers might add some inner logic or operations inside each if
.
This will make it inhomogeneous as described in the next section and difficult to refactor later.
Dictionary Mapping
A clean way for this situation is to create a mapping dictionary. In Python, functions are also object. So the code above can be written as this
action_map = {"a": action_a, "b": action_b, "c": action_c}
action = action_map.get(value, action_default)
action()
Of course, when the action of each case is more complex, like when the function grows too long, the inner logic becomes more and more complicated, or each action requires some specific configuration. In this case, you should consider polymorphism to refactor it into class.
Factory Pattern
A factory pattern here can be a good idea, like this
class Action(object):
@classmethod
def from_value(cls, value):
for sub_cls in cls.__subclasses__():
if "action" + value == sub_cls.__name__.lower():
return sub_cls()
return cls()
def __call__(self):
action_default()
class ActionA(Action):
def __call__(self):
action_a()
class ActionB(Action):
def __call__(self):
action_b()
class ActionC(Action):
def __call__(self):
action_c()
# main
action = Action.from_value(value)
action()
The above code is equivalent to the previous ones, but it uncouples the different actions. With this structure, you can add or delete one action without modifying any others. Every child action behaves like a plugin here.
There’s a variation I wrote some time ago in
abstract_factory.py.
It can be cleaner by setting __call__
as abstract, which makes the intention
of the parent class more clear to other developers.
Although I think the third option may look fancier, I would prefer to start the development
with the first implementation for a simple case. Then we advance to the second one when there
are too many if
. Finally when it is needed, we can refactor the code into the third pattern.
Inhomogeneous conditional logic and loop
Consider the following piece of code
if value_a == "a":
action_a()
if value_b != "c":
action_a()
if value_b == "b" and value_c == "d":
action_b()
if value_c != "c":
action_c()
This is really bad. The work flow is hard to follow. It is difficult to figure out which case will land where. It’s better refactor the code with single responsibility principle.
Unluckily this type of code is not so rare when people are in a hurry to finish some change
before deadline. Nothing can be easier than to add an if
in front of everything so his
feature will be guaranteed to work.
If you have ever done this, just be nice and thoughtful to people who will read and maintain your code later. Imagine one day you are asked to add a feature to this code. Will you enjoy working on it ?
Checking tools
For Python, radon is a useful tool to check the CC level. Generally, it is recommended to keep the CC of each function under 6. You can check if you have any function above this threshold with
radon cc . --min B -s
It also can be part of automatic checks after each commit.