You can order print and ebook versions of Think Python 3e from Bookshop.org and Amazon.
6. Return Values#
In previous chapters, we’ve used built-in functions – like abs
and round
– and functions in the math module – like sqrt
and pow
.
When you call one of these functions, it returns a value you can assign to a variable or use as part of an expression.
The functions we have written so far are different.
Some use the print
function to display values, and some use turtle functions to draw figures.
But they don’t return values we assign to variables or use in expressions.
In this chapter, we’ll see how to write functions that return values.
6.1. Some functions have return values#
When you call a function like math.sqrt
, the result is called a return value.
If the function call appears at the end of a cell, Jupyter displays the return value immediately.
import math
math.sqrt(42 / math.pi)
3.656366395715726
If you assign the return value to a variable, it doesn’t get displayed.
radius = math.sqrt(42 / math.pi)
But you can display it later.
radius
3.656366395715726
Or you can use the return value as part of an expression.
radius + math.sqrt(42 / math.pi)
7.312732791431452
Here’s an example of a function that returns a value.
def circle_area(radius):
area = math.pi * radius**2
return area
circle_area
takes radius
as a parameter and computes the area of a circle with that radius.
The last line is a return
statement that returns the value of area
.
If we call the function like this, Jupyter displays the return value.
circle_area(radius)
42.00000000000001
We can assign the return value to a variable.
a = circle_area(radius)
Or use it as part of an expression.
circle_area(radius) + 2 * circle_area(radius / 2)
63.000000000000014
Later we can display the value of the variable we assigned the result to.
a
42.00000000000001
But we can’t access area
.
area
NameError: name 'area' is not defined
area
is a local variable in a function, so we can’t access it from outside the function.
6.2. And some have None#
If a function doesn’t have a return
statement, it returns None
, which is a special value like True
and False
.
For example, here’s the repeat
function from Chapter 3.
def repeat(word, n):
print(word * n)
If we call it like this, it displays the first line of the Monty Python song “Finland”.
repeat('Finland, ', 3)
Finland, Finland, Finland,
This function uses the print
function to display a string, but it does not use a return
statement to return a value.
If we assign the result to a variable, it displays the string anyway.
result = repeat('Finland, ', 3)
Finland, Finland, Finland,
And if we display the value of the variable, we get nothing.
result
result
actually has a value, but Jupyter doesn’t show it.
However, we can display it like this.
print(result)
None
The return value from repeat
is None
.
Now here’s a function similar to repeat
except that has a return value.
def repeat_string(word, n):
return word * n
Notice that we can use an expression in a return
statement, not just a variable.
With this version, we can assign the result to a variable. When the function runs, it doesn’t display anything.
line = repeat_string('Spam, ', 4)
But later we can display the value assigned to line
.
line
'Spam, Spam, Spam, Spam, '
A function like this is called a pure function because it doesn’t display anything or have any other effect – other than returning a value.
6.3. Return values and conditionals#
If Python did not provide abs
, we could write it like this.
def absolute_value(x):
if x < 0:
return -x
else:
return x
If x
is negative, the first return
statement returns -x
and the function ends immediately.
Otherwise, the second return
statement returns x
and the function ends.
So this function is correct.
However, if you put return
statements in a conditional, you have to make sure that every possible path through the program hits a return
statement.
For example, here’s an incorrect version of absolute_value
.
def absolute_value_wrong(x):
if x < 0:
return -x
if x > 0:
return x
Here’s what happens if we call this function with 0
as an argument.
absolute_value_wrong(0)
We get nothing! Here’s the problem: when x
is 0
, neither condition is true, and the function ends without hitting a return
statement, which means that the return value is None
, so Jupyter displays nothing.
As another example, here’s a version of absolute_value
with an extra return
statement at the end.
def absolute_value_extra_return(x):
if x < 0:
return -x
else:
return x
return 'This is dead code'
If x
is negative, the first return
statement runs and the function ends.
Otherwise the second return
statement runs and the function ends.
Either way, we never get to the third return
statement – so it can never run.
Code that can never run is called dead code. In general, dead code doesn’t do any harm, but it often indicates a misunderstanding, and it might be confusing to someone trying to understand the program.
6.4. Incremental development#
As you write larger functions, you might find yourself spending more time debugging. To deal with increasingly complex programs, you might want to try incremental development, which is a way of adding and testing only a small amount of code at a time.
As an example, suppose you want to find the distance between two points represented by the coordinates \((x_1, y_1)\) and \((x_2, y_2)\). By the Pythagorean theorem, the distance is:
The first step is to consider what a distance
function should look like in Python – that is, what are the inputs (parameters) and what is the output (return value)?
For this function, the inputs are the coordinates of the points. The return value is the distance. Immediately you can write an outline of the function:
def distance(x1, y1, x2, y2):
return 0.0
This version doesn’t compute distances yet – it always returns zero. But it is a complete function with a return value, which means that you can test it before you make it more complicated.
To test the new function, we’ll call it with sample arguments:
distance(1, 2, 4, 6)
0.0
I chose these values so that the horizontal distance is 3
and the
vertical distance is 4
.
That way, the result is 5
, the hypotenuse of a 3-4-5
right triangle. When testing a function, it is useful to know the right answer.
At this point we have confirmed that the function runs and returns a value, and we can start adding code to the body.
A good next step is to find the differences x2 - x1
and y2 - y1
.
Here’s a version that stores those values in temporary variables and displays them.
def distance(x1, y1, x2, y2):
dx = x2 - x1
dy = y2 - y1
print('dx is', dx)
print('dy is', dy)
return 0.0
If the function is working, it should display dx is 3
and dy is 4
.
If so, we know that the function is getting the right arguments and
performing the first computation correctly. If not, there are only a few
lines to check.
distance(1, 2, 4, 6)
dx is 3
dy is 4
0.0
Good so far. Next we compute the sum of squares of dx
and dy
:
def distance(x1, y1, x2, y2):
dx = x2 - x1
dy = y2 - y1
dsquared = dx**2 + dy**2
print('dsquared is: ', dsquared)
return 0.0
Again, we can run the function and check the output, which should be 25
.
distance(1, 2, 4, 6)
dsquared is: 25
0.0
Finally, we can use math.sqrt
to compute the distance:
def distance(x1, y1, x2, y2):
dx = x2 - x1
dy = y2 - y1
dsquared = dx**2 + dy**2
result = math.sqrt(dsquared)
print("result is", result)
And test it.
distance(1, 2, 4, 6)
result is 5.0
The result is correct, but this version of the function displays the result rather than returning it, so the return value is None
.
We can fix that by replacing the print
function with a return
statement.
def distance(x1, y1, x2, y2):
dx = x2 - x1
dy = y2 - y1
dsquared = dx**2 + dy**2
result = math.sqrt(dsquared)
return result
This version of distance
is a pure function.
If we call it like this, only the result is displayed.
distance(1, 2, 4, 6)
5.0
And if we assign the result to a variable, nothing is displayed.
d = distance(1, 2, 4, 6)
The print
statements we wrote are useful for debugging, but once the function is working, we can remove them.
Code like that is called scaffolding because it is helpful for building the program but is not part of the final product.
This example demonstrates incremental development. The key aspects of this process are:
Start with a working program, make small changes, and test after every change.
Use variables to hold intermediate values so you can display and check them.
Once the program is working, remove the scaffolding.
At any point, if there is an error, you should have a good idea where it is. Incremental development can save you a lot of debugging time.
6.5. Boolean functions#
Functions can return the boolean values True
and False
, which is often convenient for encapsulating a complex test in a function.
For example, is_divisible
checks whether x
is divisible by y
with no remainder.
def is_divisible(x, y):
if x % y == 0:
return True
else:
return False
Here’s how we use it.
is_divisible(6, 4)
False
is_divisible(6, 3)
True
Inside the function, the result of the ==
operator is a boolean, so we can write the
function more concisely by returning it directly.
def is_divisible(x, y):
return x % y == 0
Boolean functions are often used in conditional statements.
if is_divisible(6, 2):
print('divisible')
divisible
It might be tempting to write something like this:
if is_divisible(6, 2) == True:
print('divisible')
divisible
But the comparison is unnecessary.
6.6. Recursion with return values#
Now that we can write functions with return values, we can write recursive functions with return values, and with that capability, we have passed an important threshold – the subset of Python we have is now Turing complete, which means that we can perform any computation that can be described by an algorithm.
To demonstrate recursion with return values, we’ll evaluate a few recursively defined mathematical functions. A recursive definition is similar to a circular definition, in the sense that the definition refers to the thing being defined. A truly circular definition is not very useful:
vorpal: An adjective used to describe something that is vorpal.
If you saw that definition in the dictionary, you might be annoyed. On the other hand, if you looked up the definition of the factorial function, denoted with the symbol \(!\), you might get something like this:
This definition says that the factorial of \(0\) is \(1\), and the factorial of any other value, \(n\), is \(n\) multiplied by the factorial of \(n-1\).
If you can write a recursive definition of something, you can write a Python program to evaluate it.
Following an incremental development process, we’ll start with a function that take n
as a parameter and always returns 0
.
def factorial(n):
return 0
Now let’s add the first part of the definition – if the argument happens to be 0
, all we have to do is return 1
:
def factorial(n):
if n == 0:
return 1
else:
return 0
Now let’s fill in the second part – if n
is not 0
, we have to make a recursive
call to find the factorial of n-1
and then multiply the result by n
:
def factorial(n):
if n == 0:
return 1
else:
recurse = factorial(n-1)
return n * recurse
The flow of execution for this program is similar to the flow of countdown
in Chapter 5.
If we call factorial
with the value 3
:
Since 3
is not 0
, we take the second branch and calculate the factorial
of n-1
.…
Since
2
is not0
, we take the second branch and calculate the factorial ofn-1
.…Since
1
is not0
, we take the second branch and calculate the factorial ofn-1
.…Since
0
equals0
, we take the first branch and return1
without making any more recursive calls.The return value,
1
, is multiplied byn
, which is1
, and the result is returned.The return value,
1
, is multiplied byn
, which is2
, and the result is returned.
The return value 2
is multiplied by n
, which is 3
, and the result,
6
, becomes the return value of the function call that started the whole
process.
The following figure shows the stack diagram for this sequence of function calls.
The return values are shown being passed back up the stack.
In each frame, the return value is the product of n
and recurse
.
In the last frame, the local variable recurse
does not exist because the branch that creates it does not run.
6.7. Leap of faith#
Following the flow of execution is one way to read programs, but it can quickly become overwhelming. An alternative is what I call the “leap of faith”. When you come to a function call, instead of following the flow of execution, you assume that the function works correctly and returns the right result.
In fact, you are already practicing this leap of faith when you use built-in functions.
When you call abs
or math.sqrt
, you don’t examine the bodies of those functions – you just assume that they work.
The same is true when you call one of your own functions. For example, earlier we wrote a function called is_divisible
that determines whether one number is divisible by another. Once we convince ourselves that this function is correct, we can use it without looking at the body again.
The same is true of recursive programs. When you get to the recursive call, instead of following the flow of execution, you should assume that the recursive call works and then ask yourself, “Assuming that I can compute the factorial of \(n-1\), can I compute the factorial of \(n\)?” The recursive definition of factorial implies that you can, by multiplying by \(n\).
Of course, it’s a bit strange to assume that the function works correctly when you haven’t finished writing it, but that’s why it’s called a leap of faith!
6.8. Fibonacci#
After factorial
, the most common example of a recursive function is fibonacci
, which has the following definition:
Translated into Python, it looks like this:
def fibonacci(n):
if n == 0:
return 0
elif n == 1:
return 1
else:
return fibonacci(n-1) + fibonacci(n-2)
If you try to follow the flow of execution here, even for small values of \(n\), your head explodes.
But according to the leap of faith, if you assume that the two recursive calls work correctly, you can be confident that the last return
statement is correct.
As an aside, this way of computing Fibonacci numbers is very inefficient. In Chapter 10 I’ll explain why and suggest a way to improve it.
6.9. Checking types#
What happens if we call factorial
and give it 1.5
as an argument?
factorial(1.5)
RecursionError: maximum recursion depth exceeded in comparison
It looks like an infinite recursion. How can that be? The function has a
base case – when n == 0
.
But if n
is not an integer, we can miss the base case and recurse forever.
In this example, the initial value of n
is 1.5
.
In the first recursive call, the value of n
is 0.5
.
In the next, it is -0.5
.
From there, it gets smaller (more negative), but it will never be 0
.
To avoid infinite recursion we can use the built-in function isinstance
to check the type of the argument.
Here’s how we check whether a value is an integer.
isinstance(3, int)
True
isinstance(1.5, int)
False
Now here’s a version of factorial
with error-checking.
def factorial(n):
if not isinstance(n, int):
print('factorial is only defined for integers.')
return None
elif n < 0:
print('factorial is not defined for negative numbers.')
return None
elif n == 0:
return 1
else:
return n * factorial(n-1)
First it checks whether n
is an integer.
If not, it displays an error message and returns None
.
factorial('crunchy frog')
factorial is only defined for integers.
Then it checks whether n
is negative.
If so, it displays an error message and returns None.
factorial(-2)
factorial is not defined for negative numbers.
If we get past both checks, we know that n
is a non-negative integer, so we can be confident the recursion will terminate.
Checking the parameters of a function to make sure they have the correct types and values is called input validation.
6.10. Debugging#
Breaking a large program into smaller functions creates natural checkpoints for debugging. If a function is not working, there are three possibilities to consider:
There is something wrong with the arguments the function is getting – that is, a precondition is violated.
There is something wrong with the function – that is, a postcondition is violated.
The caller is doing something wrong with the return value.
To rule out the first possibility, you can add a print
statement at the beginning of the function that displays the values of the parameters (and maybe their types).
Or you can write code that checks the preconditions explicitly.
If the parameters look good, you can add a print
statement before each return
statement and display the return value.
If possible, call the function with arguments that make it easy check the result.
If the function seems to be working, look at the function call to make sure the return value is being used correctly – or used at all!
Adding print
statements at the beginning and end of a function can help make the flow of execution more visible.
For example, here is a version of factorial
with print statements:
def factorial(n):
space = ' ' * (4 * n)
print(space, 'factorial', n)
if n == 0:
print(space, 'returning 1')
return 1
else:
recurse = factorial(n-1)
result = n * recurse
print(space, 'returning', result)
return result
space
is a string of space characters that controls the indentation of
the output. Here is the result of factorial(3)
:
factorial(3)
factorial 3
factorial 2
factorial 1
factorial 0
returning 1
returning 1
returning 2
returning 6
6
If you are confused about the flow of execution, this kind of output can be helpful. It takes some time to develop effective scaffolding, but a little bit of scaffolding can save a lot of debugging.
6.11. Glossary#
return value: The result of a function. If a function call is used as an expression, the return value is the value of the expression.
pure function: A function that does not display anything or have any other effect, other than returning a return value.
dead code:
Part of a program that can never run, often because it appears after a return
statement.
incremental development: A program development plan intended to avoid debugging by adding and testing only a small amount of code at a time.
scaffolding: Code that is used during program development but is not part of the final version.
Turing complete: A language, or subset of a language, is Turing complete if it can perform any computation that can be described by an algorithm.
input validation: Checking the parameters of a function to make sure they have the correct types and values
6.12. Exercises#
# This cell tells Jupyter to provide detailed debugging information
# when a runtime error occurs. Run it before working on the exercises.
%xmode Verbose
Exception reporting mode: Verbose
6.12.1. Ask a virtual assistant#
In this chapter, we saw an incorrect function that can end without returning a value.
def absolute_value_wrong(x):
if x < 0:
return -x
if x > 0:
return x
And a version of the same function that has dead code at the end.
def absolute_value_extra_return(x):
if x < 0:
return -x
else:
return x
return 'This is dead code.'
And we saw the following example, which is correct but not idiomatic.
def is_divisible(x, y):
if x % y == 0:
return True
else:
return False
Ask a virtual assistant what’s wrong with each of these functions and see if it can spot the errors or improve the style.
Then ask “Write a function that takes coordinates of two points and computes the distance between them.” See if the result resembles the version of distance
we wrote in this chapter.
6.12.2. Exercise#
Use incremental development to write a function called hypot
that returns the length of the hypotenuse of a right triangle given the lengths of the other two legs as arguments.
Note: There’s a function in the math module called hypot
that does the same thing, but you should not use it for this exercise!
Even if you can write the function correctly on the first try, start with a function that always returns 0
and practice making small changes, testing as you go.
When you are done, the function should only return a value – it should not display anything.
6.12.3. Exercise#
Write a boolean function, is_between(x, y, z)
, that returns True
if \(x < y < z\) or if
\(z < y < x\), andFalse
otherwise.
6.12.4. Exercise#
The Ackermann function, \(A(m, n)\), is defined:
Write a function named ackermann
that evaluates the Ackermann function.
What happens if you call ackermann(5, 5)
?
6.12.5. Exercise#
The greatest common divisor (GCD) of \(a\) and \(b\) is the largest number that divides both of them with no remainder.
One way to find the GCD of two numbers is based on the observation that if \(r\) is the remainder when \(a\) is divided by \(b\), then \(gcd(a, b) = gcd(b, r)\). As a base case, we can use \(gcd(a, 0) = a\).
Write a function called gcd
that takes parameters a
and b
and
returns their greatest common divisor.
Copyright 2024 Allen B. Downey
Code license: MIT License
Text license: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International