If you have done some Scala for a while, you know about pattern matching and match/case
. Things like:
1
2
3
4
value match {
case Some(value) => …
case None => …
}
But there is another use of the case
keyword, without match
, as in:
1
map foreach { case (k, v) => println(k + " -> " + v) }
The first time I saw this kind of things I was a bit puzzled: in which situations could case
be used without match
? Well, it turns out 1 that a block with a bunch of case
inside is one way of defining an anonymous function.
There is nothing new with anonymous functions of course, and Scala has a very compact notation for those that doesn’t involve case
. But this particular way of defining anonymous functions gives you a lot for free, namely all the good things of pattern matching like casting-done-right, guards, and destructuring. The example above, with foreach
, shows how case can be used for destructuring the tuples of the map into key and value components.
But there is more. Consider:
1
2
scala> List(41, "cat") map { case i: Int => i + 1 }
scala.MatchError: cat (of class java.lang.String)
As expected this crashes, because the pattern match doesn’t know what to do when the string “cat” is passed to it.
On the other hand, this example doesn’t crash:
1
2
scala> List(41, "cat") collect { case i: Int => i + 1 }
res1: List[Int] = List(42)
So what’s the difference? Does collect
just catch the MatchError
and proceed? That would be clumsy and inefficient. In fact, the apparent magic lies in the fact that case
blocks define special functions called partial functions. 2
Now you might wonder, coming from a “normal” programming language background, what it means, for a function to be “partial”. Well, it comes from mathematics, where it’s opposed to “total” functions.
But even though it comes from math it’s actually simple. Take for example this function:
1
def inc(i: Int) = i + 1
It is defined for any Int
input value. That means for that any Int
argument, it produces a resulting Int
result. 3
A partial function on the other hand is defined only for a subset of the possible values of its arguments:
1
def fraction(d: Int) = 42 / d
is not defined for d == 0
and fraction(0)
will throw an exception. Think also of the square root function, which is not defined for negative real numbers. Examples abound. And it’s true also for the collect
example above, where the anonymous function is only defined for an Int
argument but not for a String
(or any other) argument.
So you get the idea about some values not “making sense” as the argument of a function because they can’t yield a significant result.
Now if you think about it you will notice lots of situations like this in your programs, where functions are expected to work properly only for some input values. If the function is called with a disallowed value, it will typically crash, yield a special return value, or throw an exception (and this should better be documented). In short, partial function are very common in real-life programs even if you don’t know about it.
So here fraction
is defined as a regular function, but conceptually it is a partial function. The good thing is that Scala has built-in support for partial functions thanks to the PartialFunction
trait. And here is one way of defining such a partial function:
1
2
3
4
val fraction = new PartialFunction[Int, Int] {
def apply(d: Int) = 42 / d
def isDefinedAt(d: Int) = d != 0
}
A PartialFunction
must provides a method isDefinedAt
, which allows the caller of the partial function to know, beforehand, whether the function can return a result for a given input value:
1
2
3
4
scala> fraction.isDefinedAt(42)
res2: Boolean = true
scala> fraction.isDefinedAt(0)
res3: Boolean = false
And if you call the function:
1
2
3
4
scala> fraction(42)
res4: Int = 1
scala> fraction(0)
java.lang.ArithmeticException: / by zero
This takes us back to the use of case
to define partial functions. The exact same function can be written:
1
2
val fraction: PartialFunction[Int, Int] =
{ case d: Int if d != 0 => 42 / d }
(Notice that you must specify that the PartialFunction[Int, Int]
type. It would be great if Scala had a syntax to make this even more compact but it doesn’t as of Scala 2.11.)
And if you call the function:
1
2
3
4
scala> fraction(42)
res5: Int = 1
scala> fraction(0)
scala.MatchError: 0 (of class java.lang.Integer)
(Note that there is one visible difference from the outside when you use the case way: you get a MatchError
as you usually do with pattern matching.)
The idea doesn’t apply only to numbers. In our collect
example above, the partial function implicitly defined looks like this:
1
2
val incAny: PartialFunction[Any, Int] =
{ case i: Int => i + 1 }
The function takes an Any
as parameter because List(41, "cat")
is a List[Any]
. But it is only defined for inputs that are of type Int
:
1
2
3
4
scala> incAny(41)
res6: Int = 42
scala> incAny("cat")
scala.MatchError: cat (of class java.lang.String)
Passing a String
didn’t go too well, as expected. But now you can check this before calling the function with:
1
2
3
4
scala> incAny.isDefinedAt(41)
res7: Boolean = true
scala> incAny.isDefinedAt("cat")
res8: Boolean = false
So we now have the explanation for the difference in behavior between collect
and map
, which is that collect
expects a partial function. It asks incAny
whether it is defined for 41
and then "cat"
, and so automatically filters out "cat"
. Another cool thing here is that the Scala compiler can even infer a clean resulting collection type: List[Int]
!
1
2
scala> List(41, "cat") collect incAny
res9: List[Int] = List(42)
Also, as you notice, if you define the partial function inline, the compiler knows that it’s a partial function and you avoid the explicit PartialFunction
trait.
Notice that partial functions can lie:
1
2
3
4
5
6
7
8
9
scala> val liar: PartialFunction[Any, Int] =
{ case i: Int => i; case s: String => s.toInt }
liar: PartialFunction[Any,Int] = <function1>
scala> liar.isDefinedAt(42)
res10: Boolean = true
scala> liar.isDefinedAt("cat")
res11: Boolean = true
scala> liar("cat")
java.lang.NumberFormatException: For input string: "cat"
Here liar
says incorrectly that it’s defined for "cat"
. It would probably be better to write:
1
2
3
4
5
scala> val honest: PartialFunction[Any, Int] =
{ case i: Int => i; case s: String if isParsableAsInt(s) => s.toInt }
honest: PartialFunction[Any,Int] = <function1>
scala> honest.isDefinedAt("cat")
res12: Boolean = false
So now you see how partial functions defined with case
can be used for things like collect
with a super compact notation. You will see them in other places, including catch expressions.
There is another situation in Scala where partial functions are “just there” and you might not know it. Take the following List:
1
val pets = List("cat", "dog", "frog")
In Scala, any instance of Seq
, Set
or Map
is also a function. So you can write:
1
2
scala> pets(0)
res13: java.lang.String = cat
But:
1
2
scala> pets(3)
java.lang.IndexOutOfBoundsException: 3
Wouldn’t that mean that the pets
function is, hum, only defined for values 0
, 1
, and 2
? Sounds familiar? Wouldn’t it be cool to look at pets as a partial function then? Well you can because in Scala any instance of Seq
or Map
(but not Set
) is actually a partial function. So you can write:
1
2
3
4
scala> pets.isDefinedAt(0)
res14: Boolean = true
scala> pets.isDefinedAt(3)
res15: Boolean = false
And if you had a list of indexes and wanted to safely collect values for these indexes in a new list, you could write:
1
2
scala> Seq(1, 2, 42) collect pets
res16: Seq[java.lang.String] = List(dog, frog)
Here it works well because collect
handles everything for us. But it can be a pain to check isDefinedAt
all over the place. If anything, it feels a bit like a null
check, and we hate those in Scala. The good news is that in Scala the PartialFunction
trait supports the lift
method, which converts the partial function to a normal function that doesn’t crash:
1
2
3
4
scala> pets.lift(0)
res17: Option[java.lang.String] = Some(cat)
scala> pets.lift(42)
res18: Option[java.lang.String] = None
As you see the lift
returns a function that returns an Option
of the value. This allows you to safely process values without null
checks and without calling isDefinedAt
yourself:
1
2
3
4
scala> pets.lift(0) map ("I love my " + _) getOrElse ""
res19: java.lang.String = I love my cat
scala> pets.lift(42) map ("I love my " + _) getOrElse ""
res20: java.lang.String = ""
I hope this helps make some sense of partial functions in Scala.
NOTE: This post was updated on 2014-12-27 to fix the use of def
where val
was called for, based on user comments. A few typos have been corrected as well.
NOTE: This post was updated on 2017-01-17 to address the fact that Set
is not a partial function, as kindly noted by a reader.
From The Scala Language Specification: “An anonymous function can be defined by a sequence of cases […] which appear as an expression without a prior match.” ↩
This is not to be confused with partially applied functions, which are a completely different topic. ↩
In Scala, it is defined even for
Int.MaxValue
, asInt.MaxValue + 1 == Int.MinValue
. The result is just plain wrong but it’s defined! ↩
Comments powered by Disqus.