# Thoughts on Church Encodings

I was recently reviewing Church encodings, for no particular reason other than to refresh my memory. It’s been a while since I first learned about them and I was pleased that they seemed so much easier to understand this time around! The exercise I was reviewing got me thinking about an idiom I’d learned and used often in Lisp dialects.

Church encodings express various computation concepts using only the lambda calculus. The lambda calculus is a programming languge model that is very, very basic:

where is an identifier. All concepts are defined with under two guiding principles:

- If two items are distinct from each other, they have forms that allow you to make that distinction.
- These forms, along with associated operations, obey the rules we define on them.

Okay, so that sounds very abstract. We can concretize these concepts by looking at, for example, booleans. On the one hand, we might think about true and false as just two tokens that must be different from each other. On the other hand, we can also think about true and false as being a *computation* that (1) enforces the cardinality of the booleans and (2) from a set of the appropriate size, reliably selects the true element and the false element.

So, the Church encoding for true is the function that, when given two elements, selects the first: and the Church encoding for false is the function that, when given two elements, selects the second: . When I first learned about this, I was like, “ummm…okay.” There’s certainly something weird and unsettling about it, which I think at the time was related to the idea that we interpret this expression as a final value when we’re thinking about it like a boolean, but then we can use it like a function inside boolean operations.

In some sense, I shouldn’t have been uncomfortable with the concept in the first place, since at the time, I was doing more coding in Lisp dialects. Lisp has this mantra that “code is data and data is code.”

Anyway, I was looking at exercises to define , , and in the lambda-calculus. If we allow a global store to hold labels in our language (purely for convenience of notation), we can define these as:

What I remembered from doing this before was how it always felt like Church encodings of things are “backwards” in some way. This is the general feeling I’ve had with the constructivist approach to logic, when playing around in Coq. The path of reasoning just feels like it’s slightly out of order.

What also struck me this time was how Lisp-like this is. Boolean operators in Lisp are often used to return default values. One common idiom (or at least, an idiom I *thought* was common) was to replace `(if t1 t2 t3)`

with `(or (and t1 t2) t3)`

. I thought maybe there was some correspondence with the Church encoding, but then I realized that the two encodings were not actually equivalent, because a negated guard (i.e. ) and a falsey consequent (i.e. ) will return , when it should just return . I thought that perhaps I had remembered the form incorrectly – thinking about it carefully, I thought that maybe it should have been `(or (and t1 t2) (and (not t1) t3))`

. This was definitely not something I’d used as a substitute for `(if t1 t2 t3)`

, so I drew a truth table to figure out what was going on. The three alternate expressions are:

We treat and like their Church-encoded counterparts: wlog, returns false if is falsy, and if is truthy. is evaluted in the usual way.

1. | ||||||||

2. | ||||||||

3. | ||||||||

4. | ||||||||

5. | ||||||||

6. | ||||||||

7. | ||||||||

8. |

Rows 3 and 4 have non-equal outcomes for , , and . Row 3 is particularly egregious – it’s the case when our guard is true, but the consequent is falsy. This got me thinking about how, in practice, is a replacement for

```
if guard
then return A
else return B
```

We generally use it in cases where we don’t expect `A`

to be falsy – it’s probably conceptually closer to the ternary operator (`guard ? A : B`

).

The less egregious inconsistency is between and in rows 3 and 4. The outcomes of both are falsy, but the provenance of the falsiness is different. This is fine when we live in a world where the population of false objects is 1 – for example, in strongly statically typed langauges with a boolean type, or in Lisp, where the only falsy object is nil. If we are instead in a world like Javascript, where nil is truthy, but boolean false, 0, and undefined variables are falsy, we could run into problems. If we allow more than one instance to inhabit our falsy type, we could have the situation where could be a number, but could be an undefined variable.