When I was first learning about Functional Programming and Scheme, the idea that order-of-execution didn’t matter in purely functional programs, was “strange to me”, to put it nicely. When I first read about Scheme’s begin form, for example, I remember feeling satisfied that Scheme wasn’t totally insane as it had at least
Walking through their example, you will be delighted to find that the answer is quite simple: rely on the fact that Scheme is a call-by-value language and utilize macros to change the shape of your program. Here is what that means.
In Scheme, arguments to a function are evaluated before control is passed to that function. For example when you apply ‘+’ to the following arguments:
(+ 1 2 (+ 3 4))
each of the arguments is evaluated before passing control to ‘+’, so the call above would really look like:
(+ 1 2 7)
We can generalize this, per the paper, to say that any time we want to force “one thing to come before the other” (which is of particular interest when we want side-effecting operations), we can perform the following, which I’ve arbitrarily named the “lambda-dummy” pattern:
(to reiterate: this pattern comes right from the paper)
((LAMBDA (DUMMY) Statement2) Statement1)
which can be read as “The first expression evaluates to a function, which is applied to Statement1 . Since arguments to functions are evaluated before they are passed to the anonymous function, Statemen1 is evaluated, the result of which is passed to the function. Since the anonymous function exists purely to facilitate ordering, the result of Statement1 is bound to the name DUMMY to remind us that we would never use it. Finally, the body of the anonymous function, Statement2, is evaluated.” This lets us handle ordering for two statements, but how do we handle more than that? Easy, we utilize macros to restructure our code.
Suppose we wanted to implement a sequencing construct named ‘sequence’ which behaved identically to ‘begin’ (as already referenced above). The behavior we want is that statements are evaluated in order of appearance within the macro. For example, the following code would produce the following result:
(sequencing
(display 'a)
(display 'b)
(display 'c))
> abc
If we were to hand-code an expression using the lambda-dummy pattern described above, it would look like the following:
(Note here that I used PLT Scheme to write this which optionally allows (), [], and {} to be used interchangeably. I leveraged that to attempt to add more visual-cues to the code such that: [] encapsulate anonymous function definitions, {} encapsulate arguments to functions, and () contain anonymous functions, along with their arguments, for application):
([λ (dummy) ([λ (dummy) (display 'c)] {display 'b})] {display 'a})
> abc
We can formalize this transformation by creating a macro with two rules. It is actually pretty straightforward. If there is only one expression to be transformed, the macro returns that expression; if there is more than one than we can apply the lambda-dummy pattern by putting the first statement (S->N) in the argument position and the next expression (S->N+1) in the anonymous function body position (note that since ‘expression’ refers the pattern variable to which the users ‘statement’ is bound, you should feel comfortable using the terms interchangeably here). For every expression in the sequence macro body (S->N+…), this pattern is applied such that the resulting code body “grows in the middle”, resulting in an expression that makes the code look backwards! Additionally, we can rely on the fact that hygiene will prevent our introduced variable ‘ignored’ from ever clobbering any existing bindings rather than relying on the user of the macro to see the name ‘ignored’ and remember not to use it!
(define-syntax sequencing
(syntax-rules ()
[(_ expression) expression]
[(_ expression expressions ...)
((λ (ignored) (sequencing expressions ...)) expression)]))
Applying the sequencing macro like this:
(sequencing
(display 'a)
(display 'b)
(display 'c))
generates the following backwards-looking code (again using the same bracket-based visual-cues):
([λ (ignored) ([λ (ignored) (display 'c)] {display 'b})] {display 'a})
Stare at it for a while. It is actually pretty easy to read when you take into account that the following pattern is present:
- The “whole thing”, or expression, that realized the lambda-dummy pattern, may be called the ‘sequencing-body’.
- The anonymous function is on the left in [] brackets
- The argument expression to that function is on the right in {} brackets
- The argument expression gets evaluated
- The anonymous function gets evaluated. If it evaluates to a ‘sequencing-body’, go to the first step, otherwise evaluate the last expression and we’re done.
Why, you may ask, is this important? It is important because it gives you a conceptual model for order-of-evaluation about which you can reason.
Without it, all you’ve got to go on is that “things happen this way” either because “that is the way the language designer wanted it” or that “computers execute instructions in order, so it has got to be this way”. Neither help you to better understand programming or computation, and the latter, in particular, is not true when you’re wearing your programmer hat.
This topic originally appeared in a post I made to the PLT discussion list. I wanted to get feedback and make sure that conceptually this is a reasonable approach to understanding sequencing in Scheme. The answer that I got was ‘yes’.
Quite interesting. The title made me curious. You explained it well.
LOUIS:
Nice to meet you. If you ever want to talk more about anything, that sounds fun.