Afunctional Programming Languages

What if there were no functions?

What?

Many, many years ago, probably around 2000 while I was teaching Perl, I was waffling on about something or another about Functional Programming techniques and temporarily forgot the word for, uh, the other thing.

Not functional. Disfunctional? Unfunctional? Afunctional? Oh, yeah, *Imperative. That’s it. Now, where was I?*

Disfunctional Programming was a bit too easy to make Perl jokes about, but for some reason the word ‘Afunctional’ stuck in my head. I started wondering what it might possibly mean, and have been wondering ever since. What would an afunctional language look like?

In C there’s the concept of a ‘void function’, a function which returns nothing. That doesn’t seem much like a function but you can still treat it like one, for example:

void add_two_numbers(int a, int b, int *c) {
    *c = a + b;
}

void main() {
    int sum;
    add_two_numbers(2, 3, &sum);
    printf("%d", sum);
}

… we’re abusing the C calling convention a little to return the sum in the memory pointed to by c. This trick is commonly used in C to return multiple values or provide and exception-like behaviour. But it’s still, really, a function.

There are languages with no or very primitive functions, like assembly langauges with JSR and BASIC’s GOSUB. They’re not particularly interesting: you can force a functional paradigm onto them with a calling convention too.

Returning Control

So what is a “function” anyway?

The mathematical definition is about taking inputs and returning an output, but in programming something else is returned: control. When we call a function, we give it control and we expect to get control back: we say “please work out the 43rd Fibonacci Number and get back to me”. The function isn’t just returning the answer to the caller, it is also waking the caller back up, resuming execution from where it left off. Even void functions, which return no (useful) value, are returning control.

Asynchronicity

There’s another way here, commonly called Asynchronous method invocation where we don’t actually pass control to the function being called, we just queue up a request for it to be run and then get on with what we’re doing. The runtime will then schedule that method to be run and notify us when the result is ready. This is a very common way of doing things in Javascript, where for example you can perform an asynchronous web request like so:

    function get_example() {
        var xhr = new XMLHttpRequest();
        xhr.onload = function () {
            console.log(xhr.responseText); 
        }
        xhr.open("GET", "http://example.com/");
        xhr.send();
    }

Calling xhr.open() doesn’t give up control for long: that method almost immediately returns, and then our get_example() function finishes and returns too. When (much later in CPU terms) the answer finally comes back from the Internet it is put in our xhr object, and the anonymous function we supplied as xhr.onload gets called to return control to us so we can do something, in this case write some stuff to the console.

Some javascript frameworks really embrace the callback-heavy nature of this kind of code by taking multiple callbacks, one for success and one for failure, etc, and those callbacks can themselves include (using closures ) more callbacks, etc, to compose functions into complex structures of callbacks.

Callback Hell

This way of programming is often derided as Callback Hell because each action leads to a callback which leads to another action which leads to a callback, and so on, but by taking care we can avoid making our code too confusing. If we wanted get_example to do something after it got the example, we’d need to pass it a callback to call once it was done:

    function get_example(callback) {
        var xhr = new XMLHttpRequest();
        xhr.onload = function () {
            callback(xhr.responseText);
        }
        xhr.open("GET", "http://example.com/");
        xhr.send();
    }

A lot of this can be avoided by doing some simple metaprogramming, for example it is easy to write a function which runs a list of other functions in series or parallel, and always use it instead of a map or a loop. But this comes at a performance penalty, and there’s a cognitive penalty in converting between the approaches.

    function run_in_parallel(tasks, callback) {

        // tasks is a list of functions to call in parallel,
        //   each of which takes a callback parameter to call
        //   when it is finished.
        //
        // callback is called once all tasks have finished.

        var counter = tasks.length;
        for (var i in tasks) {
            tasks[i](function() {
                counter--;
                if (!counter) callback();
            });
        }
    }

Promises

There are alternatives such as ES6 Promises and Python asyncio which attempt to address the problem. With promises, we can write the same code above as:

    function get_example() {
        new Promise(function(resolve, reject) {
            var xhr = new XMLHttpRequest();
            xhr.onload = function() { resolve(xhr.responseText); }
            xhr.open("GET", "http://example.com/");
            xhr.send();
        }).then(
            function(text) { console.log(text); }
        )
    }

The syntax scales somewhat better, but it is still doing the same thing as before: my anonymous function is getting called when it is ready.

Message Passing

Lots of work has been done in this area, including the classic Communicating Sequential Processes and the development of the Actor Model which led to languages such as Erlang.

My interest here is “in the small”, eg: how would you create a programming language with only afunctional constructs. The reason is simple: if you have a message-passing system where the individual actors are not themselves message-passing, then there’s a mismatch at that point, and you have to make architectural decisions about where to make that cutoff. Code which was synchronous needs to be rethought when it is made asynchronous, and you can end up with slightly weird looking code like:

    function get_with_cache(key, callback) {
        if (is_in_cache(key)) {
            var value = get_from_cache(key);
            callback(value);
        } else {
            go_and_get(key, function (value) {
                store_in_cache(key, value);
                callback(value);
            });
        }
    }

… where the callback is called synchronously for a cache hit and asynchronously for a cache miss. This isn’t really problematic, just syntactically awkward.

to be continued …

How?

unix pipes
C runtime

Components

Types - Sources, Transformers, Sinks
Map, Reduce, etc

Syntax

Composition with | and ; and [ ]
Error redirection

... and another thing ...