Chika - an alternative language spurred by uLisp

phunanon · 2020-04-08 23:50:36 UTC

Hello, everybody. Apologies in advance for such a long post—I’ve held back introducing this for a very long time. I hope anybody finds it interesting :)

I have been following uLisp for a few months now. It has been incredibly inspiring! I can really appreciate the versatility of Lisp in general, and it was wonderful having its implementation click when learning about it. However, I found uLisp on my own journey of creation: an s-expression language for Arduino/PC called Chika.
The rationale is a system facilitating “flash once”, where as much of the platform as possible is within user programs rather than built into the language. While this precludes bare-metal native performance, it has luckily proven its own benefits.

Cheekily, a goal all along has been to exceed the speed of uLisp. Chika is able to perform (tak 18 12 6) in 9.6s, compared to uLisp’s 15.0s, on my Adafruit Feather M0; or with O3 optimisation 7.6s vs. 12.8s. This is just one example mind, echoed through many limited benchmarks.
It’s at a heavy price: no mutation, homoiconicity, closures, native REPL, macros, laziness, multi-arity, orthodox syntax, or code imports with necessity for an SD card. I don’t call it a Lisp. Chika eschews heap/GC memory, instead s-expression arguments are stacked and consumed by the native operation or program function.
This gives Chika very different memory characteristics to uLisp. While it doesn’t have to break values into linked-list cells, it does have to duplicate items in memory—however long they may be—before operating on them. Strangely it was way slower mitigating this with copy-on-write than just letting memcpy do its thing, so I chose that…

There are some perceived benefits however, with implemented: 100% pure functions and immutable memory, round-robin scheduler, inter-program messaging, native Linux support. Currently programs request an amount of RAM and have a simple life-cycle: compilation (once), entry, heartbeat, messages, halt. The scheduler ‘beats’ the heartbeat function of each program (if it exists) until halted. Programs can create MQTT-like message callback subscriptions and broadcast messages to the system. These messages are a topic string with simple wildcards and a payload item (any data type).

An example of this architecture, loading two programs and two drivers (also programs) all written in Chika:

The SSD1306 driver (emulating I²C) subscribes to messages with the topics “dis/char”, “dis/clear”, “dis/page”, &c, and displays these synchronously.
The keypad driver uses its heartbeat to monitor an analog pin, and collects presses into ASCII characters, broadcasting “kb/char”.
The prompt program provides a small buffer for line editing, collecting characters from its “kb/char” subscription, and emitting “prompt/input” upon submission.
The reader program invokes the prompt program for a file path, and subscribes to its input plus keypad keys to scroll the file up and down.

Now, this is a novelty - typing characters in ASCII, waiting ~4.5s for the 128x64 OLED to be completely wiped (better to use a slave chip imo). But it works, and logic is fairly decoupled/reusable. Not only that, but the keyboard input could be from any source, such as a proper keyboard or the internet, and the programs wouldn’t bat an eye - it’s just a message to them. The future holds a personal computer middleware with an ecosystem to draw from.

My workflow can involve a lot of tests through emitting messages and checking their response, on PC, and if the virtual-machine screws up (less frequently now) I can use gdb to debug the issue. Source files on Linux can be run like executables with a !#chika shebang.

I won’t talk any more, though there’s a lot more I could say (state management, compilation, variables, syntax, native operations, core library, &c.), but for the curious I’ll answer any questions :)

phunanon · 2020-04-11 12:15:22 UTC

Realised I could probably share at least one image.

This illustrates using the basic REPL, inputting )file.chi to compile a file on SD card, and just the executable name to execute it. The programs involved are the SSD1306 driver, keypad driver, prompt program with basic line editing (backspace, for example), and REPL program which is this simple:

#196

(load "prompt")
(sub "prompt/in" handle-input)

(fn nl (pub "dis/char" \nl))

(fn do-repl input
  (nl)
  (file-w "repltmp.chi"
    (str "(pub " \" "dis/str" \" " (str " input "))"))
  (comp "repltmp.chi")
  (load "repltmp")
  (file-d "repltmp"))

(fn do-load input
  (nl)
  (load input))

(fn handle-input x input
  (case (nth 0 input)
    \( (do-repl input)
    \) (comp (sect input))
    (do-load input))
  (nl)
  (val T))

(fn heartbeat load-prompt
  (if load-prompt (load "prompt"))
  (sleep 255))

Compiles to 221B.

phunanon · 2020-05-25 11:42:26 UTC

Apologies for bumping this thread, but I don’t want to intrude too much :)

github.com

phunanon/Chika/blob/master/docs/platform-comparisons.md

# Chika and other platforms compared

## Benchmarks

| Which            | Arduino      | uLisp `O0` | Chika `O0` | uLisp `O3` | Chika `O3` | CircuitPython |
| ---------------- | ------------ | ---------- | ---------- | ---------- | ---------- | ------------- |
| Fibonacci 15     | Mega 2560    | 512ms      | 651ms      |            |            |               |
| Fibonacci 23     | Mega 2560    | 28s        | 38s        |            |            |               |
| Takeuchi 18 12 6 | Mega 2560    | 49s†       |            |            |            |               |
| Fibonacci 23     | MKRZero      | 12s        | 7832ms     |            |            |               |
| Takeuchi 18 12 6 | MKRZero      | 18s†       | 9575ms     |            |            |               |
| Fibonacci 17     | M0 Adalogger | 536ms      | 441ms      | 452ms      | 344ms      | 160ms         |
| Fibonacci 18     | M0 Adalogger | 880ms      | 714ms      | 744ms      | 556ms      | recursion err |
| Takeuchi 10 6 1  | M0 Adalogger | 1121ms     | 903ms      | 933ms      | 707ms      | 307ms         |
| Takeuchi 18 12 6 | M0 Adalogger | 12494ms    | 9498ms     | 10535ms    | 7433ms     | recursion err |

† Used `(not (< y x))` instead.  
Note: Mega 2560 Chika benchmarks were before significant performance improvements.

**uLisp functions**

This file has been truncated. show original

I’ve done a bit of benchmarking between uLisp, Chika, and CircuitPython. I’m unsurprised it has us all beat, being 3x faster than uLisp and 2.3x faster than Chika. However, both uLisp and Chika can handle deep recursion.
I’d appreciate some peer-review over the memory usages of uLisp I’ve also included. I tried to reason about these from the documentation / source. For example I disbelieve the workspace memory could be so large.

johnsondavies · 2020-05-25 14:14:20 UTC

Thanks for the comparisons! It’s good to have the CircuitPython ones too. I’m surprised it can’t manage the recursion in some cases.

What’s the difference between the ‘O0’ and ‘O3’ columns? Sorry if I’ve missed your explanation somewhere.

For example I disbelieve the workspace memory could be so large.

I’d be happy to explain this but I don’t quite understand what you mean by the question.

Thanks, David

phunanon · 2020-05-25 14:36:08 UTC

the ‘O0’ and ‘O3’ columns

I appended

#pragma GCC optimize ("O3")
#pragma GCC push_options

To the top of uLisp’s .ino and Chika’s VM.

I don’t quite understand what you mean by the question.

Essentially trying to figure out how much RAM each data type and program takes. For example, if the sq function takes 15 cells does that make it 60B (15*4B) in memory?

Also, can you think of any other benchmarks I could try? I was really struggling to find some I could implement across all three languages which would test anything useful.

johnsondavies · 2020-05-25 16:45:09 UTC

Essentially trying to figure out how much RAM each data type and program takes. For example, if the sq function takes 15 cells does that make it 60B (15*4B) in memory?

On the 8/16-bit platforms every object (ie cons cell) takes 4 bytes, and on 32-bit platforms it’s 8 bytes. You can see how much memory a function uses by looking at the before and after workspace usage before the uLisp prompt. So on a Mega 2560:

Before loading the tak function: 1213
After: 1165

Therefore it uses 48 objects, or 192 bytes.

Also, can you think of any other benchmarks I could try? I was really struggling to find some I could implement across all three languages which would test anything useful.

Here are some suggestions:

Factor; a good test of integer calculation speed:
http://www.ulisp.com/show?1EO1/#factor

The recursive function Q2. Another test of recursion, like tak:
http://www.ulisp.com/show?1EO1/#q2

Sudoku solver; a test of arrays, but needs more memory than the M0 boards have::
http://www.ulisp.com/show?33J9

Fast Fourier Transform; a good test of floating-point arithmetic:
http://www.ulisp.com/show?27ES

Query language; would be a good test of an AI-type application, but might be difficult to translate to MicroProlog:
http://www.ulisp.com/show?2I60