
Posts about python

A Tale of Optimization

I reimplemented Pygments in Crystal. It didn't quite go as I expected. I have already written about how I did it, but that left a large part of the story untold. You see, I am using Crystal, which compiles to native code. And yet my reimplementation was slower than Python. That's not supposed to happen.

I decided to figure out why, and fix it. This is the story of how I made my software that looked "ok" 30x faster. Mind you, this is going to make me sound much smarter and more accurate than I am. I ran into 200 dead ends before finding each improvement.

The First Problem (v0.1.0)

| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|:---|---:|---:|---:|---:|
| `bin/tartrazine ../crycco/src/crycco.cr swapoff > x.html --standalone` | 533.8 ± 4.4 | 523.7 | 542.9 | 18.80 ± 0.92 |
| `chroma ../crycco/src/crycco.cr -l crystal -f html -s swapoff > x.html` | 28.4 ± 1.4 | 25.6 | 32.8 | 1.00 |
| `pygmentize ../crycco/src/crycco.cr -l crystal -O full,style=autumn -f html -o x.html` | 103.5 ± 2.8 | 95.6 | 109.1 | 3.65 ± 0.20 |

That benchmark (like all the rest) was done using hyperfine, running each command 50 times after a 10-run warmup. Not that it needs so much care; just look at those numbers. Not only is tartrazine almost 20 times slower than chroma, it's also 3.5 times slower than Pygments. And Pygments is written in Python!

Even without comparing, half a second to highlight a 100-line file is ridiculous.

What's going on here? To find out, let's get data. I used callgrind to profile the code, and then kcachegrind to visualize it.

$ valgrind --tool=callgrind bin/tartrazine ../crycco/src/crycco.cr swapoff

Some functions called half a billion times

As you can see, some functions are called half a billion times and account for 40% of the execution time. What are they?

A string in Crystal is always unicode. The String::char_bytesize_at function is used to convert an offset into the string from characters to bytes. That's because unicode characters can be different "widths" in bytes. So in the string "123" the "3" is the 3rd byte, but in "áéí" the "í" starts in the 5th byte.
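
To make that concrete, here's plain Crystal (nothing tartrazine-specific; char_index_to_byte_index is the stdlib helper that does exactly this walking):

ascii = "123"
accented = "áéí"

puts ascii.size        # => 3 (characters)
puts ascii.bytesize    # => 3 (bytes)
puts accented.size     # => 3 (characters)
puts accented.bytesize # => 6 (bytes: á, é and í take 2 bytes each in UTF-8)

# Mapping a character offset to a byte offset has to walk the string:
puts accented.char_index_to_byte_index(2) # => 4 (the byte where "í" starts)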

And why is it doing that? Because this code does a bazillion regex operations, and the underlying library (PCRE2) deals in bytes, not characters. So every time we need to do something like "match this regex in this string starting at the 9th position" we need to convert that offset to bytes; then, when it finds a match at byte X, we need to convert that offset back to characters; and to extract the data we do it two more times; and so on.

I decided to ... not do that. One nice thing about Crystal is that even though it's compiled, the whole standard library is there in /usr/lib/crystal, so I could just go and read how the regular expression code was implemented and see what to do.

I ended up writing a version of Regex and Regex.match that worked on bytes, and made my code deal with bytes instead of characters. I only needed to convert into strings when I had already generated a token, rather than in the thousands of failed attempts to generate it.
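
Crystal's stdlib does expose byte-oriented matching (Regex#match_at_byte_index and MatchData#byte_begin/byte_end), so a sketch of the pattern, with illustrative names, looks like this:

re = /[a-z_]+/
text = File.read("input.cr")
byte_pos = 0

# Stay in byte offsets end to end; only materialize a String
# when there is an actual match.
while match = re.match_at_byte_index(text, byte_pos)
  puts match[0]
  byte_pos = match.byte_end(0)
end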

The Second Problem (commit 0626c86)

| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|:---|---:|---:|---:|---:|
| `bin/tartrazine ../crycco/src/crycco.cr -f html -t swapoff -o x.html --standalone` | 187.8 ± 4.4 | 183.2 | 204.1 | 7.48 ± 0.30 |
| `chroma ../crycco/src/crycco.cr -l crystal -f html -s swapoff > x.html` | 25.1 ± 0.8 | 23.6 | 28.5 | 1.00 |
| `pygmentize ../crycco/src/crycco.cr -l crystal -O full,style=autumn -f html -o x.html` | 89.9 ± 4.7 | 83.6 | 102.1 | 3.58 ± 0.22 |

While better, this still sucks. I made it 2.5 times faster, but it's still 7.5 times slower than chroma, and 3.6x slower than Python???

regex library takes all the time

Back to valgrind. This time, the profile was ... different. The regex library was taking almost all the execution time, which makes sense, since it's doing all the work. But why is it so slow?

It's calling valid_utf8 ALL THE TIME?

Almost all the time is spent in valid_utf8. That's a function that checks if a string is valid UTF-8. And it's called all the time. Why? Because the regex library is written in C, and it doesn't know that the strings it's working with are already valid UTF-8. So, it checks. And checks. And checks.

Solution? Let's not do that either. The PCRE2 library has a handy flag called NO_UTF_CHECK just for that. So, if you pass that flag when you are doing a regex match, it will not call valid_utf8 at all!
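
This isn't tartrazine's actual binding, but the idea looks roughly like this (the function name and the 0x40000000 value come from pcre2.h; the rest is an illustrative hand-rolled binding):

PCRE2_NO_UTF_CHECK = 0x40000000_u32

lib LibPCRE2
  fun match = pcre2_match_8(code : Void*, subject : UInt8*, length : LibC::SizeT,
                            start_offset : LibC::SizeT, options : UInt32,
                            match_data : Void*, mcontext : Void*) : Int32
end

# The subject came from a Crystal String, so it's known-valid UTF-8;
# NO_UTF_CHECK tells PCRE2 to skip the valid_utf8 scan on every call.
rc = LibPCRE2.match(code, text.to_unsafe, text.bytesize,
                    byte_offset, PCRE2_NO_UTF_CHECK,
                    match_data, nil)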

The Third Problem (commit 7db8fdc)

| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|:---|---:|---:|---:|---:|
| `bin/tartrazine ../crycco/src/crycco.cr -f html -t swapoff -o x.html --standalone` | 30.0 ± 2.2 | 25.5 | 36.6 | 1.15 ± 0.10 |
| `chroma ../crycco/src/crycco.cr -l crystal -f html -s swapoff > x.html` | 26.0 ± 1.0 | 24.1 | 29.2 | 1.00 |
| `pygmentize ../crycco/src/crycco.cr -l crystal -O full,style=autumn -f html -o x.html` | 96.3 ± 7.7 | 86.1 | 125.3 | 3.70 ± 0.33 |

Yay! My compiled program is finally faster than an interpreted one! And even in the same ballpark as the other compiled one!

I wonder what happens with a larger file!

| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|:---|---:|---:|---:|---:|
| `bin/tartrazine /usr/include/sqlite3.h -f html -t swapoff -o x.html --standalone -l c` | 896.6 ± 71.6 | 709.6 | 1015.8 | 7.07 ± 0.58 |
| `chroma /usr/include/sqlite3.h -l c -f html -s swapoff > x.html` | 126.8 ± 2.1 | 122.8 | 132.9 | 1.00 |
| `pygmentize /usr/include/sqlite3.h -l c -O full,style=autumn -f html -o x.html` | 229.1 ± 4.5 | 219.5 | 238.9 | 1.81 ± 0.05 |

Clearly something very bad happens when my code processes larger files. I wonder what it is?

That's rather low level

At first glance this is not very informative: most of the execution time is spent in libgc and libc functions. That's not very helpful. But if you look a bit harder, you'll see the execution time is spent allocating memory.

Hundreds of milliseconds spent allocating memory

But memory is fast!

Yes, memory is fast. But allocating it is not. And I was allocating a lot of memory. Or rather, I was allocating memory over and over. See that memcpy there?

This took me about a day to figure out, but this line of code is where everything became slow.

matched, new_pos, new_tokens = rule.match(text_bytes, pos, self)
if matched
  # Move position forward, save the tokens,
  # tokenize from the new position
  pos = new_pos
  tokens += new_tokens
end

That looks innocent, right? It's not. The tokens array is created at the beginning of tokenization, and every time I find new tokens I add them to it (and += actually builds a brand-new array on every call, which is even more copying). BUT ... how does that work? Arrays are not infinite! Where does it put the new tokens?

Well, when an array grows, it allocates a new, larger array, copies all the elements there, and now you have a larger array with some room to spare. When you keep doing that, you are copying the whole array over and over. It's the cost you pay for the convenience of having an array that can grow.
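
A toy way to feel that cost (plain Crystal; the absolute numbers don't matter, the gap does):

chunk = (1..10).to_a

t1 = Time.measure do
  a = [] of Int32
  10_000.times { a += chunk } # a + chunk builds a whole new array each time
end

t2 = Time.measure do
  a = [] of Int32
  10_000.times { a.concat(chunk) } # appends in place, amortized growth
end

puts "+=     took #{t1.total_milliseconds}ms"
puts "concat took #{t2.total_milliseconds}ms"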

Where was I calling this? In the formatter. The formatter needs to see the tokens to turn them into HTML. Here's the relevant code:

lines = lexer.group_tokens_in_lines(lexer.tokenize(text))

All it does is group the tokens into lines (BTW, that again does the whole "grow an array" thing), and then it just iterates the lines, then the tokens in each line, slaps some HTML around them, and writes them to a string (which, again, grows).

The solution is ... not to do that.

In Crystal we have iterators, so I changed the tokenizer so that rather than returning an array of tokens, it is an iterator that generates them one after the other. So the formatter looks more like this:

i = 0 # current line number
tokenizer.each do |token|
  outp << "<span class=\"#{get_css_class(token[:type])}\">#{HTML.escape(token[:value])}</span>"
  if token[:value].ends_with? "\n"
    i += 1
    outp << line_label(i) if line_numbers?
  end
end

Rather than iterate an array, it iterates ... an iterator. So there's no tokens array in the middle. Instead of grouping tokens into lines, it spits out a line label after every newline character (yes, this is a bit more complicated under the hood).
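
The shape of such a tokenizer, as a minimal sketch (not tartrazine's real class, which matches lexer rules instead of going character by character):

alias Token = NamedTuple(type: String, value: String)

class Tokenizer
  include Iterator(Token)

  def initialize(@text : String)
    @pos = 0
  end

  # Produce one token per call; `stop` ends the iteration.
  def next
    return stop if @pos >= @text.size
    value = @text[@pos].to_s
    @pos += 1
    {type: "Text", value: value}
  end
end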

There were several other optimizations, but this is the one that made the difference.

The Fourth Problem (commit ae03e46)

| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|:---|---:|---:|---:|---:|
| `bin/tartrazine /usr/include/sqlite3.h -f html -t swapoff -o x.html --standalone -l c` | 73.5 ± 5.9 | 67.7 | 104.8 | 1.00 |
| `chroma /usr/include/sqlite3.h -l c -f html -s swapoff > x.html` | 123.1 ± 2.7 | 117.5 | 130.3 | 1.68 ± 0.14 |
| `pygmentize /usr/include/sqlite3.h -l c -O full,style=autumn -f html -o x.html` | 222.0 ± 5.9 | 207.8 | 239.4 | 3.02 ± 0.26 |

Finally, tartrazine is the fastest one. On a large file! By a good margin! But is that all there is? Is there nothing else to improve? Well, no. I can do the same trick again!

You see, the formatter was returning a String by appending to it, and then we were writing the string to a file. That's the same problem as before! So, I changed the formatter to take an IO object and write to it directly.
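
Conceptually the change looks like this (a sketch with illustrative names, not tartrazine's exact API):

# Before: build and return a String (one more buffer that grows and copies)
def format(text : String) : String
  outp = String::Builder.new
  # ... write every span into outp ...
  outp.to_s
end

# After: write into any IO (a File, STDOUT, ...) as tokens are produced
def format(text : String, outp : IO) : Nil
  # ... write every span directly into outp ...
end

File.open("x.html", "w") do |file|
  formatter.format(text, file)
end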

| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|:---|---:|---:|---:|---:|
| `bin/tartrazine /usr/include/sqlite3.h -f html -t swapoff -o x.html --standalone -l c` | 70.3 ± 1.8 | 65.6 | 73.8 | 1.00 |
| `chroma /usr/include/sqlite3.h -l c -f html -s swapoff > x.html` | 122.2 ± 3.0 | 116.6 | 131.1 | 1.74 ± 0.06 |
| `pygmentize /usr/include/sqlite3.h -l c -O full,style=autumn -f html -o x.html` | 219.4 ± 4.4 | 212.2 | 235.5 | 3.12 ± 0.10 |

As you can see, that still gives a small improvement, of just 3 milliseconds. But that's about 5% of the total time. And it's a small change.

And this is where diminishing returns hit. I could probably make it faster, but even a 10% improvement would be just 7 milliseconds on a huge file. If I were GitHub then maybe this would be worth my time, but I am not, and it's not.

And how does the final version compare with the first one?

| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|:---|---:|---:|---:|---:|
| `bin/tartrazine-last ../crycco/src/crycco.cr -f html -t swapoff -o x.html --standalone -l c` | 16.1 ± 1.1 | 13.1 | 21.8 | 1.00 |
| `bin/tartrazine-first ../crycco/src/crycco.cr swapoff` | 519.1 ± 5.4 | 508.1 | 533.7 | 32.29 ± 2.23 |

A speedup factor of 32.3x, for code that had nothing obviously wrong in it. I'd say that's pretty good.

Tartrazine: reimplementing pygments or my adventure in extreme test-driven-development

This is a "what I did this weekend" post, but it's slightly more interesting than others, I think. So, I reimplemented a large chunk of Pygments in a lib called Tartrazine.

Why?

Because I wanted to highlight source code in Markterm, and I wanted to do it in Crystal, without using an external dependency.

I was using Chroma, but it runs over a pipe and makes the code look ugly, and you need to install it, and so on.

So ... I knew Chroma was a Go port of Pygments. So I thought ... how hard can it be? They already did it!

Because I believe we need more libraries, I just started writing the damned thing.

What?

Pygments/Chroma consists of three parts.

  • Lexers, which take a text and turn it into a pile of tokens.
  • Styles, which, when asked about a token type, return a color/bold/underline/etc. "style".
  • Formatters, which iterate a list of tokens, apply styles, and create a stream of text (for example HTML with pretty colors).

The hard part seemed to be the lexers, so I started there.

How?

I lied a little. I started by trying to read the Pygments code. It quickly became clear that there are several kinds of lexers, but most of them (like, 90%) are "regex lexers": lexers that use a state machine and a bunch of regular expressions to tokenize the input.

I know and have implemented state machines. State machines are easy. So, I decided to just implement the regex lexers. They have the huge advantage that they have little to no code. They are just a bunch of regular expressions and a bunch of rules that say "if you see this, do that".

They are implemented as data. Here's what the Ada lexer looks like:

    tokens = {
        'root': [
            (r'[^\S\n]+', Text),
            (r'--.*?\n', Comment.Single),
            (r'[^\S\n]+', Text),
            (r'function|procedure|entry', Keyword.Declaration, 'subprogram'),
            (r'(subtype|type)(\s+)(\w+)',
             bygroups(Keyword.Declaration, Text, Keyword.Type), 'type_def'),
            (r'task|protected', Keyword.Declaration),
            (r'(subtype)(\s+)', bygroups(Keyword.Declaration, Text)),
            (r'(end)(\s+)', bygroups(Keyword.Reserved, Text), 'end'),
            (r'(pragma)(\s+)(\w+)', bygroups(Keyword.Reserved, Text,
                                             Comment.Preproc)),
            (r'(true|false|null)\b', Keyword.Constant),
            # builtin types
            (words(BUILTIN_LIST, suffix=r'\b'), Keyword.Type),
            (r'(and(\s+then)?|in|mod|not|or(\s+else)|rem)\b', Operator.Word),
            (r'generic|private', Keyword.Declaration),
            (r'package', Keyword.Declaration, 'package'),
            (r'array\b', Keyword.Reserved, 'array_def'),
            (r'(with|use)(\s+)', bygroups(Keyword.Namespace, Text), 'import'),
            (r'(\w+)(\s*)(:)(\s*)(constant)',
             bygroups(Name.Constant, Text, Punctuation, Text,
                      Keyword.Reserved)),
            (r'<<\w+>>', Name.Label),

While utterly inscrutable, that's just data. Then I looked at how Pygments processes that data, and it was bad news. While it's OK, it's very idiomatic Python. Like, metaclasses and things jumping around the codebase. I had a feeling it couldn't be that hard.

After all, the excellent "write your own lexer" document explains it in about two pages of text!

So, I looked at Chroma's implementation. Let's say I am now distrustful of those who claim Go code is simple, and worried I may be extremely dumb.

Sure, if I spent some time I could understand it, but I am not a Go person, and I don't have plans to be one soon, so I had to make decisions.

And then I saw a magical folder...

A folder full of XML, which is obviously the lexer definitions.

Chroma took the Pygments lexer definitions, which were data in Python files, and turned them into data in actual data files.

And actually reading those XML files alongside the Pygments doc did the trick. I now know how to write a lexer.

But really, How?

Let's look at how, using the definition of a very simple lexer, the "bash_session" one. A lexer, like I said, is a state machine. Each lexer has some metadata, such as its name, aliases, etc., and some instructions about how to process input.

In this case, it says input should end with a newline.

<lexer>
  <config>
    <name>Bash Session</name>
    <alias>bash-session</alias>
    <alias>console</alias>
    <alias>shell-session</alias>
    <filename>*.sh-session</filename>
    <mime_type>text/x-sh</mime_type>
    <ensure_nl>true</ensure_nl>
  </config>

Since a lexer is a state machine, it has states. The first state is always called root. Each state has rules. Because this lexer is very simple, it has only one state with two rules.

  <rules>
    <state name="root">
      <rule pattern="^((?:\[[^]]+@[^]]+\]\s?)?[#$%&gt;])(\s*)(.*\n?)">
        <bygroups>
          <token type="GenericPrompt"/>
          <token type="Text"/>
          <using lexer="bash"/>
        </bygroups>
      </rule>
      <rule pattern="^.+\n?">
        <token type="GenericOutput"/>
      </rule>
    </state>
  </rules>

Each rule has a pattern (a regular expression) which decides if the rule applies or not.

The first rule says "if the line starts with a prompt, capture the prompt, capture the spaces after it, and then capture the rest of the line".

Then, inside the rule, we have "actions". This rule has one action, bygroups. This action says "the first group we captured is a GenericPrompt, the second group is Text, and the third group we should ask the bash lexer to tokenize".

And that makes sense, since a bash session looks like this:

$ echo hello
hello

There you have "$" (the prompt), " " (text), and "echo hello" (bash code).

The second rule is simpler. It says "capture a whole line".

So, when processing that example session, it works like this:

The state is "root" (it always starts there), and we look at the beginning of the file.

The first line matches the first rule, so we capture the prompt, the spaces, and the text. We generate the first two tokens: GenericPrompt and Text. Then we ask the bash lexer to tokenize the rest of the line. It returns a list of tokens; we keep those tokens too.

Because we matched, we move the "cursor" to the end of the match, which is now at the beginning of the second line.

And we start matching again.

The state is root. The first rule doesn't match at the position we're in. The second rule does. So we capture the whole line and generate a GenericOutput token. Move the cursor to the end of the match.

Oops, no more file. There, we tokenized.

Just that?

Well, no. Actions can also "push a state", which means changing the current state to something else. States are kept in a stack, so if you were in state "root" and pushed the state "foo", now the stack is "root, foo" and the current state is "foo".

Of course you can also "pop" a state, which means "go back to the previous state".

There are some other bits, such as "include", which means "pretend the rules of the other lexer are here" so we don't have to write them many times in the XML, or the fact that you can pop more than one state; whatever, the basics (sketched in code right after this list) are just:

  1. You are in a state
  2. Check rules until one matches
  3. Use that rule's actions (you may end up in another state)
  4. Collect any tokens generated
  5. Move the cursor to the end of the match
  6. Go back to 1.
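
In Crystal-ish pseudocode, that loop is roughly this (names are illustrative, not tartrazine's actual API):

pos = 0
stack = ["root"]
tokens = [] of Token

while pos < text.size
  rules = states[stack.last].rules                        # 1. you are in a state
  rule = rules.find { |r| r.match?(text, pos) } || break  # 2. check rules
  new_tokens, new_pos = rule.apply(text, pos, stack)      # 3. actions may push/pop states
  tokens.concat(new_tokens)                               # 4. collect tokens
  pos = new_pos                                           # 5. move the cursor
end                                                       # 6. go back to 1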

And that's it. That's how you write a lexer.

And then?

But suppose I wrote the lexer; how do I know if I am doing it right? I mean, I can't just run the lexer and see if it works, right?

Well, we could, if only we had a whole pile of things to tokenize and a tool that creates the tokens in a readable format!

Hey, we have those things. There is the Pygments test suite, and Chroma can output tokens as JSON!

So, let's do some extreme test-driven development! After all, I have the tests written; now I just have to pass them, right?

I wrote enough lexer to spit out tokens, wrote a test rig that compared them to Chroma's output, and started writing a lexer.

That up there is a two-day-long thread of me trying to pass tests. When I finished, over 96% of the test suite was passing, and most of the failures were arguable (I think Chroma is wrong in some cases).

So, I had written the RegexLexer. Looks like this.

That code supports 241 languages, and it's about 300 lines of simple code.

In fact, I think someone (not me) should do what I did but write this thing in C, so it can be used from any language, and both Chroma and Tartrazine are rendered obsolete.

New project: croupier

Intro to Dataflow Programming

This post is about explaining a new project, called Croupier, which is a library for dataflow programming.

What is that? It's a programming paradigm where you don't specify the sequence in which your code will execute.

Instead, you create a number of "tasks", declare how the data flows from one task to another, provide the initial data, and then the system runs as many or as few of the tasks as needed, in whatever order it deems best.

Examples

Put that way it looks scary and complex, but it's something so simple that almost every programmer has run into a tool based on this principle:

make

When you create a Makefile, you declare a number of "targets", "dependencies" and "commands" (among other things), and then when you run make a_target it's make that decides which of those commands need to run, how, and when.

Let's consider a more complex example: a static site generator.

Usually, these take a collection of markdown files with metadata such as title, date, tags, etc., and use that to produce a collection of HTML and other files that constitute a website.

Now, let's consider it from the POV of dataflow programming, with a simplified version that only takes markdown files as inputs and builds a "blog" out of them.

For each post in a file foo.md there will be a /foo.html.

But if that file has tags tag1 and tag2, then the contents of that file will affect the output files /tags/tag1.html and /tags/tag2.html.

And if one of those tags is new, then it will affect tags/index.html.

And if the post itself is new, then it will be in /index.html.

And also in an RSS feed. And in the RSS feeds for the tags!

As you can see, adding or modifying a file can trigger a cascade of changes in the site.

Which you can model as dataflow.

That's the approach used by Nikola, a static site generator I wrote. Because it's implemented as dataflow, it can build only what's needed, which in most cases is just a tiny fragment of the whole site.

That is done via doit, an awesome tool more people should know about, because a lot more people should know about dataflow programming itself.

So, what is Croupier?

It's a library for dataflow programming in the Crystal language that I am writing!

Here's an example of it in use, from the docs, which should be self-explanatory if you have a passing knowledge of Crystal or Ruby:

require "croupier"

b1 = ->{
  puts "task1 running"
  File.read("input.txt").downcase
}

Croupier::Task.new(
  name: "task1",
  output: "fileA",
  inputs: ["input.txt"],
  proc: b1
)

b2 = ->{
  puts "task2 running"
  File.read("fileA").upcase
}
Croupier::Task.new(
  name: "task2",
  output: "fileB",
  inputs: ["fileA"],
  proc: b2
)

Croupier::Task.run_tasks

Why?

Because I want to write a fast SSG in Crystal, and because dataflow programming is (to me) a fundamental tool in my toolkit.

Anything else?

I will probably also do a simple make-like tool, just as a playground for Croupier.

Learning Crystal by Implementing a Static Site Generator

What?

A while back (10 YEARS???? WTH.) I wrote a static site generator. I mean, I wrote one that is large and somewhat popular, called Nikola, but I also wrote a tiny one called Nicoletta.

Why? Because it's a nice little project and it shows the very basics of how to do a whole project.

All it does is:

  • Find markdown files
  • Build them
  • Use templates to generate HTML files
  • Put those in an output folder

And that's it; that's an SSG.

So, if I wanted a "toy" project to practice new (to me) programming languages, why not rewrite that?

And why not write about how it goes while I do it?

Hence this.

So, what's Crystal?

It's (they say) "A language for humans and computers". In short: a compiled, statically typed language with a Ruby-flavoured syntax.

And why? Again, why not?

Getting started

I installed it using curl, and that got me version 1.8.2, which is the latest at the time of writing this.

You can get your project started by running a command:

nicoletta/crystal
✦ > crystal init app nicoletta .
    create  /home/ralsina/zig/nicoletta/crystal/.gitignore
    create  /home/ralsina/zig/nicoletta/crystal/.editorconfig
    create  /home/ralsina/zig/nicoletta/crystal/LICENSE
    create  /home/ralsina/zig/nicoletta/crystal/README.md
    create  /home/ralsina/zig/nicoletta/crystal/shard.yml
    create  /home/ralsina/zig/nicoletta/crystal/src/nicoletta.cr
    create  /home/ralsina/zig/nicoletta/crystal/spec/spec_helper.cr
    create  /home/ralsina/zig/nicoletta/crystal/spec/nicoletta_spec.cr
Initialized empty Git repository in /home/ralsina/zig/nicoletta/crystal/.git/

Some maybe interesting bits:

  • It inits a git repo, with a gitignore in it
  • Sets you up with an MIT license
  • Creates a reasonable README with nice placeholders
  • We get a shard.yml with metadata
  • Source code in src/
  • spec/ seems to be for tests?

Mind you, I still have zero idea about the language :-)

This apparently compiles into a do-nothing program, which is OK. Surprised to see starship seems to support Crystal in the prompt!

crystal on  main [?] is 📦 v0.1.0 via 🔮 v1.8.2 
> crystal build src/nicoletta.cr

crystal on  main [?] is 📦 v0.1.0 via 🔮 v1.8.2 
> ls -l
total 1748
-rw-rw-r-- 1 ralsina ralsina    2085 may 31 18:15 journal.md
-rw-r--r-- 1 ralsina ralsina    1098 may 31 18:08 LICENSE
-rwxrwxr-x 1 ralsina ralsina 1762896 may 31 18:15 nicoletta*
-rw-r--r-- 1 ralsina ralsina     604 may 31 18:08 README.md
-rw-r--r-- 1 ralsina ralsina     167 may 31 18:08 shard.yml
drwxrwxr-x 2 ralsina ralsina    4096 may 31 18:08 spec/
drwxrwxr-x 2 ralsina ralsina    4096 may 31 18:08 src/

Perhaps a bit surprising that the do-nothing binary is 1.7MB, though (1.2MB stripped), but it's just 380KB in "release mode", which is nice.

Learning a Bit of Crystal

At this point I will stop and learn some syntax:

  • How to declare a variable / a literal / a constant
  • How to do an if / loop
  • How to define / call a function

Because, you know, one has to know at least that much 😁

There seems to be a decent set of tutorials at this level. Let's see how it looks.

Good thing: this is valid Crystal:

module Nicoletta
  VERSION = "0.1.0"

  😀 = "Hello world"
  puts 😀 
end

Also nice that variables can change type.

Having the docs say integers are Int32 and anything else is "for special use cases" is not great. Int32 is small.

Also not a huge fan of separate unsigned types.

I hate the "spaceship operator" <=>, which "compares its operands and returns a value that is either zero (both operands are equal), a positive value (the first operand is bigger), or a negative value (the second operand is bigger)" ... hate it.

Numbers have named methods, which is nice. However, it randomly shows some weird syntax that has not been seen before. One of these is not like the others:

p! -5.abs,   # absolute value
  4.3.round, # round to nearest integer
  5.even?,   # odd/even check
  10.gcd(16) # greatest common divisor

Or maybe the ? is just part of the method name? Who knows! Not me!

Nice string interpolation thingie.

name = "Crystal"
puts "Hello #{name}"

Why would anyone add an underscore method to strings? That's just weird.

Slices are reasonable, whatever[x..y] uses negative indexes for "from the right".
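
For example:

s = "Crystal"
puts s[0..2]   # => "Cry"
puts s[-3..-1] # => "tal" (negative indexes count from the end)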

We have truthy values: 0 is truthy; only nil, false, and null pointers are falsy. OK.

I strongly dislike using unless as a keyword instead of if with a negated condition. I consider that to be keyword proliferation and cutesy.

Methods support overloading. OK.

OK, I know just enough Crystal to be slightly dangerous. Those feel like good tutorials. Short, to the point, and they give you enough rope to ... make something with rope, or whatever.

Learning a Bit More Crystal

So: errors? Classes? Blocks? How?

Classes are pretty straightforward ... apparently they are a bit frowned upon for performance reasons, because they are heap allocated, but whatevs.

Inheritance with method overloading is not my cup of tea, but 🤷

Exceptions are pretty simple but begin / rescue / else / ensure / end? Eek.

Also, I find it confusing that variables have nil type in the ensure block.

Requiring files is not going to be a problem.

Blocks are interesting, but I am not going to try to use them yet.

Dinner Break

I will grab dinner, and then try to implement Nicoletta, somehow. I'll probably fail 😅

Implementing Nicoletta

The code for Nicoletta is not long, so this should be a piece of cake.

No need to have a main in Crystal. Things are just executed.

First, I need a way to read the configuration. It looks like this:

TITLE: "Nicoletta Test Blog"

That is technically YAML, so surely there is a Crystal thing to read it. In fact, it's in the standard library! This fragment works:

require "yaml"

VERSION = "0.1.0"

tpl_data = File.open("conf") do |file|
  YAML.parse(file)
end
p! tpl_data

And when executed it does this, which is correct:

crystal on  main [!?] is 📦 v0.1.0 via 🔮 v1.8.2 
> crystal run src/nicoletta.cr
tpl_data # => {"TITLE" => "Nicoletta Test Blog"}

Looks like what I want to store this sort of data is a Hash.

Next step: read templates and put them in a hash indexed by path.

Templates are files in templates/ which look like this:

<h2><a href="${link}">${title}</a></h2>
date: ${date}
<hr>
${text}

Of course the syntax will probably have to change, but for now I don't care.

To find all files in templates I can apparently use Dir.glob.

And I swear I wrote this almost on the first attempt:

# Load templates
templates = {} of String => String
Dir.glob("templates/*.tmpl").each do |path|
  templates[path] = File.read(path)
end

Next is iterating over all files in posts/ (which are meant to be markdown with YAML metadata on top) and doing things with them.

Iterating them is the same as before (hey, this is nice):

Dir.glob("posts/*.md").each do |path|
  # Stuff
end

But I will need a Post class and so on, so...

Here is a Post class that is initialized with a path, parses the metadata, and keeps the text:

class Post
  def initialize(path)
    contents = File.read(path)
    metadata, @text = contents.split("\n\n", 2)
    @metadata = YAML.parse(metadata)
  end
  @metadata : YAML::Any
  @text : String
end

The next step is to give that class a method to parse the markdown and convert it to HTML.

I am not implementing that, so I googled for a Crystal markdown implementation and found markd, which is sadly abandoned but looks OK.

Using it is surprisingly painless thanks to Crystal's shards dependency manager. First, I added it to shard.yml:

dependencies:
  markd:
   github: icyleaf/markd

Ran shards install:

crystal on  main [!+?] is 📦 v0.1.0 via 🔮 v1.8.2 
> shards install
Resolving dependencies
Fetching https://github.com/icyleaf/markd.git
Installing markd (0.5.0)
Writing shard.lock

Then added a require "markd", slapped this code in the Post class and that's it:

  def html
    Markd.to_html(@text)
  end

Here is the code to parse all the posts and hold them in an array:

posts = [] of Post

Dir.glob("posts/*.md").each do |path|
  posts << Post.new(path)
end

Now I need a Crystal implementation of some template language, something like handlebars; I don't need much!

The standard library has a template language called ECR, which is pretty nice, but it's compile-time, and I need this to be done at runtime. So I googled and found ... Kilt.

I will use the crustache variant, which implements the Mustache standard.

Again, added the dependency to shard.yml and ran shards install:

dependencies:
  markd:
   github: icyleaf/markd
  crustache:
   github: MakeNowJust/crustache

After some refactoring of the template code, the template loader now looks like this:

class Template
  @text : String
  @compiled : Crustache::Syntax::Template

  def initialize(path)
    @text = File.read(path)
    @compiled = Crustache.parse(@text)
  end
end

# Load templates
templates = {} of String => Template

Dir.glob("templates/*.tmpl").each do |path|
  templates[path] = Template.new(path)
end

I changed the templates from whatever they were before to mustache:

<h2><a href="{{link}}">{{title}}</a></h2>
date: {{date}}
<hr>
{{text}}

I can now implement Post.render... except that top-level variables like templates are not accessible from inside classes and that messes up my code, so it needs refactoring. So.

This sure as hell is not idiomatic Crystal, but bear with me, I am a beginner here.

This scans for all posts, then prints them rendered with the post.tmpl template:

class Post
  @metadata = {} of YAML::Any => YAML::Any
  @text : String
  @link : String
  @html : String

  def initialize(path)
    contents = File.read(path)
    metadata, @text = contents.split("\n\n", 2)
    @metadata = YAML.parse(metadata).as_h
    @link = path.split("/")[-1][0..-4] + ".html"
    @html = Markd.to_html(@text)
  end

  def render(template)
    Crustache.render template.@compiled, @metadata.merge({"link" => @link, "text" => @html})
  end
end

posts = [] of Post

Dir.glob("posts/*.md").each do |path|
  post = Post.new(path)
  posts << post
  p! post.render(templates["templates/post.tmpl"])
end

Believe it or not, this is almost done. Now I need to make it output that (passed through another template) into the right path in an output/ folder.

This almost works:

Dir.glob("posts/*.md").each do |path|
  post = Post.new(path)
  rendered_post = post.render templates["templates/post.tmpl"]
  rendered_page = Crustache.render(templates["templates/page.tmpl"].@compiled,
    tpl_data.merge({
      "content" => rendered_post,
    }))
  File.open("output/#{post.@link}", "w") do |io|
    io.puts rendered_page
  end
end

For some reason all my HTML is escaped; I think that's the template engine trying to be safe 😤

Turns out I had to use TRIPLE handlebars to print unescaped HTML, so after a small fix in the templates...
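
The fix was presumably switching the HTML-carrying variable to triple braces, which Mustache leaves unescaped:

<h2><a href="{{link}}">{{title}}</a></h2>
date: {{date}}
<hr>
{{{text}}}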

A small HTML page

So, success! It has been fun, and I quite like the language!

I published it at my git server, but here's the full source code, all 60 lines of it:

# Nicoletta, a minimal static site generator.

require "yaml"
require "markd"
require "crustache"

VERSION = "0.1.0"

# Load config file
tpl_data = File.open("conf") do |file|
  YAML.parse(file).as_h
end

class Template
  @text : String
  @compiled : Crustache::Syntax::Template

  def initialize(path)
    @text = File.read(path)
    @compiled = Crustache.parse(@text)
  end
end

# Load templates
templates = {} of String => Template

Dir.glob("templates/*.tmpl").each do |path|
  templates[path] = Template.new(path)
end

class Post
  @metadata = {} of YAML::Any => YAML::Any
  @text : String
  @link : String
  @html : String

  def initialize(path)
    contents = File.read(path)
    metadata, @text = contents.split("\n\n", 2)
    @metadata = YAML.parse(metadata).as_h
    @link = path.split("/")[-1][0..-4] + ".html"
    @html = Markd.to_html(@text)
  end

  def render(template)
    Crustache.render template.@compiled, @metadata.merge({"link" => @link, "text" => @html})
  end
end

Dir.glob("posts/*.md").each do |path|
  post = Post.new(path)
  rendered_post = post.render templates["templates/post.tmpl"]
  rendered_page = Crustache.render(templates["templates/page.tmpl"].@compiled,
    tpl_data.merge({
      "content" => rendered_post,
    }))
  File.open("output/#{post.@link}", "w") do |io|
    io.puts rendered_page
  end
end

New minisite: book covers

Since I wrote tapita to automatically create book covers, it was absurdly easy to turn it into a site where you can create book covers.

So, you can go to Covers.ralsina.me and create book covers.

Fun part: this is the whole backend for the site:

from json import loads
from tapita import Cover
from io import BytesIO
import base64


def handle(req):
    """handle a request to the function
    Args:
        req (str): request body

    {
        "title": "foo",
        "subtitle": "bar",
        "author": "bat",
    }
    """
    try:
        args = loads(req)
    except Exception:
        return "Bad Request", 400

    c = Cover(**args)
    byte_arr = BytesIO()
    c.image.save(byte_arr, format="JPEG")

    return (
        f'<img src="data:image/jpeg;base64, {base64.b64encode(byte_arr.getvalue()).decode("utf-8")}">',
        200,
        {"Content-Type": "text/html"},
    )