depot/third_party/tvl/users/Profpatsch/blog/posts/2017-05-04-ligature-emulation-in-emacs.md
Default email a291c8690a Project import generated by Copybara.
GitOrigin-RevId: e6e19f3d81a982a62e1bba08f0b4f7fdc21b4ea0
2022-05-19 16:39:52 +02:00

5.1 KiB
Raw Blame History

title: Ligature Emulation in Emacs date: 2017-05-04

Monday was (yet another) NixOS hackathon at OpenLab Augsburg. Maximilian was there and to my amazement he got working ligatures in his Haskell files in Emacs! Ever since Hasklig updated its format to use ligatures and private Unicode code points a while ago, the hack I had used in my config stopped working.

Encouraged by that I decided to take a look on Tuesday. Long story short, I was able to get it working in a pretty satisfying way.

Whats left to do is package it into a module and push to melpa.

elisp still sucks, but its bearable, sometimes

Im the kind of person who, when trying to fix something elisp related, normally gives up two hours later and three macro calls deep. Yes, homoiconic, non-lexically-scoped, self-rewriting code is not exactly my fetish. This time the task and the library (prettify-symbols-mode) were simple enough for that to not happen.

Some interesting technical trivia:

  • elisp literal character syntax is ?c. ?\t is the tab character
  • You join characters by (string c1 c2 c3 ...)
  • dash.el is pretty awesome and does what a functional programmer expects. Also, Rainbow Dash.
  • Hasklig and FiraCode multi-column symbols actually only occupy one column, on the far right of the glyph. my-correct-symbol-bounds fixes emacs rendering in that case.

Appendix A

For reference, heres the complete code as it stands now. Feel free to paste into your config; lets make it MIT. Maybe link to this site, in case there are updates.

 (defun my-correct-symbol-bounds (pretty-alist)
    "Prepend a TAB character to each symbol in this alist,
this way compose-region called by prettify-symbols-mode
will use the correct width of the symbols
instead of the width measured by char-width."
    (mapcar (lambda (el)
              (setcdr el (string ?\t (cdr el)))
              el)
            pretty-alist))

  (defun my-ligature-list (ligatures codepoint-start)
    "Create an alist of strings to replace with
codepoints starting from codepoint-start."
    (let ((codepoints (-iterate '1+ codepoint-start (length ligatures))))
      (-zip-pair ligatures codepoints)))

  ; list can be found at https://github.com/i-tu/Hasklig/blob/master/GlyphOrderAndAliasDB#L1588
  (setq my-hasklig-ligatures
    (let* ((ligs '("&&" "***" "*>" "\\\\" "||" "|>" "::"
                   "==" "===" "==>" "=>" "=<<" "!!" ">>"
                   ">>=" ">>>" ">>-" ">-" "->" "-<" "-<<"
                   "<*" "<*>" "<|" "<|>" "<$>" "<>" "<-"
                   "<<" "<<<" "<+>" ".." "..." "++" "+++"
                   "/=" ":::" ">=>" "->>" "<=>" "<=<" "<->")))
      (my-correct-symbol-bounds (my-ligature-list ligs #Xe100))))

  ;; nice glyphs for haskell with hasklig
  (defun my-set-hasklig-ligatures ()
    "Add hasklig ligatures for use with prettify-symbols-mode."
    (setq prettify-symbols-alist
          (append my-hasklig-ligatures prettify-symbols-alist))
    (prettify-symbols-mode))

  (add-hook 'haskell-mode-hook 'my-set-hasklig-ligatures)

Appendix B (Update 1): FiraCode integration

I also created a mapping for FiraCode. You need to grab the additional symbol font that adds (most) ligatures to the unicode private use area. Consult your system documentation on how to add it to your font cache. Next add "Fira Code" and "Fira Code Symbol" to your font preferences. Symbol only contains the additional characters, so you need both.

If you are on NixOS, the font package should be on the main branch shortly, I added a package.

Heres the mapping adjusted for FiraCode:

  (setq my-fira-code-ligatures
    (let* ((ligs '("www" "**" "***" "**/" "*>" "*/" "\\\\" "\\\\\\"
                  "{-" "[]" "::" ":::" ":=" "!!" "!=" "!==" "-}"
                  "--" "---" "-->" "->" "->>" "-<" "-<<" "-~"
                  "#{" "#[" "##" "###" "####" "#(" "#?" "#_" "#_("
                  ".-" ".=" ".." "..<" "..." "?=" "??" ";;" "/*"
                  "/**" "/=" "/==" "/>" "//" "///" "&&" "||" "||="
                  "|=" "|>" "^=" "$>" "++" "+++" "+>" "=:=" "=="
                  "===" "==>" "=>" "=>>" "<=" "=<<" "=/=" ">-" ">="
                  ">=>" ">>" ">>-" ">>=" ">>>" "<*" "<*>" "<|" "<|>"
                  "<$" "<$>" "<!--" "<-" "<--" "<->" "<+" "<+>" "<="
                  "<==" "<=>" "<=<" "<>" "<<" "<<-" "<<=" "<<<" "<~"
                  "<~~" "</" "</>" "~@" "~-" "~=" "~>" "~~" "~~>" "%%"
                  "x" ":" "+" "+" "*")))
      (my-correct-symbol-bounds (my-ligature-list ligs #Xe100))))