I17n in character sets and home media

Software

Internationalisation is one of those things you don’t need to think about when it’s done well, and becomes painfully obvious when it isn’t.

Clara’s and my home media server runs a combination of MusikCube, Plex, and some simple Netatalk and Samba shares for exporting content to our various devices. I didn’t realise just how much of the CJK spectrum we use:

  • Hong Kong cinema, which uses traditional Chinese
  • Singaporean drama, which uses simplified Chiniese
  • Anime and live-action Japanese shows and movies
  • Korean movies

The underlying FreeBSD OS and storage pools are all configured for UTF-8 and Unicode Form D normalisation, so I haven’t run into any issues. I think this is worth calling out and appreciating.

It doesn’t seem that long ago where even having reliable Hangul on an English/Latin character set machine was a dicey proposition, with plenty of question marks and squares if you attempted to list directories containing them. Every part of the chain has to gracefully handle UTF and i17n, from the OS and file system, to the application and networking, or the entire framework collapses.

Author bio and support

Me!

Ruben Schade is a technical writer and IaaS engineer in Sydney, Australia who refers to himself in the third person in bios. Wait, not BIOS… my brain should be EFI by now.

The site is powered by Hugo, FreeBSD, and OpenZFS on OrionVM, everyone’s favourite cloud infrastructure provider.

If you found this post helpful or entertaining, you can shout me a coffee or buy some silly merch. Thanks!