Unicode, Charsets, Strings, and Binaries | Marc Sugiyama | Code BEAM V

Conference: Code BEAM V 2020

Year: 2020

This video was recorded at Code BEAM V 2020 - https://codesync.global/conferences/code-beam-sto/

Unicode, Charsets, Strings, and Binaries | Marc Sugiyama - Software Engineer @ Datometry

ABSTRACT

Writing global software means our programs need to speak global human languages, but writing programs that work correctly with non-Western European languages is at best a confusing affair. UTF8, latin1, Unicode? What do these terms mean and how are they related to one another? And what does Erlang do? This talk demystifies the terminology around character encoding, explains how to retrofit your Erlang program for Unicode using Datometry HyperQ as a case study, and gives some best practices to help you break the one-byte/one-character assumption.

THIS TALK IN THREE WORDS

Character sets
Character encoding
Clarity

OBJECTIVES

Demystify terminology around character sets and character set encoding.
Provide best practices to avoid common pitfalls.