Chapter 9 Encoding issues
- Character encoding: “used to represent a repertoire of characters by some kind of encoding system”
- Encoding in RStudio
- Tools -> Global Options -> Code -> Saving -> Default text encoding: UTF-8
- Encoding issues can be hugely annoying
- Use the readr package (e.g. read_csv()) to import data (fewer problems!)
- Relevance: Unstructured data may come with different encodings
- See below: UTF-8 is spreading but not everywhere yet…
Source: Wikipedia
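
The readr advice above can be tried out with a small sketch. It assumes the readr package is installed. The file contents, the temporary path and the object names (csv_text, path, cities) are made up for illustration. The example writes a tiny CSV in Latin-1 to mimic a non-UTF-8 source, then tells read_csv() which encoding to expect via locale().

```r
library(readr)

# Hypothetical example data with non-ASCII characters.
# The \u escapes guarantee that R stores these strings as UTF-8.
csv_text <- "city,country\nZ\u00fcrich,Schweiz\nM\u00e1laga,Espa\u00f1a\n"

# Write the text as Latin-1 (ISO-8859-1) bytes to mimic a non-UTF-8 file
path <- tempfile(fileext = ".csv")
latin1_bytes <- iconv(csv_text, from = "UTF-8", to = "latin1", toRaw = TRUE)[[1]]
writeBin(latin1_bytes, path)

# readr assumes UTF-8 by default, so declare the actual encoding explicitly
cities <- read_csv(
  path,
  locale = locale(encoding = "latin1"),
  col_types = cols(.default = col_character())
)

print(cities)
```

If the encoding of a file is unknown, readr also offers guess_encoding(), which returns candidate encodings with a confidence score. Treat its output as a hint rather than a guarantee, especially for short files.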