Channel: Encoding – The Wiert Corner – irregular stream of stuff

ext3 – How to tell the language encoding of a filename on Linux? – Server Fault

From ext3 – How to tell the language encoding of a filename on Linux? – Server Fault  [WayBack] I learned a few things:

  • filename encoding on Linux is undefined – the file system stores each name as an opaque sequence of bytes (anything except NUL and "/") and records no encoding for it
  • FTP and SFTP suffer from this as well (SFTP runs over SSH, which now prefers UTF-8 [WayBack])

A good default is UTF-8, but it’s never guaranteed.
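
A minimal sketch of checking that default, using only the Python standard library: it reads the raw bytes of each name in a directory and reports the ones that are not valid UTF-8. The helper names `is_valid_utf8` and `non_utf8_names` are made up for this illustration.

```python
import os


def is_valid_utf8(raw_name: bytes) -> bool:
    """Return True if the raw filename bytes decode as strict UTF-8."""
    try:
        raw_name.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def non_utf8_names(directory: str = ".") -> list:
    """List the raw (bytes) names in a directory that are not valid UTF-8."""
    # Passing a bytes path makes os.listdir() return the names as raw bytes,
    # exactly as the file system stores them.
    return [name for name in os.listdir(os.fsencode(directory))
            if not is_valid_utf8(name)]


if __name__ == "__main__":
    for raw in non_utf8_names("."):
        print(raw)
```

A name that passes this check is only probably UTF-8: plain ASCII names pass too, and nothing on disk says which encoding the writer intended.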

Two tools can help to determine (and fix) the encoding of a filename:

  • convmv [WayBack] converts filenames from one encoding to another
  • chardet (Python), the Universal Character Encoding Detector, guesses the encoding of a sequence of bytes
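
When neither tool is at hand, a rough manual approach is to decode the raw name with a few candidate encodings and look at which result reads sensibly. The sketch below does that with the standard library only; it is not how convmv or chardet work internally, and the function name `candidate_readings` and the default candidate list are assumptions for illustration.

```python
def candidate_readings(raw_name: bytes,
                       encodings=("utf-8", "cp1252", "iso-8859-1", "koi8-r")) -> dict:
    """Map each candidate encoding to the decoded name, skipping failures."""
    readings = {}
    for encoding in encodings:
        try:
            readings[encoding] = raw_name.decode(encoding)
        except UnicodeDecodeError:
            # These bytes are not valid in this encoding, so rule it out.
            pass
    return readings


if __name__ == "__main__":
    # Hypothetical name written by a system that used ISO-8859-1.
    raw = "café.txt".encode("iso-8859-1")
    for encoding, text in candidate_readings(raw).items():
        print(f"{encoding}: {text}")
```

Single-byte encodings such as ISO-8859-1 accept almost any byte sequence, so a successful decode proves nothing by itself; a human still has to judge which reading makes sense.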

–jeroen

