unicode Archives - Page 9 of 11

Python ascii utf unicode

August 12, 2022 by Magenaut

When I parse this XML with p = xml.parsers.expat.ParserCreate():

Get unicode code point of a character using Python

August 11, 2022 by Magenaut

In Python API, is there a way to extract the unicode code point of a single character?

How to replace unicode characters in string with something else python?

August 11, 2022 by Magenaut

I have a string that I got from reading a HTML webpage with bullets that have a symbol like “•” because of the bulleted list. Note that the text is an HTML source from a webpage using Python 2.7’s urllib2.read(webaddress).

Awesome symbols and characters in a bash prompt

August 11, 2022 by Magenaut

I just ran across a screenshot of someone’s terminal:

How can I remove the BOM from a UTF-8 file?

August 11, 2022 by Magenaut

I have a file in UTF-8 encoding with BOM and want to remove the BOM. Are there any linux command-line tools to remove the BOM from the file?

sed Match character range

August 10, 2022 by Magenaut

Is there a way to match some Unicode range exactly.
Let’s use the Cyrillic range as an example: U+400 to U+52f

In bash, how can I convert a Unicode Codepoint [0-9A-F] into a printable character?

August 10, 2022 by Magenaut

I have a list of Unicode codepoints, but I don’t know of a “simple” way to convert these hex values into the actual characters they represent…

How to determine the character encoding that a terminal uses in a C/C++ program?

August 10, 2022 by Magenaut

I’ve noticed that SyncTERM uses a different character encoding than the default MacOS terminal emulator, and they’re incompatible with one another. For example, say you want to print a block character in a format string. In SyncTERM, which uses the IBM Extended ASCII character encoding, you would use an octal escape sequence like 261. In Terminal.app (and probably iTerm2 as well), this just prints a question mark. Since these terminals use UTF-8, you need to use the uxxxx escape sequence.

copy filenames with special characters to an external ntfs volume

August 10, 2022 by Magenaut

I’m trying to copy the Documents and Settings folder of a Windows XP system over to an NTFS external disk using a USB Live of Puppy Linux.

Non-ASCII printable characters in sshd banner

August 9, 2022 by Magenaut

It’s possible to configure a banner for sshd that is to be displayed as a connection is opened, via Banner /etc/motd.ssh in sshd_config. Note that this is displayed before the authentication occurs, and even when an interactive shell is not launched (e.g. via scp).