printf("%s", name);@piefed.blahaj.zone to

C Programming Language@programming.devEnglish · 6 days ago

Why does the compiler understand ASCII by default?

7

Why does the compiler understand ASCII by default?

printf("%s", name);@piefed.blahaj.zone to

C Programming Language@programming.devEnglish · 6 days ago

Environment\ Compiler Clang on Termux on Samsung’s One UI 7 on a Samsung Galaxy Tab A9+

The title is an assumption on it’s own, so feel free to correct it/me!

I was experimenting with sanitizing user input, read to a character array with fgets. Specifically, I was trying to have a for loop remove (skip) certain input. Here is the code:

for (n = strlen(input) - 1; n >= 0; n--) { if (input[n] >= 0x30 && input[n] <= 0x39 || input[n] == ' ' || input[n] == '\t') { input[n] = 0x18; } }

While the program does behave as I want it to, I don’t understand why it seemlingy by default understands that the various hex codes refer to the character encoding as per the ASCII table. I cated my tablet’s filesystem encoding at /sys/fs/f2fs/dm-44/encoding, which yielded UTF-8. If I understand it correctly, the first 128 code points of Unicode are the same as ASCII’s. But according to this article on Wikipedia, there are no hexadecimal references in Unicode, only octal and decimal.

If the underlying filesystem uses UTF-8, and Unicode code points are not referred to by hex, how then does my compiler (Clang) understand what ASCII code points I’m referring to?

Is there some conversion going on under the hood that I am not aware of? I did find a libxml2/libxml/encoding.h, which contains comments about some conversion to and from UTF-8. Is this it? I can’t make head or tails of it because of my limited C knowledge…

Chat

kubica@fedia.io
link
fedilink
arrow-up
3·
6 days ago
Maybe this is a bit too much if you are just starting but read the first paragraph at least and then also look at “Relationship to ASCII” and “Relationship to Unicode”

https://en.wikipedia.org/wiki/Code_page
- printf("%s", name);@piefed.blahaj.zoneOP
  link
  fedilink
  English
  arrow-up
  1·
  6 days ago
  Thanks! Every piece of advice is appreciated! :D

C Programming Language@programming.dev

c_lang@programming.dev

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !c_lang@programming.dev

Welcome to the C community!

C is quirky, flawed, and an enormous success.
… When I read commentary about suggestions for where C should go, I often think back and give thanks that it wasn’t developed under the advice of a worldwide crowd.
… The only way to learn a new programming language is by writing programs in it.

© Dennis Ritchie

irc: #c

🌐 https://en.cppreference.com/w/c

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
37 users / week
70 users / month
159 users / 6 months
1 local subscriber
1.31K subscribers
81 Posts
150 Comments
Modlog