UTF-16 to UTF-8 without allocation

torhu

I'm looking into doing a Unicode build of Allegro 5.1 on Windows. To support both ANSI and UNICODE builds, I'm thinking about adding this function:

#ifdef UNICODE
   char *_al_win_strconv(char *dest, const TCHAR *src, size_t destsize);
#else
   #define _al_win_strconv _al_sane_strncpy
#endif

In unicode builds it would convert from utf-16 to utf-8. Does Allegro have internal functions that I can base it on? Or does anyone have this and would be willing to contribute it to Allegro? I'm lazy today

Elias

You probably could use al_ustr_new_from_utf16 followed by al_ustr_to_buffer and al_ustr_free.

billyquith

Allegro is UTF-8 internally, so it is probably best to keep all text as UTF-8, especially if you only want to display non-ASCII characters on the screen (i.e. you don't need to use any other Windows unicode functions). You don't need a Unicode build of your game to display non-ASCII.

The Windows "Unicode" compile mode is to change from narrow/multi-byte to wide (UTF-16) chars, and is there for historical reasons (UCS-2). It is a misleading name; both modes support Unicode. They should be called "multi-byte" and "wide".

See: http://www.utf8everywhere.org

I changed (a fork of) GWEN over to using UTF-8 everywhere, which simplifies using text, and makes Allegro 5 text rendering more efficient (i.e. nothing needs doing). It only needs to be converted to wide for Windows, on the fly, which has a very low overhead. Before it was doing it the other way round, everything was wide and converted to narrow on the fly, but UTF-8 everywhere is a better plan.

If you'd like the code to "widen" and "narrow" the unicode strings, it is here: https://github.com/billyquith/GWEN/blob/gwork/gwen/src/Utility.cpp

I'm using the C++11 locale API. There are other alternatives.

My work is in the "gwork" branch. If you are using Allegro I'd suggest using this branch/fork of GWEN as it has better Allegro support.

torhu

Elias said:

You probably could use al_ustr_new_from_utf16 followed by al_ustr_to_buffer and al_ustr_free.

I basically made a version of al_ustr_new_from_utf16 that takes a buffer and writes the output into that.

billyquith said:

This is about the Windows-specific part of the library, not a game...

Peter Wang

Bit late, but see also src/win/wunicode.c (though they allocate).

torhu

Yeah, I put my new function into that file.

Seems like I will be too busy with my studies to get much done the next few months, though

Thread #613172. Printed from Allegro.cc