SWI prolog, char_type, ascii / alnum, why so many chars? how to fix it?

Question

I've just wanted to check, what chars SWI-prolog treats as 'alnum'. My question clause was:

    findall(X,char_type(X,alnum),Lalnum),length(Lalnum,N).

and the SWI's answer:

    Lalnum = ['0', '1', '2', '3', '4', '5', '6', '7', '8'|...],
    N = 816459.

I was very surprised - why so many? Then I've decided to check pure 'ascii' set - after all, according to the doc page:

    http://www.swi-prolog.org/pldoc/doc_for?object=char_type/2

there are only 128 of them (7 bit char set). My obvious question was:

     findall(X,char_type(X,ascii),Lascii),length(Lascii,N).

and the SWI's answer:

    Lascii = ['\000\', '\001\', '\002\', '\003\', '\004\', 
    '\005\', '\006\', '\a', '\b'|...], 
    N = 2176.

I was surprised even more than before... What is wrong? Where is the problem? With my question? With my SWI-prolog installation? With my system? It is:

SWI Prolog 7.7.13, with ascii encoding:

    current_prolog_flag(encoding,X).
    X = ascii.

Win 8.1 64bit, with code page 852.

And how to fix it?

Thank you in advance

EDIT: probably I've found the answer to my second question: 'how to fix it'. It seems, that additional clause:

    sort(Lascii,SortedLascii)

removes repetitions and leaves the basic set of 128 chars alone.

but I still do not understand why the first clause generates so many results...???

SWI prolog, char_type, ascii / alnum, why so many chars? how to fix it?

Answers (1)

Including Unicode:

Now ASCII only:

What's causing the confusion?

Related Questions