I'm seeming to have a hard time getting Aspell configured to use UTF-8 on Kubuntu 24.04.
``` $ aspell check foo.org Error: /home/dotancohen/.aspell.en.pws: The word "Bahá" is invalid. The character '¡' (U+A1) may not appear at the end of a word.
$ file ~/.aspell.en.pws /home/dotancohen/.aspell.en.pws: Unicode text, UTF-8 text ```
The line containing Bahá is clearly UTF-8, with a LATIN SMALL LETTER A WITH ACUTE (UTF-8 Hex 0xC3 0xA1):
00000000 42 61 68 c3 a1 0a |Bah...|
Even though my locale specifies UTF-8, I tried adding UTF-8 to the Aspell config file, but that did not help:
``` $ locale | grep LANG LANG=en_IL.UTF-8 LANGUAGE=
$ grep -i utf /etc/aspell.conf encoding = UTF-8 ```
I even tried removing the ~/.aspell.en.pws file and adding the word manually:
``` $ rm ~/.aspell.en.pws $ aspell -a @(#) International Ispell Version 3.1.20 (but really Aspell 0.60.8.1) Bahá & Bahá 25 0: Bahia, Aha, Baa, Bah, Baht, Baha'i, Bash, Baal, Bach, Baas, Bath, Blah, BA, Ba, Bahama, Ha, Behan, Brahe, BIA, Bar, Bay, Boa, Bahia's, Baa's, Baha'i's
& Bahá Error: The word "bah?" is invalid. The character '?' (U+3F) may not appear at the end of a word. ```
Then with explicit UTF-8 encoding:
``` $ aspell --encoding=utf-8 -a @(#) International Ispell Version 3.1.20 (but really Aspell 0.60.8.1) Bahá & Bahá 25 0: Bahia, Aha, Baa, Bah, Baht, Baha'i, Bash, Baal, Bach, Baas, Bath, Blah, BA, Ba, Bahama, Ha, Behan, Brahe, BIA, Bar, Bay, Boa, Bahia's, Baa's, Baha'i's
& Bahá Error: The word "bah?" is invalid. The character '?' (U+3F) may not appear at the end of a word. ```
I'm all out of ideas. What else should I be checking or configuring?
submitted by /u/dotancohen
[link] [comments]