utf8proc

view lump.txt @ 7:fcfd8c836c64

Version 1.1.1

- Added a new PostgreSQL function 'unistrip', which behaves like 'unifold', but also removes all character marks (e.g. accents).
- Changed license from BSD to MIT style.
- Added a new function 'utf8proc_codepoint_valid' to the C library.
- Changed compiler flags in Makefile from -g -O0 to -O2
- The ruby script, which was used to build the utf8proc_data.c file, is now included in the distribution.
author jbe
date Sun Jul 22 12:00:00 2007 +0200 (2007-07-22)
parents 4ee0d5f54af1
children
line source
1 U+0020 <-- all space characters (general category Zs)
2 U+0027 ' <-- left/right single quotation mark U+2018..2019,
3 modifier letter apostrophe U+02BC,
4 modifier letter vertical line U+02C8
5 U+002D - <-- all dash characters (general category Pd),
6 minus U+2212
7 U+002F / <-- fraction slash U+2044,
8 division slash U+2215
9 U+003A : <-- ratio U+2236
10 U+003C < <-- single left-pointing angle quotation mark U+2039,
11 left-pointing angle bracket U+2329,
12 left angle bracket U+3008
13 U+003E > <-- single right-pointing angle quotation mark U+203A,
14 right-pointing angle bracket U+232A,
15 right angle bracket U+3009
16 U+005C \ <-- set minus U+2216
17 U+005E ^ <-- modifier letter up arrowhead U+02C4,
18 modifier letter circumflex accent U+02C6,
19 caret U+2038,
20 up arrowhead U+2303
21 U+005F _ <-- all connector characters (general category Pc),
22 modifier letter low macron U+02CD
23 U+0060 ` <-- modifier letter grave accent U+02CB
24 U+007C | <-- divides U+2223
25 U+007E ~ <-- tilde operator U+223C

Impressum / About Us