Oniguruma

(c) K.Kosako, updated at: 2007/01/09

What's new

Oniguruma is a regular expressions library.
The characteristics of this library is that different character encoding
for every regular expression object can be specified.
(supported APIs: GNU regex, POSIX and Oniguruma native)

Supported character encodings:
ASCII, UTF-8, UTF-16BE, UTF-16LE, UTF-32BE, UTF-32LE,
EUC-JP, EUC-TW, EUC-KR, EUC-CN,
Shift_JIS, Big5, GB18030, KOI8-R, CP1251,
ISO-8859-1, ISO-8859-2, ISO-8859-3, ISO-8859-4, ISO-8859-5,
ISO-8859-6, ISO-8859-7, ISO-8859-8, ISO-8859-9, ISO-8859-10,
ISO-8859-11, ISO-8859-13, ISO-8859-14, ISO-8859-15, ISO-8859-16
(GB18030 encoding was contributed by KUBO Takehiro)
(CP1251 encoding was contributed by Byte)


License:
BSD license.
Platform:
  • Unix (include Mac OS X)
  • Cygwin
  • Win32

Download:
* 5.x supports Unicode Porperty/Script.
* 5.x is unrelated to Ruby. Don't use 5.x as a built-in regexp of Ruby.
* 4.x supports Ruby1.9.
* 2.x supports Ruby1.6/1.8. (Maintenance was finished by the end of 2006.)

Documents: (version 5.5.2)
Sample Programs:
Links:
References:

and I'm thankful to Akinori MUSHA.


Other Libraries:

Back to Home