summaryrefslogtreecommitdiff
path: root/inc/utf8.php
Commit message (Collapse)AuthorAge
...
* code to remove bad UTF-8 bytes addedAndreas Gohr2006-08-26
| | | | | | | This adds code to remove or replace invalid UTF-8 bytes and uses it in the ft_snippets function. darcs-hash:20060826082919-7ad00-a94004de159ae93ff5b7270fd3e631ff467233cd.gz
* cleanID unit testschris2006-08-09
| | | | | | | + fix missing utf8 deaccent character mapping + set utf-8 charset for HTMLReporter (unit tests) darcs-hash:20060809160209-9b6ab-26c80a4830643b9790536f6d3a4adee0f451e4f0.gz
* make sure UTF8 lookup tables are always globalAndreas Gohr2006-06-11
| | | | darcs-hash:20060611173240-7ad00-9bc775163fd9fc65917ffe10f78f872a302bdbcf.gz
* fix for utf8_strpos #827Andreas Gohr2006-06-08
| | | | darcs-hash:20060608200438-7ad00-05fbb18c15df64725ca4ef1ffdc0aa817a508ea4.gz
* more efficient mb_string checking in utf8.phpAndreas Gohr2006-04-03
| | | | darcs-hash:20060403194930-7ad00-499940017f74cfe297f2aa4e65d441243f8572a1.gz
* better utf8_substr functionAndreas Gohr2006-04-03
| | | | darcs-hash:20060403192537-7ad00-72b129ce494066bce491821a0396db7576873ec2.gz
* nicer russian romanizationDenis Simakov2006-03-03
| | | | darcs-hash:20060303032557-3c565-36015a29e83f000f0a23d8ea039c954766c1223e.gz
* hebrew romanization fixDenis Simakov2006-03-03
| | | | darcs-hash:20060303031656-3c565-2458122a2481ea3acfbf772e4faae883808cbf71.gz
* fixes for utf-8 to/from unicode conversionAndreas Gohr2006-02-21
| | | | | | | | The functions utf8_to unicode and unicode_to_utf8 didn't work correctly with some 3 and 4 byte strings. This exchanges those functions against two more sophisticated ones. It also adds unit testing for them. darcs-hash:20060221212605-7ad00-7bfefe8c9615d5a7f3b33c279ce79d4200d4778c.gz
* file cleanupsAndreas Gohr2006-02-17
| | | | | | | | | | This patch cleans up the source code to satisfy the coding guidelines (see http://wiki.splitbrain.org/wiki:development#coding_style) It converts files to UNIX lineendings and removes tabs and trailing whitespace. Not all files were cleaned yet. darcs-hash:20060217222040-7ad00-bba3d2bee3b5aa7cbb5184258abd50805cd071bf.gz
* romanization support in utf8 libraryAndreas Gohr2006-02-10
| | | | | | | | | | | | | | | | | This patch addes basic romanization support to the utf-8 library. It converts non-latin languages to ASCII. The transliteration tables used where gathered from various places on the net. I do not speak any of those languages so I can't say how good they are. Any recommendations and fixes are welcome! This can be enabled for ID cleaning by setting the deaccent option to 2. It is also used in the XHTML renderer to generate section ids based on the header titles. Leading digits and any remaining non-ASCII chars are removed as well. This is the first step to make section ID always XHTML compatible. Making sure they are unique is not implemented yet. darcs-hash:20060210200627-7ad00-61a633563bb92a00ef4a3f699d73117139cbf367.gz
* Wildcardsearch added #552 #632Andreas Gohr2005-11-27
| | | | | | | | | | | Now searching for word parts is possible by adding or prepending a * character to the searchword: 'foo*' searches for words beginning with 'foo' eg. 'foobar' '*foo' looks for words ending in 'foo' eg. 'barfoo' '*foo*' gets anything with 'foo' in it eg. 'barfoobaz' darcs-hash:20051127180723-7ad00-1eb29e812ddaf38d9812697bb1cffffe9a5fb330.gz
* fixed two bugs reported by Timo FalkAndreas Gohr2005-11-08
| | | | darcs-hash:20051108195013-7ad00-f2b2b6d955e64fe2d2abfc1d9421ed619f70abd3.gz
* get_tests_running_1hfuecks2005-11-07
| | | | darcs-hash:20051106233800-e96b6-90328f1b6cc5c8b25c6e02f113cf80475789d297.gz
* removed mb_string requirement in JSON.php #592Andreas Gohr2005-10-17
| | | | darcs-hash:20051017200431-7ad00-c9ed74326a44b078cc516e08515375d9d0fba811.gz
* fixed $UTF8_UPPER_ACCENTS arrayAndreas Gohr2005-10-06
| | | | darcs-hash:20051006175402-7ad00-f0bccfd8a14f231aab7c15a571628f9320e4fead.gz
* small fix for utf8.phpAndreas Gohr2005-09-18
| | | | | | utf8_strtoupper called mb_strtolower darcs-hash:20050918094308-7ad00-c595455bcb6e312f29c0d27fa83d0905cd2303cc.gz
* a first step for search indexing - nothing to see yetAndreas Gohr2005-08-07
| | | | darcs-hash:20050807203322-7ad00-6db6733f8fcd861366856635ba3d205fd3bb54da.gz
* relicensed utf8 lib under LGPL to make it available to a broader audienceandi2005-06-28
| | | | darcs-hash:20050628201920-9977f-91d30cb4391f49979415edf31b798e31177d76c2.gz
* spellchecker fixes for Konqerorandi2005-06-11
| | | | | | Konqeror seems to ignore the charset darcs-hash:20050611092916-9977f-311b99e63b66a83c9f9022c468ba61d0687822c2.gz
* AJAX spellchecker #29andi2005-06-07
| | | | | | | | | | | | | | | | | | | | This is nearly a complete rewrite of the gmail like AJAX spellchecker from http://www.broken-notebook.com/spell_checker/index.php Here are the differences and features * seemless integrated into DokuWiki * no need for the pspell extension * needs GNU aspell installed (not sure about the version I guess 0.60+ for UTF8) * needs PHP 4.3.0+ * uses SACK for AJAX * gets errors and suggestions in one transfer So far only tested in Firefox. It should work in IE, Safari and Opera 8, too. Please test and report back. darcs-hash:20050607194456-9977f-f699144d1fd28359742b2ce0f28c839a1f4cefbb.gz
* fixed problem with utf8_substr and feed generation #340andi2005-05-20
| | | | darcs-hash:20050520203448-9977f-4381d98899e730a301c655b5aae59e83b5580a88.gz
* performance enhancement for utf8_stripspecialsandi2005-04-29
| | | | darcs-hash:20050429211602-9977f-70bda6c598e239d227663a6c2ff1c19fbf069bb8.gz
* optimized count calls in for loopsandi2005-04-27
| | | | darcs-hash:20050427182243-9977f-4c1ec6ce278925147379430540eec723d109444a.gz
* removed closing php tag for includesandi2005-04-13
| | | | darcs-hash:20050413175338-9977f-c969db6f86df5df175db9a39b4fabb9f072f8f67.gz
* UTF8 performance enhancementsandi2005-03-19
| | | | darcs-hash:20050319141940-9977f-480f5b4d61060e1c36774859677671cb7cb32e4b.gz
* better mailheader handling #168andi2005-02-27
| | | | darcs-hash:20050227094102-9977f-a481ce76ccdb40f8eaa762a332b93093de8f7a87.gz
* fix for #164andi2005-02-25
| | | | darcs-hash:20050225164823-9977f-c45bc34160b06a1bc1aa41c1f238daaeaae4c7ff.gz
* added init.php - may have broken something! (related to #153)andi2005-02-19
| | | | darcs-hash:20050219102055-9977f-575d654e742934c911ffab855d82aa91f198b5cf.gz
* utf8_stripspecials added (related to #152)andi2005-02-16
| | | | darcs-hash:20050216201830-9977f-57c89f9f77c9ee598af884efac9fb62d3b977961.gz
* utf8 safe mailfunctions added (fixes #82)andi2005-01-27
| | | | darcs-hash:20050127211411-9977f-77f25921e855969dd8b7d4ded732bfdba5aec7ab.gz
* more deaccentingandi2005-01-25
| | | | darcs-hash:20050125184855-9977f-bcb8c1e22c8814d1d9b2c65a5673dae5ef7d86e4.gz
* safe mode for utf8_encodeFNandi2005-01-23
| | | | darcs-hash:20050123182753-9977f-9226687e01f083dbeea7a205db30a6d74ed2eb5f.gz
* utf8 substr replacementandi2005-01-23
| | | | darcs-hash:20050123163344-9977f-38c3f78a3b76ab0e7f181698cb79711b5d3469c1.gz
* utf8 replacements for strpos and strlenandi2005-01-23
| | | | darcs-hash:20050123155239-9977f-2ddc1e19ccf48579c71382e8933166a86ee750a4.gz
* utf8_checkandi2005-01-23
| | | | darcs-hash:20050123113432-9977f-c9ffd2cf315325fa6abc04323e767875d0f11d84.gz
* Unicode filenames with URL encodingandi2005-01-23
| | | | darcs-hash:20050123102905-9977f-1065a1112bfd47caed0f198b94e5226c81351b64.gz
* utf8 library addedandi2005-01-17
darcs-hash:20050117191842-9977f-301d0572959d7baf3e8ef8b29d7bfe6159c6b574.gz