path: root/inc/indexer.php
Commit message | Author | Age
* ignore regexp failures when handling asian chars | Andreas Gohr | 2005-10-09
  The new handling of Asian characters as single words needs a recent PCRE library (PHP 4.3.10 is known to work). If this support isn't available, the regexp compilation will fail. This patch adds a workaround that ignores the failure, which means the search will not work as expected with Asian words on older PHP versions.
  darcs-hash:20051009124833-7ad00-1319829be5cb73246e13eb65e4c950d43c6ce5bf.gz
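The workaround described in this entry can be illustrated with a minimal sketch. It is written in Python rather than PHP, and it is not the patch itself: the function name `split_asian_chars` and the idea of passing the pattern in as a string are assumptions made for illustration. The point is only the shape of the fallback: if the pattern cannot be compiled, the text is passed through unchanged instead of raising an error.

```python
import re


def split_asian_chars(text, pattern):
    """Surround every match of `pattern` with spaces so it becomes its own word.

    Hypothetical helper: if the regex engine cannot compile `pattern`,
    the failure is ignored and the text is returned untouched.
    """
    try:
        regex = re.compile(pattern)
    except re.error:
        # Degraded mode: no per-character splitting is done.
        return text
    return regex.sub(lambda m: f" {m.group(0)} ", text)
```

With this shape, indexing keeps working on an engine that rejects the pattern, at the cost that Asian text is no longer split into single-character words there.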
* asian language support for the indexer #563 | Andreas Gohr | 2005-09-25
  Asian languages do not use spaces to separate words. The indexer, however, does a word-based lookup. Splitting, for example, Japanese texts into real words is only possible with complicated natural language processing, something completely out of scope for DokuWiki. This patch solves the problem by treating all Asian characters as single words. When an Asian word (consisting of multiple characters) is searched, it is treated as a phrase search: each character is looked up by itself first, then the found documents are checked for the phrase.
  darcs-hash:20050925175451-7ad00-933b33b51b5f2fa05e736c18b8db58a5fdbf41ce.gz
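The approach in this entry (one index word per Asian character, then a phrase check on the candidate pages) can be sketched roughly as follows. This is a Python illustration under stated assumptions, not DokuWiki's code: the character ranges in `ASIAN`, the in-memory `dict` index, and the names `tokenize`, `build_index` and `search_phrase` are all hypothetical.

```python
import re

# Assumed ranges (Hiragana/Katakana, CJK ideographs, Hangul syllables);
# the ranges used by the actual patch are not stated in the log.
ASIAN = r"\u3040-\u30ff\u4e00-\u9fff\uac00-\ud7af"

# Either a single Asian character, or a run of non-Asian word characters.
TOKEN_RE = re.compile(rf"[{ASIAN}]|[^\W{ASIAN}]+")


def tokenize(text):
    """Split text into index words; every Asian character is its own word."""
    return [tok.lower() for tok in TOKEN_RE.findall(text)]


def build_index(pages):
    """Map each word to the set of page ids containing it."""
    index = {}
    for page_id, text in pages.items():
        for tok in tokenize(text):
            index.setdefault(tok, set()).add(page_id)
    return index


def search_phrase(index, pages, query):
    """Look up each word of the query, then keep pages containing the phrase."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    # Step 1: pages that contain every single word (character) of the query.
    candidates = set.intersection(*(index.get(t, set()) for t in tokens))
    # Step 2: verify that the characters actually appear as one phrase.
    needle = query.lower()
    return {p for p in candidates if needle in pages[p].lower()}
```

The second step matters because a page can contain all characters of a word without containing the word: a page with 都 and 京 in separate places is a candidate for 京都 after step 1 but is dropped by the phrase check.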
* backlinkfix for pages with special characters #548 | Andreas Gohr | 2005-09-21
  darcs-hash:20050921195118-7ad00-9070166cbaa26e3f27f7b92382346a70f5c479a1.gz
* more efficient changelog reading for recent changes | Andreas Gohr | 2005-09-18
  getRecents now reads the changelog backwards in 4KB chunks instead of loading the whole file into an array and rsorting it. This should be more memory efficient (and probably faster) for large changelogs. Note: the format of the array returned by getRecents changed slightly; plugins relying on it need to be adjusted. Sorry.
  darcs-hash:20050918121008-7ad00-1fdba47d29b0c038c6e4e4edc1d4c93e5ba769e9.gz
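Reading a log backwards in fixed-size chunks, as this entry describes, might look like the sketch below. It is a Python illustration, not the PHP implementation of getRecents: the function names, the UTF-8 decoding and the one-entry-per-line assumption are mine; only the 4KB chunk size comes from the commit message.

```python
import itertools
import os


def read_lines_backwards(path, chunk_size=4096):
    """Yield the non-empty lines of a file from last to first.

    Only `chunk_size` bytes plus one partial line are held in memory at a time.
    """
    with open(path, "rb") as fh:
        fh.seek(0, os.SEEK_END)
        pos = fh.tell()
        tail = b""  # start of a line whose beginning lies in an earlier chunk
        while pos > 0:
            size = min(chunk_size, pos)
            pos -= size
            fh.seek(pos)
            lines = (fh.read(size) + tail).split(b"\n")
            # The first piece may be incomplete; keep it for the next round.
            tail = lines.pop(0)
            for line in reversed(lines):
                if line:
                    yield line.decode("utf-8")
        if tail:
            yield tail.decode("utf-8")


def recent_lines(path, count):
    """Return the newest `count` lines without reading the whole file."""
    return list(itertools.islice(read_lines_backwards(path), count))
```

Because the generator stops as soon as enough lines were produced, asking for the last few entries of a large changelog touches only the final chunk or two of the file.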
* fixed stupid bug in indexer | Andreas Gohr | 2005-09-12
  There was a stupid bug in the indexer which prevented new words from being added (only non-ASCII words were added).
  darcs-hash:20050912145813-7ad00-4351dbb1ab984d97322953c0ba4c9962ad887697.gz
* added missing ACL checks for new index based searches | Andreas Gohr | 2005-09-12
  darcs-hash:20050912143027-7ad00-b2f3165d8db7122a453ecc63ad031af4467f691f.gz
* try faster rename before falling back to copy in indexer | Andreas Gohr | 2005-09-07
  darcs-hash:20050907210643-7ad00-a5cd36dc8b48ca445af87e9f066c7a54a98a3658.gz
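The log gives no details for this change, so the following is only a generic Python sketch of the "rename first, copy as fallback" pattern named in the subject line. The function name `move_file`, the injectable `rename` parameter, and the removal of the source after copying are assumptions, not taken from the patch.

```python
import os
import shutil


def move_file(src, dst, rename=os.replace):
    """Move src to dst, preferring a cheap rename over a full copy.

    `rename` is injectable only so the fallback path can be exercised.
    """
    try:
        rename(src, dst)  # fast path: no data is copied
    except OSError:
        # Slow path, e.g. when the rename is refused by the platform.
        shutil.copyfile(src, dst)
        os.remove(src)
```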
* indexer rename bugfix for Win32 | Dave Doyle | 2005-09-06
  darcs-hash:20050906214043-a62d3-65097acf0b035fd6fe9833136a15f9562e69970f.gz
* new fulltext search function using the index | Andreas Gohr | 2005-08-28
  The new search function was added but is not yet integrated into DokuWiki's interface.
  darcs-hash:20050828152821-7ad00-a6e79a9dc5aaf41c547cf42dccdbc3b5bc8d303e.gz
* index lookup function added | Andreas Gohr | 2005-08-27
  darcs-hash:20050827174813-7ad00-fe84d120801b63aaaf9f8482a66d1ed1181851bd.gz
* indexer improvements & fix for underscores | chris | 2005-08-16
  darcs-hash:20050816032408-50fdc-6e41585c9b97d70a218877b8ad169df9117d9965.gz
* much faster implementation of idx_getPageWords() | Chris Smith | 2005-08-15
  darcs-hash:20050815184030-d26fc-bb7d0a36885ddcaa3c680501c54dd7979056f73e.gz
* added stopword support to the indexer, added indexer webbug | Andreas Gohr | 2005-08-14
  darcs-hash:20050814181035-7ad00-ed5d879d29fcee7f925f806456675605b058966a.gz
* more indexer functions | Andreas Gohr | 2005-08-13
  darcs-hash:20050813142047-7ad00-8b4417587c577220191d6b6d35574e82b5bf5596.gz
* a first step for search indexing - nothing to see yet | Andreas Gohr | 2005-08-07
  darcs-hash:20050807203322-7ad00-6db6733f8fcd861366856635ba3d205fd3bb54da.gz