Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Indexer v3 Rewrite part two, update uses of indexer | Tom N Harris | 2010-12-29 |
| | |||
* | Indexer v3 Rewrite part one (unstable) | Tom N Harris | 2010-12-27 |
| | | | | | | | | | | | | | | | | | | | | | | | | The indexer functions have been converted to a class interface. Use the Doku_Indexer class to access the indexer with these public methods: addPageWords addMetaKeys deletePage tokenizer lookup lookupKey getPages histogram These functions are provided for general use: idx_get_version idx_get_indexer idx_get_stopwords idx_addPage idx_lookup idx_tokenizer These functions are still available, but are deprecated: idx_getIndex idx_indexLengths All other old idx_ functions are unsupported and have been removed. | ||
* | Merge branch 'tokenizer-rewrite' into michitux | Tom N Harris | 2010-11-20 |
|\ | |||
| * | Restore io_runcmd, use io_exec for exec with pipes | Tom N Harris | 2010-11-18 |
| | | |||
| * | Use a different indexer version when external tokenizer is enabled | Tom N Harris | 2010-11-17 |
| | | |||
| * | Use external program to split pages into words | Tom N Harris | 2010-11-16 |
| | | | | | | | | | | | | | | | | | | An external tokenizer inserts extra spaces to mark words in the input text. The text is sent through STDIN and STDOUT file handles. A good choice for Chinese and Japanese is MeCab. http://sourceforge.net/projects/mecab/ With the command line 'mecab -O wakati' | ||
| * | Repurpose io_runcmd for pipes | Tom N Harris | 2010-11-16 |
| | | |||
| * | tokenizer was returning prematurely | Tom N Harris | 2010-11-15 |
| | | |||
| * | Refactor tokenizer to avoid splitting multiple times | Tom N Harris | 2010-11-14 |
| | | |||
| * | Measure length of multi-character Asian words | Tom N Harris | 2010-11-14 |
| | | |||
| * | Remove unused idx_touchIndex function | Tom N Harris | 2010-11-14 |
| | | |||
* | | Indexer improvement: regex instead of arrays for lines | Michael Hamann | 2010-11-15 |
| | | | | | | | | | | | | | | | | | | | | When updating a single line that line was split into an array and in a loop over that array one entry was removed and afterwards a new one added. Tests have shown that using a regex for doing that is much faster which can be easily explained as that regex is very simple to match while a loop over an array isn't that fast. As that update function is called for every word in a page the impact of this change is significant. | ||
* | | Indexer improvement: Only write the words index when needed | Michael Hamann | 2010-11-15 |
| | | | | | | | | | | | | | | | | This adds a simple boolean variable that tracks if new words have been added. When editing a page in many cases all words have already been used somewhere else or just one or two words are new. Until this change all words indexes read were always written, now only the changed ones are written. The overhead of the new boolean variable should be low. | ||
* | | Indexer improvement: replace _freadline by fgets | Michael Hamann | 2010-11-15 |
| | | | | | | | | | | | | | | In PHP versions newer than 4.3.0 fgets reads a whole line regardless of its length when no length is given. Thus the loop in _freadline isn't needed. This increases the speed significantly as _freadline was called very often. | ||
* | | Indexer speed improvement: joined array vs. single lines | Michael Hamann | 2010-11-15 |
| | | | | | | | | | | | | | | | | From my experience with a benchmark of the indexer it is faster to first join the array of all index entries and then write them back together instead of writing every single entry. This might increase memory usage, but I couldn't see a significant increase and this function is also only used for the small index files, not for the large pagewords index. | ||
* | | Use native PHP JSON functions when available | Michael Hamann | 2010-11-15 |
|/ | |||
* | improved some metadata comments | Dominik Eckelmann | 2010-11-13 |
| | |||
* | FS#2079: always show profile and subscribe links/buttons | Anika Henke | 2010-11-13 |
| | |||
* | Introduce metadata write wrapper p_save_metadata | Adrian Lang | 2010-11-13 |
| | | | | p_purge_metadata now updates the metadata cache and the INFO array like the other metadata writing functions | ||
* | Simpler ID trimming | Adrian Lang | 2010-11-13 |
| | |||
* | Remove metadata conversion from 0a7e3bce (2006-11-26) | Adrian Lang | 2010-11-13 |
| | |||
* | Merge branch 'master' of github.com:splitbrain/dokuwiki | Anika Henke | 2010-11-12 |
|\ | |||
| * | Merge branch 'sitemap' | Michael Hamann | 2010-11-12 |
| |\ | |||
| | * | Only add successfully created sitemap items to the sitemap | Michael Hamann | 2010-11-02 |
| | | | |||
| | * | Merge branch 'master', remote branch 'sitemap/master' | Michael Hamann | 2010-11-02 |
| | |\ | |||
| | | * | Merge remote branch 'origin/master' into sitemap | Michael Hamann | 2010-10-10 |
| | | |\ | |||
| | | * | | Action handler for sitemaps improved | Michael Hamann | 2010-09-22 |
| | | | | | | | | | | | | | | | | | | | | | | | | | The action handler for the sitemap now makes use of the sitemapper methods for determining the filename and uses http conditional requests. | ||
| | | * | | Sitemapper code improved and documentation added | Michael Hamann | 2010-09-22 |
| | | | | | | | | | | | | | | | | | | | | | | | | | Removed some calls to dbglog, improved the code performance and added documentation for all functions and classes of the Sitemapper. | ||
| | | * | | Restructured the sitemapper | Michael Hamann | 2010-09-22 |
| | | | | | |||
| | | * | | Transformed the sitemapper into a class | Michael Hamann | 2010-09-22 |
| | | | | | | | | | | | | | | | | | | | | This makes it possible to autoload the sitemapper when needed. | ||
| | | * | | Sitemap rewrite | Michael Hamann | 2010-09-22 |
| | | | | | |||
* | | | | | FS#1839: take favicon from mediadir (if it exists) | Anika Henke | 2010-11-12 |
|/ / / / | |||
* | | | | Merge branch 'master' of github.com:splitbrain/dokuwiki | Anika Henke | 2010-11-07 |
|\ \ \ \ | |||
| * | | | | Chinese Language update | Choicky Chou | 2010-11-07 |
| | | | | | |||
| * | | | | handle mailfrom replacements in a central place FS#2091 | Andreas Gohr | 2010-11-05 |
| |/ / / | |||
| * | | | Fix XSS vulnerability FS#2085 | Vadim Nevorotin | 2010-10-31 |
| | | | | |||
* | | | | make custom buttons possible with html_btn() without the need of global ↵ | Anika Henke | 2010-11-07 |
|/ / / | | | | | | | | | | $lang (more consistent with tpl_pagelink()) | ||
* | | | Merge remote branch 'lupo49/master' | Andreas Gohr | 2010-10-30 |
|\ \ \ | |||
| * | | | de-informal / typo fix | Matthias Schulte | 2010-10-27 |
| | | | | | | | | | | | | | | | | de / typo fix | ||
| * | | | de / typo fixed | Matthias Schulte | 2010-10-27 |
| | | | | |||
* | | | | Basque language update | Inko I.A | 2010-10-30 |
|/ / / | |||
* | | | Fix SafeFN-encoded file names | Adrian Lang | 2010-10-26 |
| | | | | | | | | | | | | SafeFN encoding now always end the encoding block by appending a dot at the end of the file name. This is necessary since the file name may get an extension which is not encoded. | ||
* | | | do not strip .txt in namespaces in index | Andreas Gohr | 2010-10-26 |
| | | | |||
* | | | Fix sorting of quicksearch results | Adrian Lang | 2010-10-24 |
| | | | |||
* | | | fixed missing global | Andreas Gohr | 2010-10-24 |
| | | | |||
* | | | detect corrupt title indexes FS#2076 | Andreas Gohr | 2010-10-24 |
| | | | |||
* | | | Correctly decode user names in unsubscriptions | Adrian Lang | 2010-10-22 |
| | | | |||
* | | | Correctly decode user names in subscriptions | Adrian Lang | 2010-10-21 |
| | | | |||
* | | | Chinese language update | lainme | 2010-10-19 |
| | | | |||
* | | | Portuguese language update | André Neves | 2010-10-19 |
| | | |