summaryrefslogtreecommitdiff
Commit message (Collapse)AuthorAge
* Merge remote-tracking branch 'lupo49/master'Andreas Gohr2011-03-18
|\
| * de/de-informal: added and updated translationsMatthias Schulte2011-03-14
| |
* | Slovak language updateMatej Urban2011-03-18
| |
* | Traditional Chinese language updateShuo-Ting Jian2011-03-18
|/
* Remove relation_references from the index when it is missingMichael Hamann2011-03-08
|
* Merge branch 'indexer_rewrite'Michael Hamann2011-03-08
|\
| * Adjust bin/indexer.php for the new indexerMichael Hamann2011-03-06
| | | | | | | | | | Now the indexer is directly called instead of duplicating a large part of the indexer code.
| * Merge the two indexer events and use string keysMichael Hamann2011-03-06
| | | | | | | | | | | | | | This merges the INDEXER_PAGE_ADD and INDEXER_METADATA_INDEX events and introduces the new string keys 'page', 'body' and 'metadata' in the event data. All plugins that use INDEXER_PAGE_ADD need to be adjusted to use the key 'page' instead of 0 and 'body' instead of 1.
| * Fix wildcard searchTom N Harris2011-02-27
| |
| * Restrict metadata values in indexer to string; skip unnecessary testTom N Harris2011-02-25
| |
| * Reduce memory footprint of tokenizer; make returned arrays use contiguous keysTom N Harris2011-02-25
| |
| * Fix pass by reference error, always return an array in lookupKey()Michael Hamann2011-02-24
| |
| * Merge branch 'master' into indexer_rewriteMichael Hamann2011-02-24
| |\ | | | | | | | | | | | | | | | | | | Conflicts: inc/fulltext.php inc/indexer.php lib/exe/indexer.php
| * | Add minimum length option to index histogramTom N Harris2011-02-23
| | |
| * | Increase version tag for new indexerTom N Harris2011-02-23
| | |
| * | Fix variable name type in indexerTom N Harris2011-02-22
| | |
| * | Implement histogram method of indexerTom N Harris2011-02-22
| | |
| * | Indexer version tag should include plugin namesTom N Harris2011-02-22
| | |
| * | Removing a page from the index deletes related metadata. Cache key names in ↵Tom N Harris2011-02-22
| | | | | | | | | | | | index.
| * | Indexer::lookupKey callback receives value reference as first argTom N Harris2011-02-22
| | |
| * | Special handling of title metadata indexTom N Harris2011-02-18
| | |
| * | Merge remote-tracking branch 'my-fork/master' into indexer_improvementsMichael Hamann2011-02-02
| |\ \
| | * | Indexer Rewrite v3: wildcards in lookupKey and automatically unwrap single ↵Tom N Harris2011-01-24
| | | | | | | | | | | | | | | | result
| | * | Indexer v3 Rewrite: streamline indexing of deleted or disabled pagesTom N Harris2011-01-24
| | | |
| * | | Merge branch 'master' into indexer_improvementsMichael Hamann2011-01-23
| |\ \ \ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Conflicts: inc/fulltext.php inc/indexer.php lib/exe/indexer.php
| * | | | Add INDEXER_VERSION_GET event so plugins can add their versionMichael Hamann2011-01-23
| | |/ / | |/| | | | | | | | | | | | | | | | | | This allows plugins to add their own version strings like plugin_tag=1 so pages can be reindexed when plugins update their index content.
| * | | Indexer v3 Rewrite: Use the metadata index for backlinks; add ↵Michael Hamann2011-01-23
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | INDEXER_METADATA_INDEX event This new event allows plugins to add or modify the metadata that will be indexed. Collecting this metadata in an event allows plugins to see if other plugins have already added the metadata they need and leads to just one single indexer call thus fewer files are read and written. Plugins could also replace/prevent the metadata indexer call using this event.
| * | | Indexer v3 Rewrite: fix addMetaKeys and lockingMichael Hamann2011-01-23
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This fixes addMetaKeys so it actually removes values. This also changes the functionality of the function: It now updates the key for the page with the current value instead of adding new values as this will be the default use case. A new parameter could be added to restore the "old" behavior when needed. addMetaKeys now only saves the index when the content has really been changed. Furthermore no empty number is added anymore to the reverse index when it has been empty previously. addMetaKeys now releases the lock again and really fails when the lock can't be gained.
| * | | Indexer v3 Rewrite: implement lookupKey()Michael Hamann2011-01-22
| | | | | | | | | | | | | | | | | | | | Saving and looking up metadata key/value pairs seems to work now at least with some basic tests.
| * | | Indexer v3 Rewrite: _saveIndexKey now really writes on the desired lineMichael Hamann2011-01-22
| | | | | | | | | | | | | | | | | | | | | | | | | | | | Now _saveIndexKey inserts empty lines when the index isn't long enough. This is necessary because the page ids are taken from the global page index, but there is not every page in the metadata key specific index so e.g. line 10 might be the first entry in the index.
| * | | Indexer v3 Rewrite: fix obvious typos and type errorsMichael Hamann2011-01-22
| | | |
| * | | Indexer v3 Rewrite part two, update uses of indexerTom N Harris2010-12-29
| | | |
| * | | Indexer v3 Rewrite part one (unstable)Tom N Harris2010-12-27
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The indexer functions have been converted to a class interface. Use the Doku_Indexer class to access the indexer with these public methods: addPageWords addMetaKeys deletePage tokenizer lookup lookupKey getPages histogram These functions are provided for general use: idx_get_version idx_get_indexer idx_get_stopwords idx_addPage idx_lookup idx_tokenizer These functions are still available, but are deprecated: idx_getIndex idx_indexLengths All other old idx_ functions are unsupported and have been removed.
| * | | Merge branch 'tokenizer-rewrite' into michituxTom N Harris2010-11-20
| |\ \ \
| | * | | Restore io_runcmd, use io_exec for exec with pipesTom N Harris2010-11-18
| | | | |
| | * | | Use a different indexer version when external tokenizer is enabledTom N Harris2010-11-17
| | | | |
| | * | | Use external program to split pages into wordsTom N Harris2010-11-16
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | An external tokenizer inserts extra spaces to mark words in the input text. The text is sent through STDIN and STDOUT file handles. A good choice for Chinese and Japanese is MeCab. http://sourceforge.net/projects/mecab/ With the command line 'mecab -O wakati'
| | * | | Repurpose io_runcmd for pipesTom N Harris2010-11-16
| | | | |
| | * | | tokenizer was returning prematurelyTom N Harris2010-11-15
| | | | |
| | * | | Refactor tokenizer to avoid splitting multiple timesTom N Harris2010-11-14
| | | | |
| | * | | Measure length of multi-character Asian wordsTom N Harris2010-11-14
| | | | |
| | * | | Do not assume that index files will be backward compatibleTom N Harris2010-11-14
| | | | |
| | * | | Remove unused idx_touchIndex functionTom N Harris2010-11-14
| | | | |
| * | | | Indexer improvement: regex instead of arrays for linesMichael Hamann2010-11-15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | When updating a single line that line was split into an array and in a loop over that array one entry was removed and afterwards a new one added. Tests have shown that using a regex for doing that is much faster which can be easily explained as that regex is very simple to match while a loop over an array isn't that fast. As that update function is called for every word in a page the impact of this change is significant.
| * | | | Indexer improvement: Only write the words index when neededMichael Hamann2010-11-15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This adds a simple boolean variable that tracks if new words have been added. When editing a page in many cases all words have already been used somewhere else or just one or two words are new. Until this change all words indexes read were always written, now only the changed ones are written. The overhead of the new boolean variable should be low.
| * | | | Indexer improvement: replace _freadline by fgetsMichael Hamann2010-11-15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | In PHP versions newer than 4.3.0 fgets reads a whole line regardless of its length when no length is given. Thus the loop in _freadline isn't needed. This increases the speed significantly as _freadline was called very often.
| * | | | Indexer speed improvement: joined array vs. single linesMichael Hamann2010-11-15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | From my experience with a benchmark of the indexer it is faster to first join the array of all index entries and then write them back together instead of writing every single entry. This might increase memory usage, but I couldn't see a significant increase and this function is also only used for the small index files, not for the large pagewords index.
* | | | | Pass edid to the mediamanagerAdrian Lang2011-03-06
| | | | |
* | | | | avoid broken page on bad non-UTF8 highlight stringAndreas Gohr2011-03-05
| | | | |
* | | | | avoid premature output before headers are sentAndreas Gohr2011-03-05
| | | | |