| Commit message (Collapse) | Author | Age |
| |
|
|
|
|
|
|
| |
it's part
of another term. Word boundaries are now respected.
|
|
|
|
| |
this was broken by d0bdf7659fce98c2922b151766d51d5c7e8814d6
|
| |
|
|
|
|
|
|
|
|
|
|
|
| |
This patch changes the ft_pageLookup function to always return the
title of pages with the result. This makes it easier to work with the
array, as it no longer changes between numeric and key indexes depending
on useheading.
This also means that action plugins subscribed to
SEARCH_QUERY_PAGELOOKUP need to be adjusted. The event contains a new
data field called 'has_titles' which plugins can use to check for
backwards compatibility.
|
| |
|
| |
|
| |
|
| |
|
|\
| |
| |
| |
| | |
Conflicts:
inc/fulltext.php
|
| | |
|
| |
| |
| |
| |
| |
| |
| |
| | |
Classes are loaded throug PHP5's class autoloader, all other
includes are just loaded by default. This skips a lot of
require_once calls.
Parser and Plugin stuff isn't handled by the class loader yet.
|
|/ |
|
| |
|
|
|
|
|
|
| |
Ignore-this: b48abc447a5d32123322eb08717d275d
darcs-hash:20091115121708-9b77a-4a0b6c7afb8db60e22ca07a4d9725474a869c1c8.gz
|
|
|
|
|
|
| |
Ignore-this: a0ebcdd129f4256e4be029e7fdf7ca45
darcs-hash:20091026092359-6e07b-4c41896825e091a3c8fbbeadc3bc7764d0735bf6.gz
|
|
|
|
|
|
| |
Ignore-this: f8687eff4a046379700068b01d94807
darcs-hash:20090921105201-9b77a-6e9ea14dfd7763f8b52ba02f6654f289ac9237e8.gz
|
|
|
|
|
|
|
|
|
| |
Ignore-this: cb05f50ca4de12e1cdf3a6cfb0e1b8bc
- better search experience in Asian language
- sophisticated search query syntax (OR, grouping, etc.)
darcs-hash:20090920121116-9b77a-2718be7a043374669037b10d94101fc70efb95e3.gz
|
|
|
|
|
|
|
|
| |
Ignore-this: fdf33ea5a6c50a597bd432c0da98e927
Snippets containing more of the seached words are preferred
over ones containing less search keywords
darcs-hash:20090504183835-a07b1-7b0da249fcb5680019fc3032dfd6fb063e94576a.gz
|
|
|
|
|
|
|
|
|
| |
Ignore-this: 4cd6bddacb795ef15f133559c223ac1f
* Adds the possibility to exclude namespace(s) from search, by preceding them with "^".
* Changed handling of search parameters to allow any order of words and namespaces.
darcs-hash:20090311160255-13810-c2e00cc7764d180967b4c6f22e17b1c0dafe36f4.gz
|
|
|
|
|
|
| |
to pages to which the user had no access
darcs-hash:20090119062124-f07c6-5d761a76a50c6c9bcc124fa89feb2fb7b0a9a9b5.gz
|
|
|
|
| |
darcs-hash:20081213090400-7ad00-4e21cd75978bb07513f32f5d750658e8d777c59e.gz
|
|
|
|
|
|
|
|
|
| |
The pagename matching search (AJAX and "real" search) now sort results based
on the namespace hierarchy levels before doing an alphabetically search.
This means pages with fewer namespace (ie. higher up in the hierarchy) wil
be shown first.
darcs-hash:20080812200649-7ad00-b58f152923864c3440e6412be58fb6fb25373583.gz
|
|
|
|
|
|
|
|
|
|
| |
SEARCH_QUERY_FULLPAGE event wraps around ft_pageSearch() call, the function which handles the
search action and feed searching.
The event data is the parameters of this function:
data['query']
darcs-hash:20080811110656-f07c6-a149de6cd0ebc997541fa6e3f4bc6788d663dbd3.gz
|
|
|
|
|
|
|
|
|
|
| |
Now higlighting phrases are passed as an array which then is quoted correctly
when used in a regexp. This should make phrase highlighting work completely
correct.
Please everyone test it.
darcs-hash:20080310214939-7ad00-1abefb02dde40edeead50b4fa5c866c46b95ca3a.gz
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This changes how DokuWiki looks for reference toa media file which is
about to deleted. Instead of doing a full grep through all pages it now
uses the fulltext index first, then does an exact match on the found
pages.
This speeds up the search significantly on larger wikis. However the
fulltext search limits now apply: images with names shorter than 3 charcters
may not be found.
This needs extensive testing!
darcs-hash:20080223205254-7ad00-486de0a4125d51b4e7999827f710d1d9de8bc60d.gz
|
|
|
|
|
|
|
|
|
| |
This patch makes the highlighting of phrases in search snippets and on
the pages itself much better.
Now a regexp gets passed to the ?s
darcs-hash:20080215174653-7ad00-cd2d6f7d408db7b7dd3cb9974c3eb27f3a9baeac.gz
|
|
|
|
|
|
| |
bool page_exists($id, $rev
darcs-hash:20070930021040-d26fc-e3847bfdd20a36154685262eca94211cfd461e83.gz
|
|
|
|
|
|
|
|
|
|
|
| |
The use of realpath() to clean up relative file names caused some
trouble in certain setups relying on symlinks or having restricitve
file structure setups.
This patch replaces all realpath() calls with a PHP only replacement
which should solve those problems.
darcs-hash:20070930184250-7ad00-512ff04c95f57fc9eaf104f80372237a3c94286f.gz
|
|
|
|
| |
darcs-hash:20070804081226-7ad00-a8e7127c7122a96f9817158d87e1a364d8cdbc9f.gz
|
|
|
|
| |
darcs-hash:20070718104839-7ad00-50348c1834c78e891f049023d2e8894d6bb0a00b.gz
|
|
|
|
| |
darcs-hash:20070514222527-d5083-53ed619daf07d0a84c52161465d163abf1400529.gz
|
|
|
|
| |
darcs-hash:20070330215042-19e2d-3528f2412ff044eb45158f349db5bbb5e32d907b.gz
|
|
|
|
| |
darcs-hash:20070303220143-7ad00-5d592dbebaae371c03102b20ae7e0d9e433b378b.gz
|
|
|
|
| |
darcs-hash:20070205191848-7ad00-77ad5a398534a7a64884e155c4607350e0f25a7c.gz
|
|
|
|
| |
darcs-hash:20061124215413-7ad00-f2bd46b7edf70660cc3e0274bd222eafba1edbc6.gz
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
A modification to the indexer that sorts words based on length. This should make
searching a little bit more efficient. After the patch is applied, your old index
will be automatically converted to the new format (when you visit a page). The
new index format is:
1. Index files are stored in savedir/index
2. Word lists are stored as wlen.idx. This used to be word.idx.
3. Word indexes are stored as ilen.idx. This used to be index.idx.
4. The page list, page.idx, is simply copied to the new location.
Any plugins you have, such as the blog plugin, that read the index files need to
be updated.
darcs-hash:20061112194900-2b9f0-a975498ccf0a1d39c6df73b79bcd028d5e81c389.gz
|
|
|
|
|
|
|
| |
- add deaccented and romanised page names to index word list
- remove stop words from tokens used in backlink search
darcs-hash:20061105195453-9b6ab-6c4989eb75782af60a3de3bddbc99a83de2b4c80.gz
|
|
|
|
|
|
|
| |
This patch removes some commented code fragments and alternative
snippet generators
darcs-hash:20061008090624-7ad00-14bfee2ded6c6c8ef43ad02a4c02a5d95ee9daf7.gz
|
|
|
|
|
|
|
| |
- rework utf8_substr() NOMBSTRING code to always use pcre
- remove work around for utf8_substr() and large strings from ft_snippet()
darcs-hash:20060928165122-9b6ab-0eefc216f07f9d7e7d8eb62ce26605c28ee340fa.gz
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This patch primarily updates p_cached_xhtml() and p_cached_instructions() to
allow their caching logic to be surrounded by an event trigger.
p_cached_xhtml() has been rewritten as the more general p_cached_output() to
support other render output formats besides 'xhtml'. All calls to
p_cached_xhtml() have been changed to refer to the new function.
New event:
name: PARSER_CACHE_USE
data: cache object (see below)
action: determine if cache file can be used
preventable: yes
result: bool, true to use cache file, false otherwise
Cache operations have been generalised in a new class, cache, extended to
cache_parser, cache_renderer & cache_instructions. Details can be found in
inc/cache.php
For handling of above event, key properties are:
- page, if present the wiki page id,
may not always be present, e.g. when called for locale xhtml files
- file, source file
- mode, renderer mode (e.g. 'xhtml') or 'i' for instructions
Other changes:
- cache class counts cache hits against attempts, results are stored in
{cache_dir}/cache_stats.txt
- adds metadata dependency to renderer page cache
- replaces purgefile dependency for renderer cache with metadata
'relation references' (internal link) dependency for wiki pages only
darcs-hash:20060911021418-9b6ab-19601ed194b8c8e45236ab72c3e23d78bf777e6c.gz
|
|
|
|
| |
darcs-hash:20060901002016-9b6ab-716518138edf541a869510d7c2934b9474547fc3.gz
|
|
|
|
| |
darcs-hash:20060831092146-9b6ab-b00aa29c982ab18117f476b3d01d5111915c9d4b.gz
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
ft_snippet()
- make utf8 algorithm default
- add workaround for utf8_substr() limitations, bug #891
- fix some indexes which missed out on conversion to utf8
character counts
- minor improvements
idx_lookup()
- minor changes to wildcard matching code to improve performance
(changes based on profiling results)
utf8
- specifically set mb_internal_coding to utf-8 when mb_string
functions will be used.
darcs-hash:20060831003413-9b6ab-712021eda3c959ffe79d8d3fe91d2c9a8acf2b58.gz
|
|
|
|
|
|
|
|
|
|
|
|
| |
- correct "opt1" algorithm for multibyte utf8
- minor improvement to "opt2" for short pages
- add "utf8" algorithm, this algorithm endeavours
to work with whole utf8 character as much as
possible. The resulting snippet will tend to
100 characters, rather than the 100 bytes of
"opt1" and "opt2".
darcs-hash:20060826234333-9b6ab-ae4c60c8855a92b133cb8d5a230098203f610e7b.gz
|
|
|
|
| |
darcs-hash:20060826095311-9b6ab-9a6f272cc7c7532eb2bad8f7b4404c5a16b71109.gz
|
|
|
|
|
|
|
| |
This adds code to remove or replace invalid UTF-8 bytes and uses it
in the ft_snippets function.
darcs-hash:20060826082919-7ad00-a94004de159ae93ff5b7270fd3e631ff467233cd.gz
|
|
|
|
| |
darcs-hash:20060825134730-9b6ab-086ee0647af39c4398cf1726324d8215722a39db.gz
|
|
|
|
|
|
|
|
|
|
|
| |
This patch includes two alternative algorithms for ft_snippet(), the code
which prepares the snippets seen on the search page - and the most time
consuming part of the production of that page.
If you have $conf['allowdebug'] on, you can specify the search algorithm to
use by adding &_search
darcs-hash:20060825104046-9b6ab-942d81a43cf0f85bfd235cabf6c35dd4b20e0b71.gz
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
- now its possible to restrict the fulltext-search to
multible namespaces
Examples:
searchword @ns1 @ns2 @ns3
"exact phrase" @ns1 @ns2 @ns3
darcs-hash:20060518204647-484ab-061521a81f13360e33496e5163e3cd263a9c1ad6.gz
|