Basic ideas

More user influence:

Add more things you'd like to the list above, if they're not too exotic.

See

Suggestion

Use special chars to define what to do with the search pattern

Query language

This is a draft of a possible query language. Please disscuss variants at the "Suggestion" section above.

Architecture

Frontend -> Searchpattern -> Results -> Sorted Results -> Output

String -> Query tree -> list of result objects -> list of result objects -> request.write(formatter.xxx())

Implementation Log

I started implementing some of the data structures above. Things look already usable. I now think about how it is possible to integrate indexed and attachment search -- FlorianFesti 2004-03-01 17:37:27

see also /AttachmentSearch

Indexing

As mentioned in the article above, this will run into problems if the database will grow and no longer fits into memory. 1.000.000 pages per several kB are several GB of data. I would not expect that the database will be much smaller than the wiki pages. To handle this amount of data real databases are needed.

Problems

Using an external search engine

As an alternative to writing such an indexing algorithm an external search engine program could be used, which would give us more functionality with less work. Of cause the use of an external search engine will be optional.

Requirements:

Possible candidates are:

I've installed xapian and played a bit with it. As far as I can see it looks good. The python binding is not beautiful but acceptable. Reading the mailing list it looks like xapian is capable of dealing with large amounts of data (several GB, hundred thousands of documents). Looks good -- FlorianFesti 2004-02-06 21:13:33

MoinMoin: MoinMoinTodo/ExtendedSearch (last edited 2009-01-25 21:49:59 by ThomasWaldmann)