1 2011-02-24T00:01:13 <CIA-46> MattMaker <trip@entropyreductionservices.com> default * 15:b78b2a36947c 2.0/setup.py: require sqla 0.6 because of using the new api call ResultProxy.inserted_primary_key. confirmed at http://www.sqlalchemy.org/changelog/CHANGES_0_6_6
2 2011-02-24T00:01:16 <CIA-46> Thomas Waldmann <tw AT waldmann-edv DOT de> default * 16:f835165ac6f8 2.0/setup.py: merged MattMaker/moin-2.0
3 2011-02-24T00:02:47 *** Pranav_rcmas1
4 2011-02-24T00:08:52 *** Pranav_rcmas1
5 2011-02-24T00:09:45 *** Pranav_rcmas1
6 2011-02-24T00:10:42 *** izibi_
7 2011-02-24T00:16:03 *** Pranav_rcmas1
8 2011-02-24T00:24:46 *** Pranav_rcmas2
9 2011-02-24T00:36:51 <CIA-46> Thomas Waldmann <tw AT waldmann-edv DOT de> default * 17:41e2918dcafd 2.0/MoinMoin/items/__init__.py: escape metadata to avoid XSS / html / js injection via item_name, comment or other user-settable metadata (should fix #3 )
10 2011-02-24T00:37:49 <ThomasWaldmann> nice: https://bitbucket.org/thomaswaldmann/moin-2.0/issue/3/security-xss-by-url-item_name#comment-382300
11 2011-02-24T00:38:01 <ThomasWaldmann> auto-closing an issue via commit comment
12 2011-02-24T00:58:03 *** Pranav_rcmas2
13 2011-02-24T01:03:51 <ThomasWaldmann> eSyr: https://bitbucket.org/thomaswaldmann/moin-2.0/issue/2/security-rst-converter-javascript-urls same issue as in 1.x
14 2011-02-24T01:36:57 *** Matt
15 2011-02-24T01:37:23 *** Matt
16 2011-02-24T01:39:07 *** Guest73083
17 2011-02-24T01:41:11 <MattMaker> regarding rss to irc, I see several scripts out there, maybe this one would be useful (I have not tried it): http://paste.org/pastebin/view/18008
18 2011-02-24T02:44:24 *** diofeher
19 2011-02-24T04:10:45 *** diofeher[PB]
20 2011-02-24T05:04:46 *** MattMaker
21 2011-02-24T05:45:32 *** grzywacz
22 2011-02-24T05:56:59 *** Matt
23 2011-02-24T05:57:25 *** Matt
24 2011-02-24T05:57:40 *** Guest30668
25 2011-02-24T07:21:49 *** izibi
26 2011-02-24T07:33:46 *** Marchael
27 2011-02-24T08:04:23 *** Marchael1
28 2011-02-24T08:04:57 *** Marchael
29 2011-02-24T08:13:45 *** Marchael1
30 2011-02-24T08:13:50 *** Marchael
31 2011-02-24T08:42:14 *** izibi_
32 2011-02-24T08:44:57 *** izibi
33 2011-02-24T08:55:43 *** Marchael
34 2011-02-24T08:55:48 *** Marchael
35 2011-02-24T09:06:27 *** izibi_
36 2011-02-24T09:54:35 <dreimark_> moin
37 2011-02-24T09:58:19 *** dreimark_
38 2011-02-24T10:15:28 *** Marchael
39 2011-02-24T10:25:48 *** Marchael
40 2011-02-24T10:29:42 *** gwork
41 2011-02-24T10:59:04 *** Marchael
42 2011-02-24T10:59:39 *** greg_f
43 2011-02-24T11:40:33 <ThomasWaldmann> moin
44 2011-02-24T11:52:35 *** Pranav_rcmas1
45 2011-02-24T12:25:54 *** Marchael
46 2011-02-24T12:31:51 <Marchael> moin
47 2011-02-24T12:38:12 *** Marchael
48 2011-02-24T12:38:20 *** Marchael
49 2011-02-24T12:40:50 <Marchael> ThomasWaldmann: I read http://moinmo.in/ThomasWaldmann/Moin2BrainDump about searching. Am I understand right what currently moin1.9 using xapian search?
50 2011-02-24T12:44:47 <ThomasWaldmann> Marchael: in 1.9, xapian indexed search is an option. the default is non-indexed, slow search (builtin, pure python, only pages)
51 2011-02-24T12:49:39 *** Pranav_rcmas1
52 2011-02-24T12:54:40 *** Marchael
53 2011-02-24T12:54:46 *** Marchael
54 2011-02-24T12:56:38 <Marchael> ThomasWaldmann: hm, and if I could help moin2 with searching and indexing for moin2 based on xapian. Will this be enough for GSoC?
55 2011-02-24T12:56:51 <Marchael> Also, why Xapian?
56 2011-02-24T13:02:14 *** Pranav_rcmas1
57 2011-02-24T13:07:32 <ThomasWaldmann> implementing indexing, query parsing, searching, results presentation will likely be enough for soc
58 2011-02-24T13:08:18 <ThomasWaldmann> Marchael: we are open to any reasonable suggestion, using xapian is no precondition
59 2011-02-24T13:08:27 <TheSheep> whoosh :)
60 2011-02-24T13:08:59 <TheSheep> would be nice to be able to fall back to whoosh if xapian is not available, for example
61 2011-02-24T13:09:11 <ThomasWaldmann> TheSheep: apache license v2
62 2011-02-24T13:09:31 <TheSheep> ThomasWaldmann: it's not compatible with gpl?
63 2011-02-24T13:09:41 <TheSheep> ThomasWaldmann: I was under impression it is
64 2011-02-24T13:10:10 <ThomasWaldmann> http://www.apache.org/licenses/GPL-compatibility.html
65 2011-02-24T13:10:50 <TheSheep> Apache License, Version 2.0
66 2011-02-24T13:10:50 <TheSheep> This is a free software license, compatible with version 3 of the GPL.
67 2011-02-24T13:10:51 <TheSheep> Please note that this license is not compatible with GPL version 2, because it has some requirements that are not in the older version. These include certain patent termination and indemnification provisions.
68 2011-02-24T13:11:13 <TheSheep> ThomasWaldmann: we could ask the author for double licensing
69 2011-02-24T13:12:30 <ThomasWaldmann> yeah, cc: mobwrite author X)
70 2011-02-24T13:12:57 <ThomasWaldmann> TheSheep: you used woosh somewhere already?
71 2011-02-24T13:13:42 <ThomasWaldmann> an issue with supporting multiple search engines is always query parsing and user expectations of search behaviour
72 2011-02-24T13:19:46 <ThomasWaldmann> also, we have to consider scalability. some people have wikis with many gigabytes (mostly attachments)
73 2011-02-24T13:21:14 *** Marchael
74 2011-02-24T13:26:10 <TheSheep> ThomasWaldmann: I'm maintaining one project where whoosh is used, but I didn't write the code
75 2011-02-24T13:36:46 <dreimark> http://moinmo.in/MoinMoinChat/Logs/moin-dev/2010-08-15?highlight=%28whoosh%29
76 2011-02-24T13:42:17 <TheSheep> dreimark: that's just a link
77 2011-02-24T13:43:32 *** Marchael
78 2011-02-24T13:43:38 <dreimark> i prefer a python only solutions
79 2011-02-24T13:44:41 <dreimark> and we should not let block us from attachments currently.
80 2011-02-24T13:45:09 <Marchael> and what about search, where do I start?
81 2011-02-24T13:45:10 <dreimark> the filter trick should work by every engine
82 2011-02-24T13:54:51 *** Pranav_rcmas1
83 2011-02-24T13:55:56 <dreimark> TheSheep: can you link to the project above
84 2011-02-24T13:58:24 <TheSheep> dreimark: to whoosh?
85 2011-02-24T13:58:46 <dreimark> yes, is that you maintain public?
86 2011-02-24T13:58:53 <dreimark> +project
87 2011-02-24T14:00:31 <TheSheep> dreimark: no, it's not public
88 2011-02-24T14:01:02 * ThomasWaldmann also prefers python, but we have to keep performancs in mind. a full reindex run shouldn't take ages. :)
89 2011-02-24T14:01:21 <dreimark> i remember that it is quite fast
90 2011-02-24T14:01:24 <TheSheep> it's a homegrown cms that a certain company built and then outsourced maintenance to us
91 2011-02-24T14:03:46 <ThomasWaldmann> a pure python solution would have the plus that we could just require it for everybody
92 2011-02-24T14:04:14 <ThomasWaldmann> that could save us from having to build a compatibility layer for query parsing and results presentation
93 2011-02-24T14:08:52 <ThomasWaldmann> Marchael: you should look at storage stuff and get familiar with it
94 2011-02-24T14:08:58 *** Pranav_rcmas1
95 2011-02-24T14:10:42 <ThomasWaldmann> btw, we have sqlalchemy indexing for some metadata - that is also something that should be considered
96 2011-02-24T14:11:44 <ThomasWaldmann> (but it is more for precise search (== selecting stuff), not so much for fuzzy search, full text search, search in word stems)
97 2011-02-24T14:13:36 <ThomasWaldmann> dreimark: * XQuery/XPath syntax support? /!\ for what exactly would this be good for?
98 2011-02-24T14:14:00 <ThomasWaldmann> did you add that? can you please answer the question or just rephrase it so it is directly understandable?
99 2011-02-24T14:14:21 <dreimark> i think that was eSyr, but i look
100 2011-02-24T14:14:59 <dreimark> http://moinmo.in/4ct10n/info/GoogleSoc2011/InitialProjectIdeas?action=diff&rev2=10&rev1=9
101 2011-02-24T14:15:36 <Marchael> Access denied
102 2011-02-24T14:15:49 <dreimark> Marchael: i know, it is for mentors
103 2011-02-24T14:15:53 <Marchael> ok
104 2011-02-24T14:16:00 <dreimark> until we are clear what we can do
105 2011-02-24T14:16:10 <dreimark> +about
106 2011-02-24T14:20:53 <Marchael> ThomasWaldmann: what you recommend me to start getting familiar with storage? I mean maybe you need implement some feature or fix a bug?
107 2011-02-24T14:21:43 <Marchael> not very difficult of course :)
108 2011-02-24T14:22:48 <ThomasWaldmann> https://bitbucket.org/mchaput/whoosh/issue/115/dual-licensing-woosh
109 2011-02-24T14:23:42 <ThomasWaldmann> Marchael: storage has quite some issues still :)
110 2011-02-24T14:24:13 <Marchael> new moin logo is a bit scary...
111 2011-02-24T14:24:15 <ThomasWaldmann> one thing I am currently wondering about is why rev.timestamp is implemented as a property
112 2011-02-24T14:24:43 <ThomasWaldmann> Marchael: the round blue one?
113 2011-02-24T14:25:20 <Marchael> https://bitbucket-assetroot.s3.amazonaws.com/c/photos/2010/Sep/26/mastermoin2_avatar.png
114 2011-02-24T14:25:47 <ThomasWaldmann> no, that's not a new moin logo, rather a modification of a very old moin logo :)
115 2011-02-24T14:26:23 <Marchael> ah, ok.
116 2011-02-24T14:31:56 <Marchael> ThomasWaldmann: please go on :) I read EasyToDo but didn`t find any requests for storage...
117 2011-02-24T14:32:22 <ThomasWaldmann> the question about timestamp is why it is not just in the metadata dict as everything else
118 2011-02-24T14:32:59 <ThomasWaldmann> that would simplify serialization, metadata display, indexing...
119 2011-02-24T14:33:17 <ThomasWaldmann> but I guess it had some reason why we did it as property back then
120 2011-02-24T14:33:58 <ThomasWaldmann> Marchael: maybe read MoinMoin/storage/__init__.py for the basics
121 2011-02-24T14:34:05 <Marchael> ok
122 2011-02-24T14:59:41 <ThomasWaldmann> Marchael: also, on the MoinMoin2.0 page there are some talks about moin2 with background infos. the german dlr talk has some pictures about storage.
123 2011-02-24T15:06:01 <Marchael> ThomasWaldmann: in previous commits you are replace pytest on py.test, but `find . -name '*py.test*'` says nothing
124 2011-02-24T15:11:06 <Marchael> where '.' is dir with moin
125 2011-02-24T15:53:23 <CIA-46> Reimar Bauer <rb.proj AT googlemail DOT com> default * 547:983196519144 extensions/data/plugin/macro/pdf2svg.py:
126 2011-02-24T15:53:23 <CIA-46> macro.pdf2svg: fetches a pdf file from given url and converts it to svg.
127 2011-02-24T15:53:23 <CIA-46> It is rendered as svg image on the wiki page and shows a link to the original pdf file.
128 2011-02-24T15:54:13 * dreimark is annoyed from that stupid bugs in a browser plugin for rendering pdf files
129 2011-02-24T16:00:40 <ThomasWaldmann> Marchael: py.test is now installed into your virtualenv, after entering the env you can run py.test
130 2011-02-24T16:00:53 <ThomasWaldmann> (use quickinstall)
131 2011-02-24T16:14:23 *** izibi
132 2011-02-24T16:24:49 *** gwork
133 2011-02-24T17:11:46 <dreimark> bbl
134 2011-02-24T17:28:24 *** Marchael
135 2011-02-24T17:35:56 *** m4k3r
136 2011-02-24T18:05:54 *** Pranav_rcmas
137 2011-02-24T18:07:29 *** Pranav_rcmas1
138 2011-02-24T18:08:58 *** Marchael
139 2011-02-24T18:25:44 *** Marchael1
140 2011-02-24T18:26:45 *** Marchael
141 2011-02-24T18:38:55 *** Marchael1
142 2011-02-24T18:38:59 *** Marchael
143 2011-02-24T19:04:29 *** grzywacz
144 2011-02-24T19:04:30 *** grzywacz
145 2011-02-24T19:26:22 *** greg_f
146 2011-02-24T19:49:32 *** dreimark
147 2011-02-24T19:49:32 *** dreimark
148 2011-02-24T19:49:56 *** Marchael
149 2011-02-24T19:50:20 <dreimark> re
150 2011-02-24T19:54:49 *** kasun
151 2011-02-24T20:10:37 *** kasun
152 2011-02-24T20:11:02 <ThomasWaldmann> re
153 2011-02-24T20:24:51 *** grzywacz
154 2011-02-24T20:37:03 *** franklin
155 2011-02-24T20:37:11 *** franklin
156 2011-02-24T20:41:18 *** grzywacz
157 2011-02-24T20:41:19 *** grzywacz
158 2011-02-24T22:27:31 *** m4k3r
159 2011-02-24T22:27:40 *** m4k3r
160 2011-02-24T23:38:31 <MattMaker> it might be simpler to implement wiki item renaming as a "move wiki item from node A to node B" operation: 1) Item B is created if it does not exist. 2) Item B adds a Revision with A's content and with metadata that says "moved from A". 2) Item A adds a Revision that sets content to zero-length and with metadata that says "moved to B".
161 2011-02-24T23:38:37 <MattMaker> In this way all History is stored per name, which I think suits a wiki just fine and is even an improvement, since now anytime any node had this name, we will see its history under that name, a name which presumably gave meaning to those Revisions which were known by that name. Also, this moves the whole problem of maintaining history out of the hotter code paths. Displaying History can be handled in two ways: 1) When viewing the
162 2011-02-24T23:38:37 <MattMaker> history, the "moved from A" could be expressed as a wikilink to the history of B, and vice versa. I suggest this would be sufficient for most purposes. 2) the History view could follow the "moved to" and "moved from" links to gather a chain of Revisions to display what we are currently used to seeing as History.
163 2011-02-24T23:45:27 <ThomasWaldmann> http://xapian.wordpress.com/2009/02/12/xapian-performance-comparision-with-whoosh/
164 2011-02-24T23:48:37 <ThomasWaldmann> MattMaker: that'll potentially mix up histories of different items
165 2011-02-24T23:49:18 <MattMaker> I argue that if so, it would mix them up in a way that was actually closer to the user's intent
166 2011-02-24T23:51:08 <ThomasWaldmann> btw, we use rename for trashing, too
167 2011-02-24T23:52:33 <MattMaker> what you are thinking of as an item, is an abstract entity that could still be reconstructed as a path that follows a series of nodes, perhaps by giving a node a globally unique "origin" at birth, and remains readonly, which would then follow any moves as a bit of metadata
168 2011-02-24T23:52:59 <MattMaker> the problem is similar to threading email discussions
169 2011-02-24T23:53:44 <MattMaker> (when does the subject change so much that it should not be thought of as a reply? this happens often.)
170 2011-02-24T23:55:40 <MattMaker> I haven't looked at trashing yet, but various possible definitions of it could be expressed in the scheme I am suggesting
171 2011-02-24T23:56:54 <MattMaker> the question of security is an interesting one too.
172 2011-02-24T23:57:05 <MattMaker> it depends on what you intend to protect
173 2011-02-24T23:57:41 <ThomasWaldmann> the question is why should make it so complicated and not use anon items with uuid
174 2011-02-24T23:59:10 <MattMaker> I think this is actually orthogonal to the uuid question.
175 2011-02-24T23:59:24 *** Marchael
176