[18:23:54] <JamesF> OK, well, the current agenda is: (prepare for a bit of a flood)
[18:24:13] <JamesF> # Single login
[18:24:13] <JamesF> * Review Brion's collision estimates
[18:24:13] <JamesF> * Review specification status
[18:24:13] <JamesF> * Set timetable
[18:24:13] <JamesF> # Wikidata
[18:24:13] <JamesF> * Status
[18:24:13] <JamesF> * Taking things forward
[18:24:28] <JamesF> # Metadata
[18:24:28] <JamesF> * Use IEEE LOM?
[18:24:28] <JamesF> * Based off Wikidata?
[18:24:28] <JamesF> # Watchlist analysis
[18:24:28] <JamesF> * How to do this within privacy policy?
[18:24:28] <JamesF> # Shared watchlist feature
[18:24:40] <JamesF> Is there anything else people think we should add?
[18:24:45] <JamesF> Or remove, of course.
[18:24:52] <JamesF> We don't want the meeting to drag on for too long.
[18:24:57] <Eskimo> no problem for me
[18:25:35] * Solensean (n=solensea@wikipedia/Solensean) has joined #wikimedia-research
[18:25:44] <Creidieki> I've wanted to suggest a more robust development bounty system for a while.
Is this a good forum for making that sort of initial suggestion?
[18:25:58] <JamesF> Possibly.
[18:26:05] <JamesF> Hello Solensean.
[18:26:10] <Solensean> Hello, JamesF.
[18:26:12] <Solensean> Hello, all.
[18:27:24] * brian0918 (i=brian091@69-168-139-28.clvdoh.adelphia.net) has joined #wikimedia-
research
[18:27:29] <Creidieki> Well, if we don't have time that would be fine, I can always write a post
to wikidev seeing what people think.
[18:28:16] <nichtich> I just added "* Possible [[Wikimedia budget|Budget]] needs" to the agenda
[18:28:44] <JamesF> Oh, good idea.
[18:28:53] <JamesF> I've got to come up with some budget ideas.
[18:29:07] <nichtich> Mav wrote: "I would really like to encourage the research department to
develop ideas for special projects that aim to improve our software and content. Think
big while keeping our mission in mind. Any ideas we can't fund with regular donation
income could be used in the future to entice big donors or used in a grant application."
[18:29:29] * mindspillage is now known as brain|food
[18:29:42] <JamesF> Yeah.
[18:30:22] * karynn (n=kmartin@pdpc/supporter/monthlybronze/karynn) has joined #wikimedia-research
[18:31:11] <nichtich> JamesF: Where should I add ideas?
[18:31:44] <JamesF> nichtich> To the agenda?
[18:31:50] <JamesF> (http://meta.wikimedia.org/wiki/Wikimedia_Research_Network/Meetings/2005-12-03)
[18:32:07] <JamesF> ... which you moved there, so know where it is.
[18:32:09] <JamesF> :-)
[18:32:24] <nichtich> I meant ideas for budget needs
[18:32:44] <JamesF> Those can go under the umbrella item, I suppose.
[18:32:55] * CXI (i=Sanity@dialup-164.89.221.203.acc50-kent-syd.comindico.com.au) has joined
#wikimedia-research
[18:36:03] <Eskimo> also, some people wanted to have a MediaWiki integrated forum, is it something
wikimedia-research can discuss about ?
[18:36:06] <Eskimo> or is it a mediawiki developpers problem ?
[18:36:45] * delphine (n=Delphine@wikipedia/notafish) has joined #wikimedia-research
[18:37:57] <JamesF> We can talk about it.
[18:38:07] <nichtich> hi delphine!
[18:38:08] <JamesF> There have been lots of discussion about it by the developers, though.
[18:38:12] <JamesF> So they might be interested.
[18:38:16] <JamesF> Hello Delphine.
[18:38:50] <delphine> salut Jakob :)
[18:38:53] <delphine> hiho JamesF :)
[18:38:58] <nichtich> Eskimo: I'd prefer not to talk about it ;-)
[18:39:12] <delphine> salut illustrious unknown ;)
[18:39:23] * avar (n=Huld@adsl6-56.simnet.is) has joined #wikimedia-research
[18:39:25] <Eskimo> ok
[18:40:07] <Eskimo> hi delphine :d
[18:41:08] <JamesF> Hello avar.
[18:41:54] <avar> foo
[18:42:11] * CXI waves
[18:42:14] <Eskimo> ...bar
[18:42:19] <karynn> Eskimo: a mediawiki integrated forum?
[18:42:29] <Eskimo> yes :p
[18:42:59] <JamesF> karynn> I assume Eskimo means something like LiquidThreads?
[18:43:24] <Eskimo> yes
[18:43:34] <Eskimo> ok, I didn't know this was planed :p
[18:43:43] <Eskimo> (just googled it)
[18:44:25] <JamesF> Well, discussed, certainly.
[18:45:22] <Eskimo> yes, MediaWiki 2.0 is not for tomorrow :d
[18:45:36] <CXI> hmm, interesting
[18:45:59] <ambi2> interesting, but kind of goes against the whole "wiki" idea
[18:46:04] <CXI> I'm all for in-band ways of wikipedians connecting, which I kinda feel we're
missing at this point
[18:46:26] <JamesF> ambi> It'd be interesting to see it in action.
[18:47:31] <Creidieki> Wow, LiquidThreads looks sexy, but I don't know how I feel about disabling
the ability to edit peoples' comments. Sometimes people make formatting errors, etc.
[18:48:44] <sannse> It's become quite a social no-no to edit almost anything in other's comments now
[18:48:54] <JamesF> Well.
[18:48:59] <JamesF> I do it every now and then.
[18:49:04] <JamesF> Mainly for formatting, though.
[18:49:13] <JamesF> And these would be properly formatted automagically, so...
[18:49:19] <sannse> *nod* but it's much less accepted now than it used to be
[18:49:22] <CXI> just a brief question... is it more likely that people will use this than VP?
[18:49:31] <JamesF> But you'd be able to "refactor" a set of comments into a summary, of course.
[18:49:36] <JamesF> VP?
[18:49:54] <Creidieki> Yeah, but formatting and links and stuff still get edited these days...
[18:50:17] <Eskimo> Village pump ?
[18:50:20] <JamesF> Indeed - correcting links in people's comments happens quite often.
[18:50:31] <CXI> village pump (generalise that to other non-software-supported discussion areas)
[18:50:41] <JamesF> The Village Pump is a place where we'd use LT, I imagine.
[18:50:50] <JamesF> (Hmm, we seem to have started the meeting early. :-))
[18:50:52] <Creidieki> I mean, aside from maybe Lysdexia, have there been many problems with comment
editing?
[18:51:21] <Creidieki> (JamesF, would you prefer us to delay?)
[18:51:49] <JamesF> Creidieki> Well, it would allow people who aren't here early to have input...
[18:51:55] * leoadec (i=leoadec@201-1-135-198.dsl.telesp.net.br) has joined #wikimedia-research
[18:52:13] <Eskimo> actually, I don't think LiquidThreads should be discussed today :)
[18:52:36] <Creidieki> JamesF, that's true, but I think we're mostly going through background
material at this point. But I'll defer to your judgment.
[18:52:36] <JamesF> Eskimo> Premature?
[18:52:49] <Eskimo> since it's won't be added until 2.0
[18:53:08] <JamesF> Eskimo> Well, the Research Network can work on long-gestation projects too.
[18:53:09] <Creidieki> Eskimo, that's the case with Wikidata too, right?
[18:53:19] <JamesF> Single User Login seems to be one of them, for instance.
[18:53:26] <Eskimo> ok
[18:53:30] <JamesF> (Sadly.)
[18:53:43] <Eskimo> yes...
[18:53:44] <JamesF> I think we'll have it as a later part of the agenda.
[18:53:50] <JamesF> So we can drop it if pushed for time.
[18:54:15] <Eskimo> single login or LT ?
[18:54:24] <JamesF> LiquidThreads.
[18:54:24] <nichtich> I'd like to strictly follow the agenda for not to talk endlessly about
everything and nothing
[18:54:38] <Eskimo> ok, great, single login is very important :)
[18:54:45] <JamesF> Brion said he'd try to get some statistucs for us to work with for SUL.
[18:54:59] <JamesF> ... but he's not yet online, so perhaps he's had something come up.
[18:55:10] * ambi2 is not necessarily convinced that either of them are of tremendous importance
[18:56:08] <JamesF> The budget ideas blue-skying is the most time-pressured item.
[18:56:48] <CXI> the which?
[18:57:29] <JamesF> We (I?) would like ideas for budget items for the Research division.
[18:57:38] <JamesF> Last year's spending was a princely US$0.
[18:57:44] * porao (n=noname@166.Red-83-32-95.dynamicIP.rima-tde.net) has joined #wikimedia-research
[18:57:47] <JamesF> So we can only spend more. :-)
[18:57:48] <porao> hi
[18:57:52] <JamesF> Hello porao.
[18:58:16] <CXI> well, more than or equal to anyway... heck, I'm sure we could achieve less if we
started doing some kind of fundraiser :oP
[18:58:22] <CXI> research division brownies, anyone?
[18:58:59] * JamesF grins.
[18:59:32] <Eskimo> it's 7 PM, I guess we can start the discussions on the agenda ?
[18:59:47] * WiseWoman (n=weberwu@p54BD000A.dip0.t-ipconnect.de) has joined #wikimedia-research
[18:59:53] <WiseWoman> Hello!
[18:59:59] <JamesF> Wait a few minutes just in case some others arrive.
[19:00:01] <JamesF> Hello WiseWoman.
[19:00:10] <Eskimo> ok
[19:00:13] <CXI> *shakes head* ouch, it's 7pm there?
[19:00:31] <WiseWoman> I'm still in the middle of baking a cake, in case I don't answer the next 15
minutes :-)
[19:00:41] <JamesF> CXI> It is, yes.
[19:00:57] <Eskimo> 7 PM in the UK
[19:01:09] <Eskimo> 8 PM for me :p
[19:01:11] <CXI> 6am, hooray for au
[19:01:11] <JamesF> OK, well, people who said they'd turn up were WiseWoman, here, OdileB, Eskimo,
and Kevin.
[19:01:31] <JamesF> CXI> Yes, but's it's 06:00 tomorrow. Or something.
[19:01:34] <brian0918> 2pm
[19:01:45] <there> 2 hours still yeah?
[19:01:52] <CXI> well, what day it is isn't so bad - it's more the fact that it's 6am :P
[19:02:02] <JamesF> there> No, now.
[19:02:07] <there> oops.
[19:02:10] * there is now known as here
[19:02:16] <JamesF> Got your time conversion wrong? :-)
[19:02:17] <here> hey all ;)
[19:02:36] <JamesF> OK, so we're still missing OdileB and Kevin...?
[19:02:55] <odile> hello :)
[19:03:02] <here> cake!, alright..
[19:03:02] * Angela (n=beesley@wikipedia/Angela) has joined #wikimedia-research
[19:03:06] <JamesF> Ah, whoops. Didn't see you there.
[19:03:09] <JamesF> And Angela.
[19:03:10] <WiseWoman> Worldwide meetings are so difficult to arrange!
[19:03:13] <CXI> yeah, how good is cake?
[19:03:14] <Angela> Hi.
[19:03:17] <JamesF> WiseWoman> Indeed.
[19:03:20] <JamesF> Hello Angela.
[19:03:22] <CXI> actually, that's making me feel like breakfast
[19:03:39] <WiseWoman> The cake won't be out for another 6 minutes.
[19:03:49] <JamesF> OK, I think we should start.
[19:03:53] <WiseWoman> But it tends to be very good, what a shame we are't in the same room :)
[19:03:57] <WiseWoman> Okay!
[19:04:04] <CXI> bah, cornflakes, my old standby
[19:04:18] <JamesF> OK, well, given that Brion isn't here, I'm going to skip the first agendum.
[19:04:28] <brian0918> what was that
[19:04:33] <JamesF> Single login.
[19:04:36] <brian0918> oh
[19:04:41] <WiseWoman> We can just delay it in case he does show up.
[19:04:48] <JamesF> Brion was going to do an analysis of collisions.
[19:04:50] <JamesF> Yes, exactly.
[19:05:04] * Pathoschild (n=Pathosch@207.162.58.44) has joined #wikimedia-research
[19:05:10] <JamesF> And Erik's not here for Wikidata.
[19:05:16] <JamesF> Bad start, 'eh?
[19:05:22] <brian0918> brion hasn't been on IRC for 9+ hours
[19:05:23] <WiseWoman> Oh dear, short meeting?
[19:05:29] <JamesF> brian> Yes, I know.
[19:05:31] <WiseWoman> Did they miss that we are now on wintertime?
[19:05:35] <JamesF> WiseWoman> Hopefully not too much.
[19:05:36] <CXI> haha
[19:05:37] * Datrio (i=dariosik@wikipedia/Datrio) has joined #wikimedia-research
[19:05:52] <JamesF> Hello Pathoschild, Datrio.
[19:05:58] <Pathoschild> Hello.
[19:06:06] <Datrio> hey
[19:06:08] <JamesF> So, shall we talk about metadata?
[19:06:18] <here> how does wikidata relate to semanticmediawiki ?
[19:06:19] <WiseWoman> Sure!
[19:06:44] <JamesF> here> Semantic context could be given using Wikidata, as I understand it. But
more on that later (perhaps another meeting).
[19:06:56] <here> ok, onward
[19:06:58] * matusz (n=matusz@cko253.neoplus.adsl.tpnet.pl) has joined #wikimedia-research
[19:07:01] * presroi (n=neubau@p5484CACE.dip0.t-ipconnect.de) has joined #wikimedia-research
[19:07:02] <presroi> hi
[19:07:09] <JamesF> Hello matusz, presroi.
[19:07:51] <JamesF> The concept of meta-data is quite simple, really - being able to tag articles or
other content with (potentially machine-readable) information.
[19:07:59] <WiseWoman> Why do people feel that we need metadata and why has LOM been proposed?
[19:08:13] * here is unfamiliar with LOM details.
[19:08:16] <brian0918> Metadata lets things be more easily found on wikipedia.
[19:08:22] <WiseWoman> Don't we have categories for that? How rich is the meta data supposed to be.
[19:08:26] <odile> WiseWoman: good question
[19:08:38] <JamesF> WiseWoman> Categories are a first-stab at metadata, but are not great.
[19:08:39] <brian0918> Brion explained it all to me and it sounds very promising.
[19:08:40] <WiseWoman> LOM = "Learning Objects MEtadata", and I don't see Wiki articles as being
learning objects.
[19:08:45] <WiseWoman> But I am rather a bitch about that.
[19:08:56] <brian0918> It's not like you would be required to use it either, or like you would have
to do anything differently.
[19:08:57] <JamesF> WiseWoman> There's no concept of anything other than boolean state.
[19:09:01] * Xirzon (n=anon@wikipedia/Eloquence) has joined #wikimedia-research
[19:09:03] <nichtich> but wikibooks can use LOM
[19:09:05] <WiseWoman> http://ltsc.ieee.org/wg12/
[19:09:05] <JamesF> Aha, Erik.
[19:09:06] <Xirzon> hi
[19:09:08] <JamesF> Hello Erik.
[19:09:15] <Xirzon> are you guys logging this live?
[19:09:17] <brian0918> WiseWoman: Metadata doesn't mean you have to change how you do anything.
[19:09:17] <JamesF> We had just started to talk about meta-data.
[19:09:27] <JamesF> Xirzon> No, sorry. I'll post a log later.
[19:09:44] <JamesF> WiseWoman> LOM is an IEEE standard, though.
[19:09:57] <odile> brian0918: *which* metadata? what knid?
[19:09:59] <JamesF> WiseWoman> Educational meta-data.
[19:10:04] <odile> (kind)
[19:10:06] <brian0918> The kind that is extracted from templates used on pages
[19:10:07] <CXI> hmm
[19:10:09] * Angela (n=beesley@wikipedia/Angela) Quit (Excess Flood?)
[19:10:26] <brian0918> For example, any country or city pages with the proper template can have all
of that metadata extracted
[19:10:29] <brian0918> So, you could query for:
[19:10:31] <JamesF> odile> Everything from "this is a person" to "the following King of Prussia was
[[Wilhelm II]]".
[19:10:31] <CXI> so the idea is something like categories except you can tag an article with a key/
value pair or similar instead?
[19:10:32] <karynn> i am logging this.
[19:10:32] <Creidieki> JamesF, is there a page on meta or something that covers this?
[19:10:36] <brian0918> "Give me all cities with over 10,000 people."
[19:10:38] <brian0918> and it will work
[19:10:42] <nichtich> a method to better transform templates to metadata would be nice
[19:10:56] <Creidieki> brian0918, how is that template information different from wikidata?
[19:11:03] * research_log (n=nobody@p15122355.pureserver.info) has joined #wikimedia-research
[19:11:03] * ang2 (n=beesley@pD9535F73.dip.t-dialin.net) has joined #wikimedia-research
[19:11:12] <here> reading ( http://ltsc.ieee.org/wg12/ ) I agree that perhaps Learning Objects are
not what we are working with here
[19:11:15] <JamesF> Creidieki> [[IEEE LOM]].
[19:11:15] <here> , what would other options be?
[19:11:17] * ang2 is now known as Angela
[19:11:26] <Xirzon> ok, live log now at http://scireview.de/wiki/research/channel.log
[19:11:27] <WiseWoman> I don't care if it is a "Standard".
[19:11:30] <WiseWoman> There are so many of them!
[19:11:41] <brian0918> ...
[19:11:42] <JamesF> WiseWoman> Yes, that's true.
[19:11:49] <JamesF> Xirzon> Ah, right, thanks. :-)
[19:11:50] <odile> brian0918, JamesF: metadata can be very diverse,
[19:11:58] <WiseWoman> brian0918, that's what people believe.
[19:12:01] <odile> you have to limit yourself
[19:12:08] <JamesF> odile> Exactly. But our categories system can't be.
[19:12:15] <WiseWoman> This is part of the basis of Semantic Web, but much of this is just a pipe
dream IMHO.
[19:12:22] <Xirzon> ping me when wikidata comes up, otherwise I'll be busy
[19:12:38] <JamesF> Xirzon> Sure. It'll be in a few minutes, maybe 10.
[19:12:51] <WiseWoman> I spent many years of my life teaching a machine to do math proofs. Trying to
satify complex predicates is undecidable, and no amount of computing power will change
that.
[19:12:52] <nichtich> Xirzon: we're just talking about metadata aren't we?
[19:12:59] <JamesF> Another point of using IEEE LOM is that it's currently being used or considered
for use by educational institutions.
[19:13:10] <Xirzon> nichtich: wd is on the agenda for today
[19:13:15] <CXI> yeah, LOM looks like we're trying to cram the wrong sort of data into a space
[19:13:20] <Creidieki> JamesF, so you're suggesting that we add the ability for users to declare
named data fields on pages, and any two data fields with the same name on different
pages would be considered semantically identical?
[19:13:24] <CXI> an encyclopedia doesn't really strike me as a learning object as such
[19:13:30] <WiseWoman> CXI, exactly.
[19:13:50] <CXI> I mean, fair enough, multimedia content, instructional content - but learning
objectives? we're not supposed to have those...
[19:13:52] <nichtich> Who proposed to implement LOM?
[19:13:55] <JamesF> Creidieki> I'm merely discussing what has been suggested.
[19:14:14] <JamesF> nichtich> I think Gerard in particular was interested in our using LOM.
[19:14:15] <Creidieki> JamesF, all right, fair. Was my understanding correct?
[19:14:20] <WiseWoman> JamesF, hogwash. "Educational Institutions" have no idea what LOM is, but if
a learning management system say that it has LOM, then they nod their heads.
[19:14:38] <JamesF> Creidieki> I think it was meant to be more structured and less free-form than
that.
[19:14:47] <WiseWoman> Creidieki, not quite.
[19:15:19] <JamesF> Creidieki> As in, a fixed (centrally-managed) set of definable attributes (with
semantic and dependency relations) that can be attached to articles/multimedia objects
generally.
[19:15:33] <nichtich> Without an example of how to map a wiki article to LOM it looks like nothing
we can decide about at the moment
[19:15:52] <CXI> what kind of things would we be looking at in those attributes?
[19:15:56] <Creidieki> JamesF, so there'd be, like, a "Metadata" namespace somewhere, and it would
define that "Population" is an integer number associated with "Countries"?
[19:16:01] <WiseWoman> Here's an example of LOM: http://www.downes.ca/xml/RSS_LOM.htm
[19:16:03] <JamesF> nichtich> Well, the specifics of how it would work would entirely depend on
wikidata, I suppose.
[19:16:15] <JamesF> Creidieki> That sort of thing, yes.
[19:16:22] * hashar (n=hashar@wikipedia/Hashar) has joined #wikimedia-research
[19:16:24] <Xirzon> JamesF: http://meta.wikimedia.org/wiki/IEEE_LOM in case it hasn't come up yet -
Gerard thinks there could be funding for this
[19:16:30] <JamesF> Xirzon> Yeah.
[19:16:36] <WiseWoman> This would be identifying a string that is an ISBN number: <dc:identifier>
[19:16:36] <WiseWoman> <myVoc:ISBN>
[19:16:36] <WiseWoman> <rdf:value>0-226-10389-7</rdf:value>
[19:16:36] <WiseWoman> </myVoc:ISBN>
[19:16:36] <WiseWoman> </dc:identifier>
[19:16:39] <JamesF> The Dutch educational authorities are pushing hard on it.
[19:16:41] <Creidieki> JamesF, so it's like a cross between templates and categories.
[19:16:44] <nichtich> JamesF: i don't think so. First you have to think about what you want and then
how (Wikidata=how)
[19:17:00] <Creidieki> WiseWoman, I desperately hope that wouldn't be what the users see.
[19:17:11] <WiseWoman> Oh, yes, governments throw money at LOM research projects.
[19:17:12] <JamesF> nichtich> Yes, but the "example of it being applied" would be post-how.
[19:17:18] <nichtich> You can implement LOM-support in many ways but what is *meant* my "supporting
LOM"?
[19:17:19] <CXI> I'm all for integrating wikidata with wikimedia pages
[19:17:20] <brian0918> Creidieki, no, it would be much simpler, and if it was Template-side, then
they wouldn't have to touch it
[19:17:20] <WiseWoman> Creidieki, that is what you have to type in.
[19:17:26] <CXI> but that seems like a separate issue
[19:17:31] <WiseWoman> You can set up visual editors, but it is a pain in the backend.
[19:17:32] <Xirzon> Wikidata is a very specific project that is only marginally related to other
semantic web efforts.
[19:17:36] <JamesF> CXI> Yes, later.
[19:17:38] <brian0918> WiseWoman, in that example, yes, but we can reduce what needs to be typed
[19:17:43] <Xirzon> I hope "Wikidata" is not used as a general term for structured data in wiki :)
[19:17:46] <WiseWoman> And many LOM projects suffer from malformed input...
[19:18:13] <brian0918> brion gave the example of [[publisher::Joe Publisher]] as a much shorter
version
[19:18:17] <CXI> should we be trying to separate data (Fooland has a population of 3000) and
metadata (Fooland can be classified as a Country)?
[19:18:23] <WiseWoman> Xirzon, I thought it was - shall we maybe merge these two topics, metadata
and LOM?
[19:18:26] <Creidieki> So would categories be implemented as a type of LOM metadata?
[19:19:02] * Eskimo (n=Eskimo@wikipedia/Eskimo) Quit (Nick collision from services.?)
[19:19:05] <WiseWoman> brian0918, sorry to be such a pain, but there can be more than one publisher
(original book and paperback book), so how do you differentiate them in your method?
[19:19:12] * Eskimo (n=Eskimo@wikipedia/Eskimo) has joined #wikimedia-research
[19:19:29] <here> add another publisher.
[19:19:36] <brian0918> WiseWoman, that would require more thinking, but trashing the idea at the
first sign of difficulty is never the proper course.
[19:19:46] <Xirzon> WiseWoman: well, LOM is a specific instance of metadata, so yes. But Wikidata is
not for "metadata" per se. Much more simply put, it is a wiki engine layered on top of
arbitrary tables in a relational database
[19:19:47] <WiseWoman> You need more data to signify which one has what role.
[19:20:03] <JamesF> WiseWoman> The meta-data item would have an attached data fragment, too.
[19:20:04] <WiseWoman> The simple examples are always simple, but the real world tends to be
extremely complicated.
[19:20:09] <here> [[publisher::Joe One]] [[publisher::Joe Two]] is not enough?, hmm.
[19:20:23] <brian0918> WiseWoman: then [[pub1::Joe One]] [[pub2::Joe Two]]
[19:20:28] <JamesF> WiseWoman> So "Paperback publisher" as text, or perhaps a proper metadata tag
for that.
[19:20:29] <WiseWoman> Xirzon, can you say than in plain English for the rest of the world?
[19:20:38] <Xirzon> hmm
[19:20:43] <nichtich> I propose to wait with LOM until Wikidata and typed links are implemented -
than you can play around and try to do some LOM stuff
[19:20:48] <WiseWoman> brian0918, now I want the paperback publisher - which one is that, pub1 or
pub2?
[19:20:59] <WiseWoman> I need a proper definition, and that is where it gets hairy.
[19:21:02] <CXI> I got it - it's just wikimedia + arbitrary data instead of wikipedia + encyclopedia
articles
[19:21:02] <Xirzon> OK, I think the main distinction that people here have to understand is between
a project like Semantic MediaWiki
[19:21:04] <Xirzon> and Wikidata.
[19:21:16] <brian0918> well there will be limits to what can be determined, of course
[19:21:17] <CXI> er, s/wikipedia/wikimedia/
[19:21:17] <Creidieki> So these would still be part of the main article text? It seems like it
would work better to have metadata, categories, interwiki links, etc., in a separate
edit window.
[19:21:18] <Xirzon> Semantic MediaWiki utilizes the existing links system in MediaWiki to provide
information *about* links and resources.
[19:21:26] <WiseWoman> Xirzon, can you give us a synopsis?
[19:21:26] <JamesF> WiseWoman> Yes, which is why we definitely would not use things like
"[[publisher::Joe]]".
[19:21:30] <Xirzon> This is a real project with real code, and it is quite cool.
[19:21:30] <brian0918> You can't ask "which publisher has brown eyes and lives in Maine"
[19:21:44] <Xirzon> Is anyone working on the Semantic MediaWiki project present?
[19:21:56] <JamesF> Creidieki> Spinning off meta-data is a part of Wikidata. Possibly.
[19:22:00] <nichtich> I'm only following it
[19:22:03] <WiseWoman> Creidieki, definately! I was planning something like that for the Wikiversity
, I would have one extra tab for each of the many Metadata standards.
[19:22:05] <JamesF> Xirzon> Not as far as I am aware.
[19:22:07] <Xirzon> The demo site is at http://wiki.ontoworld.org/index.php/Main_Page
[19:22:30] <Xirzon> It has some Java dependencies and that was an issue due to Java's non-freeness
[19:22:31] <here> xirzon, i've visited and lookd around.
[19:22:55] <Xirzon> Some aspects of Semantic MediaWiki are painfully slow, I don't know if that is
just because a VM has to be loaded every time you do certain operations, or because the
implementation sucks
[19:23:17] <CXI> ah, interesting
[19:23:23] <Xirzon> In any case, Semantic MediaWiki is different from Wikidata in that its primary
intended use is the fluid addition of arbitrary metadata *inside the article text*.
[19:23:36] <Xirzon> But the software remains largely as it is, and you still basically just have a
text window to deal with.
[19:23:56] <Xirzon> It is not capable of modelling relations of a certain complexity, such as the
ones we need for the main Wikidata application, Ultimate Wiktionary.
[19:24:05] * soufron (n=soufron@vol75-8-82-233-239-57.fbx.proxad.net) has joined #wikimedia-research
[19:24:18] <JamesF> Hello soufron.
[19:24:22] <JamesF> OK.
[19:24:26] <Xirzon> Where you have words, and the user can add an arbitrary number of meanings to
these words, synonyms, translations, and so on. It's a form- and database-driven
application which is much more like traditional web-based database apps.
[19:24:30] <Xirzon> It's not very "semantic webbish".
[19:24:34] <WiseWoman> Xirzon, this is extremely important - will metadata be part of the text
window (like the categories and Persondata in the German Wikipedia), or in a separate
window?
[19:24:48] <JamesF> WiseWoman> No. And categories wouldn't, either.
[19:24:48] <Xirzon> WiseWoman: That depends on what you do with Wikidata - you can do it either way.
[19:24:53] <Xirzon> In Ultimate Wiktionary, for example
[19:24:57] <CXI> referring to http://wiki.ontoworld.org/index.php/San_Diego - that seems almost like
what we're dealing with here in terms of metadata
[19:24:58] <Xirzon> We have relations between words.
[19:25:06] <Xirzon> We have a table where you have the word IDs, and the relation types.
[19:25:17] <JamesF> CXI> Yes, exactly.
[19:25:21] <Xirzon> And in the UI, you will be able to link words together by selecting them and
selecting a relation type.
[19:25:35] <odile> sorry I have to leave, :-( bye
[19:25:38] <Xirzon> But, there's no reason why you shouldn't be able to just throw category links
into a text field, and have them parsed as in the past.
[19:25:48] <JamesF> odile> Bye.
[19:25:48] <Xirzon> in fact, that's relatively trivial.
[19:26:13] <Xirzon> Redesigning the tagging/category system is, to me, completely separate from
Wikidata.
[19:26:17] <WiseWoman> Xirzon, you have things here being relations that are not absolute, but only
relations over time.
[19:26:22] <Xirzon> It has design and user interface challenges of its own.
[19:26:30] <Creidieki> Xirzon, so categories wouldn't be a type of metadata?
[19:26:49] <JamesF> Xirzon> But it could certainly be moved into Wikidata and restructured there,
yes?
[19:26:52] <WiseWoman> For example, what exactly is a city? Is that defined by the number of people
living there? When does SD become a metropolis?
[19:26:54] <Xirzon> Creidieki: If you wanted, you could basically implement a category system on top
of Wikidata. We'll have to see if that is a practical/desirable approach.
[19:26:59] <Xirzon> One thing to keep in mind is that
[19:27:09] <Xirzon> when you move categories out of the text field, you will still want to version
them
[19:27:11] <JamesF> WiseWoman> That depends, of course, on locality.
[19:27:14] <WiseWoman> Creidieki, categories ARE metadata, they are just very unstructured metadata.
[19:27:17] <Xirzon> be able to revert to earlier revisions of category-page relations, and so on -
[19:27:32] <Xirzon> and as such, Wikidata (which provides this versioning of relations) may be a
desirable engine to run this kind of system.
[19:27:35] <JamesF> WiseWoman> In some countries, the UK for one, "city" is a legal term, and only a
few settlements get it.
[19:27:39] <Xirzon> But the main challenges for categories to me are
[19:27:40] <soufron> hello
[19:27:41] <Xirzon> a) localization
[19:27:45] <Xirzon> b) user interface.
[19:27:46] <soufron> hello JamesF
[19:27:57] <JamesF> soufron> It's a bit hectic right now in here.
[19:28:07] <Xirzon> Because auf a), we are focusing on Ultimate Wiktionary as a Wikidata application
, which can be used as a localization engine for categories.
[19:28:13] <Xirzon> Some of you may already have seen my demo for this.
[19:28:14] <WiseWoman> JamesF, exactly. I hadn't thought about the localization problem with a
global implementation yet, that is an added problem. Sigh.
[19:28:15] <soufron> JamesF, I can see that
[19:28:18] <soufron> :)
[19:28:28] <Xirzon> http://epov.org/uwd/index.php/Main_Page
[19:28:32] <WiseWoman> Just to ask a stupid question: what is the point of the semantic relations?
Just a game?
[19:28:46] <CXI> an important thing to remember here is that this still has to be something that
will be useful to users
[19:28:55] <brian0918> WiseWoman, there can be some ambiguity about everything. Again, this alone is
not a reason to dump the whole idea
[19:28:56] <nichtich> WiseWoman: what is the point of Wikipedia at all?
[19:28:57] <Xirzon> WiseWoman: In UW it is very much the opposite. For example, for a thesaurus you
need relations like "is broader term of" or "is narrower term"
[19:28:57] <CXI> I mean, as simple as our current system is, it's easy for people to model
internally
[19:29:03] <JamesF> WiseWoman> So the metadata tag "city", in this example, would actually have
different versions for different places. "City of the United Kingdom" might be one (and
would combine multiple facets). Etc.
[19:29:12] * leoadec (i=leoadec@201-1-135-198.dsl.telesp.net.br) Quit ("leaving"?)
[19:29:22] <Xirzon> WiseWoman: The first milestone in the Ultimate Wiktionary development is a
running version of GEMET, a European Union environmental thesaurus.
[19:29:24] <Creidieki> JamesF, wouldn't "City of the United Kingdom" want to be a type of city?
[19:29:32] <WiseWoman> brian0918, I want to make some points clear before everyone invests an
enourmous amount of effort into something which is intractable.
[19:29:47] <JamesF> Creidieki> Not really.
[19:30:02] <brian0918> WiseWoman: but you haven't suggested any alternatives. You have simply
identified all the problems.
[19:30:04] <WiseWoman> The Ultimate Wiktionary is a nice, small, focused area I can imagine would be
a nice test bed.
[19:30:14] <JamesF> Creidieki> It's a type of settlement, and it's a geographically-bound identifier
. But this is off-topic.
[19:30:19] <Xirzon> Small? You haven't seen Gerard's ERDs :-)
[19:30:21] <Creidieki> JamesF, well, fine, but we could use the metadata system to say that "City of
the United Kingdom" metadata is a "specific-localized-version" of "City" metadata,
right?
[19:30:28] <WiseWoman> brian0918, actually, there are more problems. And I don't know what the
problem is that we are trying to solve :-)
[19:30:39] <Xirzon> http://meta.wikimedia.org/wiki/Ultimate_Wiktionary_data_design
[19:30:42] <WiseWoman> Small as compared with the Wikipedia
[19:30:47] <nichtich> WiseWoman: Semantic relations *are* very complex and maybe overestimated but
they will be useful
[19:30:48] <JamesF> Creidieki> Oh, yes, exactly. So that would be a (quite standard) relationship
definition.
[19:31:02] <Creidieki> JamesF, So we could organize types of metadata with metadata.
[19:31:10] <JamesF> Creidieki> Definitely.
[19:31:13] <Xirzon> Actually, UW will be at least as large as Wikipedia if it is successful. We
already have sources containing hundreds of thousands of words which we will be able to
tap.
[19:31:20] <Creidieki> JamesF, sweet. It's all pretty like Category Theory.
[19:31:28] <Creidieki> JamesF, I am now in favor.
[19:31:53] * k1v1n (n=k1v1n@cpe-066-057-113-059.nc.res.rr.com) has joined #wikimedia-research
[19:32:06] <WiseWoman> Creidieki, but it is not as clean as Category Theory! Again: what is the
point of the metadata for the Wikitionary - I have not understood this yet, pardon my
ignorance.
[19:32:07] <JamesF> Creidieki> Good. That's one. :-)
[19:32:13] <JamesF> k1v1n> Hello.
[19:32:23] <Xirzon> Semantic relations are useful. I want to make clear that these types of
relations are only a sub-element of Wikidata.
[19:32:29] <JamesF> Xirzon> Absolutely.
[19:32:29] <k1v1n> hello, sorry i'm late. fell asleep watching a game.
[19:32:41] <Xirzon> The primary use of Wikidata is to break up the wiki page itself into application
-specific forms.
[19:33:08] <Xirzon> In the simplest case, say Wikinews, you just add a date field to the article
text, for example.
[19:33:11] <Creidieki> WiseWoman, you could probably use it for relations like "[Foo] is derived
from [Bar]", and then it would be easy to make word derivation graphs.
[19:33:20] <Xirzon> That already makes the wiki more useful for that specific application, namely
news.
[19:33:30] <Creidieki> WiseWoman, or "This word is the plural of that other word"
[19:33:44] <WiseWoman> Xirzon, but the fun thing of the Wikipedia is that it is Application
INDEPENDANT! You can use it as the basis for many things!
[19:34:01] <Xirzon> WiseWoman: Yes, but none of them properly ;-)
[19:34:30] <WiseWoman> Xirzon, do the right things, don't worry about doing things right!
[19:34:30] <Xirzon> WiseWoman: Seriously, "the wiki" as such sucks for a dictionary or a news site
or a quote collection. You need specialized technology for these specific apps.
[19:34:44] <Creidieki> WiseWoman, or "This word is defined from that language" -- it would give us a
way of organizing words by language in the actual information, rather than just having a
heading that says "==Spanish==".
[19:34:50] <Xirzon> WiseWoman: do the right things right sounds about right to me :)
[19:35:02] <WiseWoman> Xirzon, having just eyeballed the data structure for the dictionary you are
quite right.
[19:35:06] <CXI> wikidata makes a lot of sense to me from a wiktionary perspective
[19:35:08] <soufron> pfiou
[19:35:28] <Xirzon> Creidieki: exactly, but the power of UW is far bigger than that.
[19:35:34] <Xirzon> The power of UW is in automatic localization.
[19:35:43] <Xirzon> The way it works is by following from a meaning to the translations and
synonyms.
[19:35:50] <Creidieki> Xirzon, right, after you have the language information you can use it in
display preferences.
[19:36:04] <Xirzon> For example, when you have an infobox
[19:36:18] <WiseWoman> Xirzon, I don't believe that will ever be fully automatic.
[19:36:23] <Xirzon> you have something like "Country name: Germany". There's no reason this name
should not be localized automaticaly into Swahili.
[19:36:41] <Xirzon> WiseWoman: It can pretty much be for "label" type words and short phrases as
long as you specify which meaning you're referring to.
[19:37:02] <Creidieki> WiseWoman, the Category system shows the the power of bored people to
semantically organize information is immense.
[19:37:05] <Xirzon> But even for translators working with tools like Omega-T, Ultimate Wiktionary
can be an incredibly useful source of translation memories.
[19:37:23] <Xirzon> s/be/become :)
[19:37:33] <Xirzon> So, yes, this is the vision we're working towards, and it's a pretty big vision.
[19:37:42] <Xirzon> The good news is that we now have an additional developer, avar, doing things on
Wikidata.
[19:37:44] <WiseWoman> Creidieki, this is people who believe there is a mathematical basis of
everything, for example language :)
[19:37:56] * avar waves
[19:37:59] <CXI> I'm worried about wikipedia in terms of datacruft though... are we likely to have
stuff like [[Population:=6,000]] in articles?
[19:38:11] <CXI> talking metadata here, not wikidata
[19:38:12] <WiseWoman> Xirzon, as a tool - great. But not as an automatic translator.
[19:38:18] <Xirzon> The WIKIDATA branch in CVS is now installable thanks to avar, so you can take a
look at the way the new namespace manager works.
[19:38:19] <JamesF> CXI> It wouldn't probably be shown in article-editing view.
[19:38:25] <Creidieki> CXI, I don't understand your concern. Is having that a bad thing?
[19:38:44] <Xirzon> WiseWoman: You have to distinguish translating texts and translating labels.
When you add labels, and also add the meaning of the labels, you can then truly
automatically localize them.
[19:38:47] <WiseWoman> CXI - that's what I meant with dynamic data. Things change, someone will have
to clean up the cruft periodically.
[19:38:59] <Creidieki> CXI, are you asking about the "crap at the bottom of the article" syndrome?
[19:38:59] <CXI> Creidieki: well, it doesn't really strike me as useful as metadata
[19:39:06] <CXI> and yes, basically
[19:39:27] <Xirzon> The crap at the bottom of the article syndrome is worst with interlanguage
links.
[19:39:34] <Xirzon> And the interlanguage links system is bad, just plain bad.
[19:39:42] <Xirzon> It needs to be redesigned from scratch.
[19:39:50] <JamesF> Xirzon> Yes. Rationalisation of that is part of the intended fixes long-term.
[19:39:59] <Creidieki> I think they were talking about having a separate window for
articlebottomcrap, right? And that window could eventually absorb the interwiki links
and categories.
[19:40:11] <WiseWoman> Xirzon, but even labels can "self-destruct", for example the lable "Country
Name: Yugoslavia" needing to be broken up every now and then.....
[19:40:17] <CXI> the thing is, though, stuff like "foo has a population of x" strikes me as data
rather than metadata
[19:40:22] <Xirzon> JamesF: Absolutely, absolutely focus on single login now. It makes a million of
these centralization tasks easier.
[19:40:30] <JamesF> Yes.
[19:40:30] <k1v1n> agreed. that's not metadata.
[19:40:33] <JamesF> But brion isn't here.
[19:40:36] <JamesF> So, not today.
[19:40:41] <Xirzon> WiseWoman: That's what you have the Wiktionary for - if you change things, the
changes are reflected in the places where the labels are used.
[19:40:55] <Creidieki> No, but it's metadata because it's data *about the article*, rather than the
article itself.
[19:40:58] <CXI> so while I think it'd be nice to maintain stuff like that for wikidata purposes,
I'm not sure how useful it would be cramming it into our metadata system
[19:41:11] <CXI> it's not data about the article, though, it's data about the country
[19:41:24] <Xirzon> CXI: I'd love to have a "country factbook" application built with Wikidata.
[19:41:39] <Xirzon> The thing is - stuff like "Semantic MediaWiki" allows you to get reasonable
amount of structure quickly.
[19:41:39] <WiseWoman> Xirzon, but I won't want everything changed automatically! An article on the
history of Yugoslavia should not change name suddenly.
[19:42:09] <Xirzon> WiseWoman: Right. If you are changing the meaning, you have to create a new word
anyway :)
[19:42:21] <WiseWoman> Ted Nelson has written about this problem - sometimes you want the current
value, sometimes a previous value. I don't see that the work involved to get clean
metadata will pay for anything useful. Enlighten me!
[19:42:25] <Creidieki> Well, but all of the examples I gave ("[Foo] is a word in the language
[Spanish]") weren't metadata either, by that definition. What's the difference between
wikidata and the Metadata they're suggesting?
[19:42:34] * Eskimo (n=Eskimo@wikipedia/Eskimo) Quit (Read error: 54 (Connection reset by peer)?)
[19:43:02] <Creidieki> Strictly speaking, metadata should be things like "how often has this article
been viewed" and the ratings system and such.
[19:43:04] <Xirzon> The problem is that metadata is just too damn catchy a word and everyone likes
to use it when they really just mean "data" :)
[19:43:07] <CXI> because "is a word in the language" provides a mapping between two points of data
[19:43:12] <CXI> which makes it metadata
[19:43:29] <Creidieki> CXI, article != data. It provides a mapping between two articles.
[19:43:38] <Creidieki> CXI, it's still a piece of data about the word [Foo].
[19:43:43] <here> creidieki, i think metadata here is about making information already in the
article machine readable.
[19:43:44] * Eskimo (n=Eskimo@wikipedia/Eskimo) has joined #wikimedia-research
[19:43:57] <JamesF> Creidieki> No, that's metadata about the server's copy of the article.
[19:44:01] <CXI> well, if we're talking about metadata in articles then we have to be talking about
articles as data
[19:44:09] <JamesF> Creidieki> No, about the article and it's content
[19:44:13] <CXI> otherwise what's it metadata of?
[19:44:17] <Xirzon> heh :)
[19:44:29] * Xirzon loves semantic discussions about semantic web
[19:44:33] <k1v1n> the categories are a form of metadata. We just need more robust info.
[19:45:12] <here> i'm not convinced this is all useless, so where to go from here?
[19:45:24] <Xirzon> I don't really care what you call it, I care what you can do with it. Does it
give you scalable access to certain repeating information patterns or not?
[19:45:46] <Creidieki> So, do I have this correct? Some people are talking about just using Wikidata
for the metadata, and others are suggesting a separate standard (LOM)?
[19:45:53] <WiseWoman> here, one thing that I would love to see would be an extra tab for metadata.
[19:46:06] <Xirzon> I think the "typed links" approach is likely to be very good at revealing
structures, I'm not sure it's so good at accessing and creating them.
[19:46:11] * Xirzon sighs
[19:46:14] <WiseWoman> As said above - getting the junk from the bottom of the page factored out.
[19:46:15] <here> absolutely, even fork the article structure elsewhere and try overlaying.
[19:46:15] <nichtich> WiseWoman: yes
[19:46:26] <k1v1n> we're currently building a DC metadata extension for an enterprise wiki. We have
to have it.
[19:46:45] <nichtich> We should get all the interlaguage, categories and upcoming metadata stuff
into a seperate tab
[19:46:52] <Xirzon> k1v1n: is it going to be open source?
[19:47:02] <here> DC?
[19:47:08] <WiseWoman> Here, if there ever is embedded metadata I would insist on an overlay so that
newbies can still edit!
[19:47:16] <k1v1n> for example, just because you've done an edit doesn't make you an "author". The
current way author is determined doesn't cut it.
[19:47:20] <here> wisewoman: i agree, could even be a pref.
[19:47:23] <k1v1n> Absolutely open.
[19:47:29] <Xirzon> nichtich: more important is to share the metadata across language editions
[19:47:41] <JamesF> Yes.
[19:47:44] <JamesF> Absolutely.
[19:47:45] <CXI> actually, that's a really good point
[19:47:47] <CXI> wikidata-wise
[19:47:49] <k1v1n> Dublic Core
[19:47:57] <JamesF> Sharing of metadata between languages is vital.
[19:47:57] <k1v1n> s/dublic/dublin
[19:48:14] <WiseWoman> http://dublincore.org/
[19:48:15] <CXI> metadata or data? or are we not having a distinction? :D
[19:48:16] * notafish (n=Delphine@wikipedia/notafish) has joined #wikimedia-research
[19:48:19] <Xirzon> For interlanguage links, it's a no-brainer. For categories, it's more tricky,
of course.
[19:48:25] <Xirzon> Hello Delphine.
[19:48:26] <JamesF> Hello Delphine.
[19:48:28] * brain|food is now known as mind|concert
[19:48:29] <JamesF> Again.
[19:48:45] <Xirzon> JamesF: stop being my exact clone :-)
[19:48:57] <JamesF> Xirzon> Sorry. :-)
[19:49:23] <Xirzon> OK, so I can only invite you to look at the Wikidata branch and also wait for
the paper which I'm working on atm -
[19:49:39] * delphine (n=Delphine@wikipedia/notafish) Quit (Nick collision from services.?)
[19:49:44] <Xirzon> it should be finished by next week and will pretty much finalize the design of
the Wikidata engine.
[19:49:50] <CXI> the only reason I keep bringing this up is because I think it's important we be
careful about what we're trying to do - provide a mapping between articles or duplicate
the data in an article in a pure data form
[19:49:57] * notafish is now known as delphine
[19:49:59] <delphine> thank you and hmpf
[19:50:03] <WiseWoman> Here's a nice paper on Dublin Core for beginners:
http://dublincore.org/documents/usageguide/
[19:50:12] <here> cxi, are those mutually exclusive goals?
[19:50:18] <here> ww: thanks.
[19:50:26] <Xirzon> CXI: Semantic MediaWiki does both, for example.
[19:50:52] <Xirzon> http://wiki.ontoworld.org/index.php/San_Diego
[19:51:05] <CXI> I linked that a little while ago :P
[19:51:15] <CXI> I'm just trying to make sure I understand whether we're trying to do one, the other
or both
[19:51:20] <CXI> because only one of those two things is metadata
[19:51:34] <Creidieki> CXI, articles are types of data, right? There isn't any difference. It's
just like higher-level functions or whatever in PL.
[19:51:59] <Creidieki> Associating a number with an article, or a pair of articles, shouldn't be any
different than associating an article with that article.
[19:52:06] <JamesF> OK, are we making progress here or going around in circles?
[19:52:18] <Xirzon> CXI: We're definitely doing both, using a few different approaches.
[19:52:28] <Xirzon> JamesF: A bit of a semantic circle really.
[19:52:37] <JamesF> Xirzon> Yes, that's what I feel too.
[19:52:39] <nichtich> maybe we better move on to the next topic
[19:52:39] <Creidieki> JamesF, I think we all understand Metadata and Wikidata a lot better than we
did. Is there a specific issue you'd like us to try to address?
[19:52:41] <CXI> mm, sorry, I might be contributing somewhat to the circlyness
[19:52:59] <JamesF> Creidieki> I'm here to help you understand things better, and hopefully help
work towards them. :-)
[19:53:21] * here has pages of new reading material ;)... good for now.
[19:53:27] <JamesF> I've written a sort-of summary as we've been going along
http://meta.wikimedia.org/wiki/Wikimedia_Research_Network/Meetings/2005-12-03#Summary
[19:53:28] <CXI> so what we're thinking of trying to create then (with metadata, and not wikidata)
is the ability to associate arbitrary data with an article?
[19:53:37] <JamesF> Is it missing anything specifically?
[19:53:37] <k1v1n> we might have our metadata extension ready to demo by the next meeting. happy to
send some links before then.
[19:53:40] <Creidieki> JamesF, okay. So when do we get to the part where we spend Wikimedia's
money?
[19:53:48] <nichtich> JamesF: thanks
[19:53:54] <JamesF> Creidieki> That's what I was going to move us on to.
[19:54:00] <Xirzon> k1v1n: If you could post to wikitech-l about it, that would be cool.
[19:54:02] <JamesF> Xirzon> You might want to stay for this, perhaps. :-)
[19:54:08] <Creidieki> CXI, I'm mostly talking out of my very newly-gained understanding, please
don't believe anything I've said.
[19:54:14] * matusz (n=matusz@cko253.neoplus.adsl.tpnet.pl) Quit ("Koniec"?)
[19:54:16] <JamesF> k1v1n> That would be brilliant if possible.
[19:54:27] <CXI> :o) I'm basically doing the same, don't worry
[19:54:28] * k1v1n (n=k1v1n@cpe-066-057-113-059.nc.res.rr.com) Quit (Remote closed the connection?)
[19:54:37] <JamesF> OK, next topic.
[19:54:38] <JamesF> ------------------------------------------------------------------------------
[19:54:45] <Xirzon> Wikimedia has money to spend?
[19:54:54] <JamesF> I've been asked if I want to put forward a budget for next year.
[19:55:10] <JamesF> Xirzon> Not yet. But charities' budgets are aspirational.
[19:55:15] <WiseWoman> For the WRN?
[19:55:30] <JamesF> So, do any of you have any ideas for what the Research division could spend
money on?
[19:55:35] <Creidieki> I think that all of the Board Members and high-level research people should
have subscriptions to EB and Encarta, so we're familiar with how our competitors do
things.
[19:55:42] <Xirzon> JamesF: A decent peer review design paper.
[19:55:54] <Creidieki> Some of them may already have subscriptions from universities or whatever.
[19:55:57] <JamesF> One example was funding a dedicated server for analysing data.
[19:56:05] <Xirzon> That deals particularly with issues such as quickly changing articles.
[19:56:10] <here> like toolserver?
[19:56:19] <WiseWoman> JamesF, that is an excellent idea.
[19:56:22] <JamesF> here> Yes, but tasked to us exclusively.
[19:56:23] * k1v1n (n=k1v1n@cpe-066-057-113-059.nc.res.rr.com) has joined #wikimedia-research
[19:56:25] <Angela> Creidieki: a subscription tells you nothing about how they do things - only what
their output is.
[19:56:28] <here> sounds great.
[19:56:43] <CXI> indeed, a data analysis server sounds nice
[19:56:56] <JamesF> It would get copies of all the logs, etc., for crunching.
[19:56:58] <nichtich> But we can already use the toolserver for many tasks (not all)
[19:57:14] <JamesF> nichtich> We can, yes.
[19:57:20] <Creidieki> Angela, I meant more like "what decisions they've made about site design",
etc., than their editorial process.
[19:57:20] <Xirzon> I'd love to see money spent on people's brains rather than on CPUs for a change
[19:57:31] <JamesF> nichtich> But our needs would likely outgrow a single server.
[19:57:35] <WiseWoman> That would offer a data basis that could be used for teaching or research,
that would be lovely.
[19:57:43] <CXI> would it also be useful to use such a server for testing out proposed extensions to
wikipedia functionality?
[19:57:43] <nichtich> JamesF: yes
[19:57:47] <JamesF> nichtich> It could also be used for test sites, like the toolserver, too.
[19:57:56] <JamesF> Xirzon> That's possible, certainly, though harder.
[19:57:56] <Creidieki> Angela, or "what kind of coverage they've made of certain topics".
[19:57:56] <nichtich> Once there was a testserver
[19:58:13] <WiseWoman> Would spending money make single login happen faster /at all?
[19:58:14] <JamesF> Xirzon> When you said "peer review design paper", what did you mean?
[19:58:31] <Xirzon> JamesF: There's been lots of talk about "stable versions" and how much we need
them, but
[19:58:36] <JamesF> WiseWoman> We could certainly try for that.
[19:58:38] <Angela> Creidieki: there are enough Wikipedians interested in that who already have
access. I don't think there's a need to pay for this.
[19:58:44] <Xirzon> almost none of these discussions actually ever gets into the part of *how the
articles are meant to be reviewed*.
[19:58:50] <here> i second any support toward single login. not sure how to throw $ at it.
[19:58:55] <nichtich> ...nobody implemented it in a useful way
[19:59:01] <Xirzon> How is consensus reached, what specialized tools are needed, when and how and by
whom is the article flagged, etc.
[19:59:09] <Creidieki> Can we just stick bounties on all of the single login-related bugzilla bugs
and pay people if they fix them?
[19:59:09] <Xirzon> s/article/revision/
[19:59:14] <JamesF> WiseWoman> Essentially, fund Brion to an extent, then require him to spend some
time on it. But, he's not exactly slacking, so...
[19:59:18] <Xirzon> Brion is meant to be doing single login.
[19:59:22] <Xirzon> He gets paid by Wikimedia.
[19:59:27] <Xirzon> So he really just needs to be prodded a bit.
[19:59:31] <CXI> heh
[19:59:32] <Xirzon> OK, a lot.
[19:59:35] <CXI> pray a special prodding squad
[19:59:42] <CXI> s/pray/pay/
[19:59:44] <JamesF> Xirzon> He's very busy. All the time. :-(
[19:59:55] <Xirzon> JamesF: yes, but he's closest to the specs as they stand.
[20:00:04] <nichtich> I don't like the idea to pay someone to write a research paper on review
systems in Wikipedia
[20:00:08] <JamesF> CXI> Problem is, only Brion and Tim, really, would know enough and have
sufficient access and sway to actually make it happen.
[20:00:16] <JamesF> Xirzon> Yes, I know.
[20:00:27] <JamesF> Xirzon> Last I heard, he's essentially waiting for a green light and some time.
[20:00:35] <JamesF> Xirzon> And he's been given neither.
[20:00:44] <Xirzon> JamesF: green light from whom?
[20:00:48] <JamesF> nichtich> I'm not sure about it, either.
[20:00:49] <WiseWoman> nichtich, yes. Research papers should never be done for money.
[20:00:51] <Creidieki> Single login would be more the part of a development budget than a research
budget, though, right?
[20:00:55] <CXI> mm... how's wikipedia doing in terms of devs anyway?
[20:01:15] <JamesF> Xirzon> For robbing everyone of their logins and changing them all around? The
Board, I'd hope. I'm not wanting to take the flak for it personally, certainly. ;-)
[20:01:16] <WiseWoman> JamesF, does the WRN need to kick someone to turn on the green light?
[20:01:28] <here> what about outreach to the research community, any goals there?
[20:01:40] <Angela> I'm not aware of any opposition from the Board to single login.
[20:01:43] <here> (external research community)
[20:01:46] <JamesF> WiseWoman> We'd need to come to a conclusion on the exact best way to deal with
collisions.
[20:01:46] <Angela> It's really just up to Brion.
[20:01:47] <Xirzon> JamesF: you could just submit the specs for approval if you think it's needed
[20:01:53] <WiseWoman> Creidieki, it seems to be moving from research into development - that's why
both are often in the same department in companies.
[20:01:58] <nichtich> here: what kind of outreach?
[20:02:04] <Xirzon> nichtich / WiseWoman : Find someone who does his PhD on Wikipedia review
processes then. We need a good design for this or it is not going to happen.
[20:02:10] <JamesF> Xirzon> There are still different possible ideas and ways of doing it.
[20:02:41] <here> nichtich: unsure, but a primary goal of WRN.
[20:02:41] <Xirzon> JamesF: I thought the specs were fairly unambiguous at this point?
[20:02:42] <nichtich> I'm going to do my PhD on other aspects of Wikipedia - will you pay me?
[20:02:56] <WiseWoman> Xirson, or her..... actually, there are so many thesis topics up and running
at the moment about the Wikipedia, I am not sure that this is not being addressed.
[20:03:07] <Xirzon> nichtich: I wouldn't pay you even if you did naughty things with me :)
[20:03:21] <JamesF> Xirzon> Automatic collision reversal via what method? Number of edits? Primacy
of project? None at all? Etc.
[20:03:36] <Xirzon> JamesF: Collision reversal? What does tha tmean?
[20:03:51] <Eskimo> if 2 ppl have the same user name on different projects
[20:03:51] <Xirzon> JamesF: Primary of project - why would there be?
[20:04:08] <JamesF> Xirzon> User James on the English Wikipedia and user James on the Anglo-Saxon
Wiktionary, for example.
[20:04:09] <Creidieki> Could we make it clear that there would be support for students doing
research projects on Wikipedia? Would board members/developers/etc. be willing to pledge
some time to talk with people doing undergrad. or grad. research on aspects of
Wikipedia?
[20:04:13] <Xirzon> JamesF: Oh, that - we have a fairly well specified process for that.
[20:04:14] <JamesF> Off the top of my head, those are different people.
[20:04:28] <JamesF> Yes, but not agreement that that's the best way
[20:04:29] <Xirzon> JamesF: At some point you take the specs that are there and submit them for
approval.
[20:04:39] <Xirzon> If you think they need further discussion, put them up for a consensus
discussion or a vote.
[20:04:40] <JamesF> Xirzon> I will, I suppose.
[20:04:49] * JamesF nods.
[20:04:52] <JamesF> OK, enough of this.
[20:04:53] <Xirzon> But consensus is going to be almost impossible on this.
[20:05:06] * Pathoschild (n=Pathosch@207.162.58.44) Quit
[20:05:06] <Xirzon> There are religious discussions about whether decentralized or centralized
authentication is preferable.
[20:05:10] <WiseWoman> Creidieki, that could end up being a full time job.
[20:05:11] <Xirzon> You're not going to get people to agree.
[20:05:18] <Xirzon> I did a basic vote template for that a while ago.
[20:05:36] <Xirzon> But it seems fairly clear to me that most *users* would prefer to have *one*
Wikimedia identity.
[20:05:40] <JamesF> Xirzon> Yes, well, voting is evil. :-)
[20:05:41] <Xirzon> And not a confusing system with multiple identities.
[20:05:42] <Creidieki> So it seems like most of what we need in terms of a research budget is
developer time.
[20:05:59] <Creidieki> And a stats/testing server(s).
[20:06:06] <Xirzon> JamesF: See, that's what you get for having these crazy religious ideas about
voting - you don't get things done anymore ;->
[20:06:07] <nichtich> I think so
[20:06:11] <WiseWoman> Just to kick another idea around (if we are looking for ways to spend money):
how about if Wikipedians want to have a stand at a research conference?
[20:06:22] <CXI> has anyone proposed separating out wikimedia-wide logons and individual project
ones?
[20:06:23] <nichtich> or traveling costs
[20:06:28] <JamesF> Xirzon> Or I just declare it so. :-)
[20:06:36] <WiseWoman> They would need to pay a stand fee and printing of material, trying to
attract "real" reasearchs to the Wikipedia.
[20:06:47] <JamesF> CXI> That would be complicated and not achieve the intended goal.
[20:06:51] <WiseWoman> Might be too vague how to specify how you get money for this, though.
[20:06:52] <Xirzon> CXI: Well, to some extent they would be
[20:07:04] <Xirzon> CXI: The current single login specs are built around the existing MediaWiki
authentication plugin.
[20:07:10] <Creidieki> I thought that "interfacing with academia" wasn't one of the areas of the
Research committee, and that we were supposed to be researching about ways to improve
Wikimedia?
[20:07:18] <Xirzon> Which means that you have a centralized database of account names and passwords
[20:07:20] <JamesF> CXI> Because people would want "James" to be *that James* everywhere. Not *that
James* except where it's another, local, James.
[20:07:27] <Xirzon> and local databases of preferences and other user-related stuff.
[20:07:39] <WiseWoman> Creidieki, I meant speaking to potential researchers about improving the
Wikimedia.
[20:07:40] <CXI> which is why you wouldn't be able to create "James" as a single user logon
[20:07:40] <Xirzon> So that which should be global, would be global, and that which should be local
wouldn't.
[20:07:49] <Eskimo> JamesF> yes, there has to be a transition at some point
[20:07:55] <WiseWoman> But then I realized that it would be too difficult to target conferences,
that is better 1-1
[20:07:58] <JamesF> Creidieki> It's more my personal area as CRO than the Network's area, yes.
[20:08:13] <nichtich> WiseWoman: Yes, we have to specifiy that "improving Wikipedia" must be a focus
to get support
[20:08:13] <here> 'and to collaborate with researchers who are not part of the Wikimedia community.'
[20:08:26] <Creidieki> Okay.
[20:08:33] <Xirzon> JamesF: I think peer review specs could be the next focus after single login,
with payment or without.
[20:08:38] <JamesF> CXI> So, because of some stick-in-the-mud editor who won't change - or has left
- James, the founder of the ang. WKT, can't have a global account? That's a tad unfair.
[20:08:50] <JamesF> Xirzon> It's certainly on my list.
[20:09:02] <Xirzon> JamesF: This is *the* thing Wikipedia needs to get into academia.
[20:09:10] <CXI> James: well, which james should have priority, then?
[20:09:18] <Creidieki> Xirzon, what do you mean by peer review "specs"?
[20:09:19] <JamesF> What about LDC's ideas on developing and properly making Wikitax LR0-parsable?
[20:09:25] <CXI> it strikes me that any option in that regard is somewhat unfair wrt "sorry, there's
another, more important james"
[20:09:27] <WiseWoman> Xirzon, I think that is a realistic focus point.
[20:09:38] <JamesF> CXI> Yes, that's the rub of it exactly.
[20:09:46] <WiseWoman> There are actually researchers wo do work on peer review, for example
Wolfgang Coy at the Humboldt University.
[20:09:54] <JamesF> CXI> But, in essence, we have to make that decision.
[20:09:58] <JamesF> CXI> But more on this later.
[20:10:05] <CXI> sure, sorry for derailing :P
[20:10:06] <Xirzon> Creidieki: As I said before, the basic goal is simple: to have "stable, reviewed
revisions". But how do you achieve that consensus, who can implement it, how do you deal
with minor vs. major changes, rapid vs. slow ones, do you want specific functionality
for sources, etc.
[20:10:24] <nichtich> WiseWoman: peer review in sciene is a whole research area - but Wikipedia is
not science
[20:10:38] <Xirzon> There's also algorithmic information that can be collected to aid in finding
potentially unreliable articles. That should fall within the peer review specs.
[20:10:48] <Creidieki> Xirzon, Okay, so you're talking about peer-reviewing articles, and that's
separate from the simultaneous discussion about having someone write a peer-reviewed
article about Wikipedia?
[20:10:49] <Xirzon> Jamseday is a person to talk to here, he's got some excellent ideas.
[20:10:57] <Xirzon> Creidieki: ooooh, absolutely.
[20:11:12] <Eskimo> Single login has to be unfair at some point
[20:11:12] <Eskimo> one of the james has to be renamed as James@en.wikipedia, for instance
[20:11:13] <k1v1n> there was a pretty good paper at wikimania on factors determing quality. Seems to
me our goal should not to be to replicate the old peer-review, but develop algorythms
that are just as predictive.
[20:11:30] <Xirzon> k1v1n: I think we need both, but I'd be happy to see progress in either
direction.
[20:11:39] <WiseWoman> Xirzon, I have suggested a simple version: I digitally sign a version of an
article and this is shown in the history - I, WiseWoman, put my hand in the fire that
this article is correct.
[20:11:43] <k1v1n> xirzon: agreed.
[20:11:48] <JamesF> Eskimo> Yes. Or, all Jameses get renamed like that, and then there's a massive
scramble to get the global one of your choice. :-)
[20:12:03] <Xirzon> WiseWoman: I believe in a "wiki"-like process where there's true collaboration
rather just competing individuals.
[20:12:15] <JamesF> WiseWoman> But how many of our articles can that apply to?
[20:12:17] <nichtich> WiseWoman: that method was also suggested by others - we only need it
implemented
[20:12:21] <CXI> which, I think, is the reason why you have to just say that some logins can't be
single-user
[20:12:25] <here> currently [[Stable versions]] on en:
[20:12:27] <WiseWoman> Xirzon, but the point of a review is that an "expert" says that this or that
is true.
[20:12:35] <Xirzon> WiseWoman: In the traditional sense, yes.
[20:12:45] <Eskimo> JamesF> maybe one of the james can choose a new nickname :)
[20:12:50] <JamesF> WiseWoman> Or, at least, that a panel of experts fail to notice the falsehoods.
[20:13:03] <WiseWoman> JamesF, actually, there are quite a lot of good articles on wikipedia (in
both English and German).
[20:13:03] <Xirzon> In the collaborative sense, it would be that a community of people who claim to
be knowledgeable about a subject area have taken a look and reached a consensus.
[20:13:10] <Eskimo> (sorry, I feel off topic, we're supposed to be talking about peer review ?)
[20:13:23] <JamesF> WiseWoman> Yes, but the average proper article will have several hundred,
possibly a thousand facts.
[20:13:26] <Creidieki> So is the idea of this peer-review system tied in to some verification of
credentials? That sounds...interesting.
[20:13:38] * mav (n=chatzill@c-71-56-70-11.hsd1.ga.comcast.net) has joined #wikimedia-research
[20:13:41] <WiseWoman> JamesF, oh, the article can still be false, or wrong! That is often the case.
[20:13:43] <JamesF> WiseWoman> Can you /really/ say, truthfully, that you are entirely convinced of
the veracity of every single one.
[20:13:47] <JamesF> mav! Heya
[20:13:50] <Xirzon> JamesF: the notion of highlighting these facts individually is quite interesting
. would be a nice UI challenge as well.
[20:13:55] <Xirzon> hey mav
[20:14:01] <WiseWoman> Creidieki, I have to be non-anonymous and my reputation is my credentials.
[20:14:03] <JamesF> We're discussing possible budget ideas. Except we keep getting distracted. :-)
[20:14:13] <mav> hi
[20:14:15] <Xirzon> oh yeah, I think I stole the topic ;-)
[20:14:17] <Xirzon> sorry about that.
[20:14:21] <JamesF> Xirzon> And then providing proper citations for every single one, five times
over? :-)
[20:14:41] <Creidieki> Well, does anyone on the research board need any books? We could buy them
books.
[20:14:46] <Xirzon> JamesF: Exactly. I think getting everything "maximally cited" is worthwhile.
[20:14:48] <WiseWoman> JamesF, I don't believe anything - I am a scientist. But I get a lot of good
information as a basis, and then verify from there. I would never believe a single
source for anything.
[20:15:06] <here> remind me how we are spending money on this?
[20:15:12] <JamesF> WiseWoman> I did specifically say "convinced of the veracity" rather than
"believe to be true". ;-)
[20:15:20] <CXI> I don't think we really need to try to spend money on things
[20:15:26] <Xirzon> here: at some point you'll need an implementation, obviously.
[20:15:27] <JamesF> WiseWoman> But I fully agree about multiple-sourcing.
[20:15:35] <CXI> the fact of the matter is that research doesn't actually cost that much
[20:15:41] <Xirzon> so even if you don't want to spend money on the design, you can at least spend
money on the code.
[20:15:44] <CXI> for us, anyway
[20:15:50] <WiseWoman> JamesF, can one ever be convinced of the veracity? We could go on about that
all night :-)
[20:15:54] <JamesF> CXI> You mean, volunteers' research costs nothing. :-)
[20:16:00] <Creidieki> I mean, if someone on the research board or development team decides they
need an O'Reilly book about something or "Inside Encyclopedia Britannica" or something,
they shouldn't have to pay for it themselves. I think we should have a modest book
budget (maybe $500 or $1000 per annum).
[20:16:20] <JamesF> WiseWoman> Eurgh, don't get me started. My uncle's a professor of the History
and Philosophy of Science. :-)
[20:16:20] <CXI> I mean that research when the researchers are unpaid and we're working with free
resources like wikimedia projects
[20:16:36] <Xirzon> I think the way the WRN can be primarily useful is in collaboratively
recommending a certain focus of development, and working on rough specifications.
[20:16:43] <JamesF> Creidieki> Possibly, but most of that kind of information is freely-available.
[20:16:46] * mav_______ (n=chatzill@c-71-56-70-11.hsd1.ga.comcast.net) has joined
#wikimedia-research
[20:16:51] <JamesF> Hello mav.
[20:16:55] <JamesF> Again.
[20:16:56] <Xirzon> I don't know to what extent money should go into the design and specification
process; I would certainly be in favor of it, but there are obviously problems.
[20:17:03] <mav_______> argh
[20:17:05] <Creidieki> JamesF, okay, maybe I'm the only one who like paper books anymore.
[20:17:07] <k1v1n> A good budget idea is one that Jakob raised. Maybe we should ask for funds to
subsidize graduate student travel to wikimania (and other meetings) to present results/
participate.
[20:17:12] <JamesF> Creidieki> No no, I love them. :-)
[20:17:18] <Eskimo> bounties could be a great way of making people work on these projects
[20:17:25] <JamesF> mav> Any ideas, before you crash again? :-)
[20:17:33] <Xirzon> Eskimo: not in the sense of "multiple people have to submit code and the first
one to do so gets the bounty"
[20:17:35] <Creidieki> JamesF, I agree that the information is freely available, but I think that
developers work more efficiently with the right reference books.
[20:17:44] <Xirzon> Eskimo: that's n ot a good model - a task has to be "locked" when someone is
working on it.
[20:18:00] * mav_______ is now known as maveric149
[20:18:02] <Eskimo> Xirzon> yes
[20:18:05] <Creidieki> Xirzon, why not? I've been trying to say that we should add a "Bounty" field
to every bug on Bugzilla.
[20:18:38] <Xirzon> Creidieki: because it is demotivating to start with - you don't know if someone
else will claim the money before you, and your work time therefore has been for nothing -
and if you lose, you might never do another task again.
[20:18:46] <Xirzon> Cooperation over competition.
[20:18:49] <Creidieki> People can't donate *towards a specific bug* right now. There are a lot of
bugs I care about, and would be willing to donate effort towards, but don't want to fix
myself.
[20:19:07] <JamesF> Creidieki> Bug 550, you mean? :-)
[20:19:07] <Xirzon> Contractual development like we do it for Wikidata now works.
[20:19:13] <Xirzon> Competitive bounties, I'm very skeptical of.
[20:19:30] <maveric149> can bugs be voted no?
[20:19:32] <maveric149> on
[20:19:41] <Xirzon> there's a voting system in bugzilla, yes
[20:19:41] <Eskimo> Creidieki> you can add a comment "I give $20 to the dev who fixes it" :)
[20:20:00] <Creidieki> Eskimo, I've tried that once or twice, but I really wish there were support
for it.
[20:20:02] <here> *earlier note about peer review versions on en:, actually [[Wikipedia:Stable
versions]]
[20:20:08] <maveric149> mini contracts would be better
[20:20:19] <Xirzon> here: there are tons of pages about peer review
[20:20:20] <Creidieki> I mean look at bug 550! 53 people have bothered to vote for it! Those
people *care* that it gets fixed.
[20:20:34] <maveric149> give somebody a task and tell them how much they will get paid beforehand
[20:20:46] <Xirzon> maveric149: exactly
[20:21:01] <maveric149> I'm all for that
[20:21:03] <Creidieki> I'd be okay with minicontracts as long as we have some way of harnessing
people's collective care.
[20:21:14] <Xirzon> Jimbo is not a big fan of mini-contracts.
[20:21:26] <Xirzon> That's why they've been mostly happening outside Wikimedia's care so far.
[20:21:36] <Xirzon> He believes in long-term employment as in Brion's case to reduce friction among
volunteers.
[20:21:57] <Xirzon> Of course, the problem with long-term eployment is how you decide the
priorities.
[20:22:08] <Xirzon> When does anyone tell Brion "implement bug 550"?
[20:22:08] <Eskimo> I guess Wikimedia can't hire an other dev ? :)
[20:22:28] <maveric149> looking at top-vote getters in bugzilla is one place to look
[20:22:54] <Xirzon> There is currently effectively no process for "telling Brion what to do".
[20:23:01] <maveric149> can't hire? No reason why not
[20:23:17] <maveric149> only Jimmy has that authority
[20:23:17] <Creidieki> We told him "keep the servers from catching on fire", right? Isn't that
what he's doing fulltime?
[20:23:29] <Xirzon> he's doing loads of different things.
[20:23:31] <maveric149> and working on MediaWiki
[20:23:50] <JamesF> Hmm.
[20:23:54] <Xirzon> bugfixing is a huge part of his mediawiki work.
[20:23:57] <JamesF> Yes.
[20:23:59] <Xirzon> and integration of incoming code.
[20:24:03] <JamesF> And security management, especially.
[20:24:05] <JamesF> Yes, that too.
[20:24:13] <JamesF> Integration alone is almost a full-time job.
[20:24:15] <Eskimo> so, we have to ask Santa Jimbo for a new dev
[20:24:23] <Xirzon> it's true that it would be good to have an additional developer who could focus
on new feature deveopment, yes.
[20:24:40] <JamesF> Xirzon> But they'd have to work very closely with Brion, so...
[20:24:50] <Xirzon> ask Tim
[20:24:55] <nichtich> we should ask brion for his opinion too
[20:25:05] <maveric149> Erik ; I've been pushing for just that
[20:25:43] <maveric149> I've already done a bit of ground work in that area
[20:26:03] <maveric149> it is up to the devs I asked and the board
[20:26:12] <nichtich> I bet we'll talk about a third or fourth developer next year in the same
way ;-)
[20:26:13] <maveric149> I can't say anything else
[20:26:14] <Xirzon> there still needs to be a process by which it is quickly decided what tasks
should be worked on.
[20:26:20] <Xirzon> with a new developer or without.
[20:26:25] <Creidieki> donations! Donations to a task!
[20:26:32] <JamesF> Xirzon> Only the Board can order Officers about.
[20:26:41] <Eskimo> this way, 2 problem are solved
[20:26:45] <Xirzon> JamesF: not scalable
[20:26:50] <here> xirzon: that would be extremely nice.
[20:26:52] <Eskimo> more donations, more features :d
[20:26:55] <maveric149> money should not direct developement; need and community want should
[20:27:03] * k1v1n (n=k1v1n@cpe-066-057-113-059.nc.res.rr.com) Quit (Remote closed the connection?)
[20:27:05] <JamesF> Xirzon> Then Chief Officers need to have sub-ordinates. That's how all
organisations work and scale.
[20:27:18] <JamesF> Dan> Absolutely.
[20:27:32] <Xirzon> JamesF: what the board could scale to do is approve recommendations by the WRN
[20:27:35] <brian0918> What are we talking about now?
[20:27:49] * k1v1n (n=k1v1n@cpe-066-057-113-059.nc.res.rr.com) has joined #wikimedia-research
[20:27:52] <Xirzon> perhaps together with the CTO
[20:27:52] <JamesF> Xirzon> We can always petition the Board to direct others, yes.
[20:28:13] <JamesF> But it's the Board's perrogative. It certainly isn't mine or the Network's. :-)
[20:28:20] <Creidieki> maveric149, I agree that a bug shouldn't be confirmed/accepted because of
money. But after it's been accepted as something that should be fixed, "community
willingness to donate money" starts to look a lot like "community need and want".
[20:28:21] <Xirzon> JamesF: not so much direct them in any specific way other than saying "do what
the WRN suggests"
[20:28:29] <JamesF> brian> How to focus developer activity.
[20:28:50] <JamesF> Xirzon> I think that that would be a mistake.
[20:28:56] <JamesF> We're here to advise and help develop.
[20:28:59] <Xirzon> I think a WRN consensus approach might be cooler than just using votes to figure
out what should be done.
[20:29:07] <JamesF> Not decide exactly what should be done and how.
[20:29:11] <Xirzon> The WRN can look at votes, but it can also look at feasibiity.
[20:29:11] <maveric149> I just don't want to see rich individuals try to push through features that
are not popular or needed
[20:29:16] <Xirzon> It can look at overall importance etc.
[20:29:39] <Xirzon> JamesF: who decides then?
[20:29:46] <JamesF> It's very much not our place to want to ask the dev.s to do feature development
when the wiki is falling down.
[20:30:02] <Creidieki> maveric149, at the moment, individuals who are good programmers can already
use their resources to "push through" features, by donating patches. How would that be
different?
[20:30:03] <Xirzon> that's why the CTO would have to approve all recommendations
[20:30:06] <JamesF> Xirzon> Brion is his own master except where directed by the Board, just as I
am.
[20:30:16] <Xirzon> (the CTO is Brion, for the record)
[20:30:17] <JamesF> Xirzon> And just as all the other Officers are. :-)
[20:30:24] <JamesF> No.
[20:30:27] <JamesF> Brion is the CDO.
[20:30:33] <Xirzon> the what now?
[20:30:34] <JamesF> Domas is the CHO.
[20:30:40] <JamesF> Chief Development Officer.
[20:30:44] <Xirzon> silly title
[20:30:49] <JamesF> No, specific title.
[20:31:02] <Xirzon> in any case, he could look at WRN recommendations and decide whether he agrees
or not.
[20:31:14] <maveric149> Creidieki ; I trust the devs to do what is right ; the way things are right
now works fine; add money to the mix and things could go very very wrong
[20:31:15] <JamesF> "Technology" includes Hardware (Domas), Development (Brion), even Research (me).
[20:31:17] <Xirzon> if he agrees, the board might or might not have to give its stamp of approval.
[20:31:18] <WiseWoman> So what are we recommending?
[20:31:21] <JamesF> mav> Exactly.
[20:31:45] <JamesF> Xirzon> Brion's a member of the Network too, and I talk with him about
developments all the time.
[20:31:56] <JamesF> Xirzon> But I don't want us to think that we can order other people about. :-)
[20:32:04] <Xirzon> JamesF: that's good. now how do you get a specific thing like single login
*done*?
[20:32:05] <Creidieki> maveric149, I don't agree that things are fine now. I care a lot about
wikimedia, there are bugs and missing features that plague me constantly, and in order
to help out with that I would have to delve into the Wikimedia source and spend 1-2
weeks getting up to speed.
[20:32:33] <JamesF> Xirzon> By asking Brion to do it, and him managing somehow to find the time
despite all the other pressures.
[20:32:38] <maveric149> we just have to be careful, that is all
[20:32:40] <Xirzon> it's not about ordering people around, it's about having a consensus process
for deciding wha tto do.
[20:32:56] <Creidieki> maveric149, maybe we could have the chief developers "open a bug for
donations" or something.
[20:32:57] <JamesF> Xirzon> It's not about consensus, it's about what the Board wants.
[20:33:02] <maveric149> I'm leaning toward Erik's view
[20:33:19] <Xirzon> JamesF: slow down, you can't have it both ways. either you have to talk to the
board or you don't. which is it?
[20:33:27] <maveric149> Creid; that could work
[20:33:28] <JamesF> Xirzon> What?
[20:33:34] <Creidieki> Um, sorry for the newbie question, but is Erik one of the people currently
here?
[20:33:43] * mav (n=chatzill@c-71-56-70-11.hsd1.ga.comcast.net) Quit (Connection timed out?)
[20:33:45] <Xirzon> Creidieki: that would be me
[20:33:56] <Creidieki> Xirzon, okay, I was wondering. Thanks.
[20:33:56] <JamesF> Creidieki> Possible, but still has the Iwant-noIwant-noIwant issue.
[20:34:00] <Xirzon> in the case of single login, I'd say you have to get both the board's approval
and brion's consensus
[20:34:10] <Xirzon> in the case of a minor but important feature, it might be enough to just get
brion's consensus
[20:34:18] <WiseWoman> Why don't we invent "Wikimoney"?
[20:34:20] <Creidieki> JamesF, in terms of multiple developers trying to solve the problem at once?
[20:34:26] <JamesF> Creidieki> Yeah.
[20:34:36] <maveric149> we could just add it to the budget
[20:34:37] <WiseWoman> People get paid "Wikidollars" for working on something, or Wikicents for
trivial little things.
[20:35:01] <Xirzon> we had that :)
[20:35:04] <WiseWoman> People can put some of their Wikimoney towards getting bugs fixed or for
getting an article on a topic written.
[20:35:09] <JamesF> Xirzon> What I mean is, the Research Network cannot demand something to be done
- it can ask the CDO, or ask the Board to demand of the CDO. I'm sure we're in agreement
here.
[20:35:11] <Xirzon> WikiMoney. It was a spectacular failure
[20:35:18] <JamesF> WiseWoman> We tried that 3 years ago or so.
[20:35:22] <JamesF> It failed horribly.
[20:35:25] <here> wow, any pointers?
[20:35:26] <JamesF> No-one cared.
[20:35:27] <Xirzon> JamesF: I used the term "recommendations"
[20:35:27] <maveric149> CDO?
[20:35:31] <here> called wikimoney?
[20:35:33] <JamesF> mav> Brion. ;-)
[20:35:36] <JamesF> here> Yes.
[20:35:47] <JamesF> here> Designated by a phi symbol, IIRC.
[20:35:55] <Creidieki> WiseWoman, the normal users need a lot of features added. Developers don't
necessarily need as many articles edited.
[20:35:57] <WiseWoman> Then there is a defined method for getting WIkimoney payed out in cash :-)
[20:35:59] <JamesF> Xirzon> That's why were' in agreement.
[20:36:14] <JamesF> Umm.
[20:36:14] <WiseWoman> JamesF, maybe it would work better now? Why did it fail?
[20:36:17] <JamesF> s/were'/we're/
[20:36:18] * maveric149 is now known as mav
[20:36:43] <JamesF> WiseWoman> No-one cared about earning wikimoney, really. It's somewhat counter-
cultural to the volunteer mentality.
[20:36:43] <Xirzon> ok. So it's basically clear what needs to be done -
[20:36:50] <Creidieki> JamesF, that's a difficult problem.
[20:37:04] <JamesF> Creidieki> Yes.
[20:37:08] <Xirzon> a) as soon as the budget is there, hire another full-time developer who can
focus more on feature development
[20:37:28] <Creidieki> JamesF, I'll spend some time thinking about that and get back to you.
[20:37:32] <Xirzon> b) the wrn together with the devs tries to hack out specs for things that are
worth implementing and submits them either to the paid devs, the board, or both.
[20:37:36] <JamesF> Creidieki> Cool. :-)
[20:38:46] <Xirzon> as for what specs, I suggest that the single login stuff is pushed forward, and
after that, focus could shift towards crucial issues like peer review.
[20:39:33] <mav> how about a list of development priorities?
[20:39:56] <Xirzon> There is [[m:Development tasks]].
[20:40:26] <mav> I'm moreso interested in seeing something come from this dept
[20:40:42] <mav> a needs/wants/wish list
[20:40:50] <Xirzon> yep
[20:41:37] <k1v1n> it would be nice to see some of the reporting tools that Erik and Jakob have done
built into mediawiki. Where people could manipulate variables and get different views,
etc.
[20:41:39] <mav> we could then ask the devs to look at that and tell us just how hard it will be to
do each item
[20:41:55] <mav> then we can plan
[20:42:12] <WiseWoman> I have to go, have a nice rest-of-chat!
[20:42:17] <Xirzon> bye WiseWoman !
[20:42:22] <mav> bye :)
[20:42:23] <here> bye wisewoman
[20:42:26] * WiseWoman (n=weberwu@p54BD000A.dip0.t-ipconnect.de) has left #wikimedia-research
[20:42:27] <Creidieki> Have fun, WiseWoman.
[20:42:29] <Creidieki> drat.
[20:42:30] <JamesF> mav> "The Network should in future discuss and come to agreement on priorities
in the non-critical task list (that is, other than bugs, security holes, etc.), possibly
stratified (needs/wants/wouldbenice/wishes) and with dependency links ("centralised
inter-wikis depends on Wikidata and Single User Login")."
[20:42:48] <JamesF> mav> Just written to
http://meta.wikimedia.org/wiki/Wikimedia_Research_Network/Meetings/2005-12-03#Summary
[20:42:57] <mav> cool :)
[20:43:12] <Xirzon> mav: by when do you think a new developer position is realistic?
[20:43:13] <JamesF> This writing-the-minutes-as-you-go works quite well with IRC meetings, actually.
[20:43:23] <mav> next quarter
[20:43:32] <nichtich> k1v1n: I better prefer to keep tools seperate and plug them to MediaWiki using
interfaces . The XML dump is such an "interface"
[20:43:46] <Creidieki> I imagine we'll have a good idea after this fundraiser?
[20:43:47] <mav> but that depends on if Jimmy wants to hire other people as well
[20:44:12] <Creidieki> (Does anyone know where the fundraiser details are, btw? I haven't been able
to find anything except the #Wikipedia title message.)
[20:44:37] <JamesF> Creidieki> Mav knows, but it's Secret. Or something. :-)
[20:44:42] <mav> :)
[20:44:51] * JamesF looks all conspiratorial.
[20:44:54] <mav> the fund drive starts on Friday 9 December
[20:45:04] <JamesF> Ah, it's public knowledge now?
[20:45:09] <mav> there is a page for it on meta
[20:45:15] <JamesF> (Well, it definitely is /now/, but...)
[20:45:16] <JamesF> Ah.
[20:45:25] <delphine> it's always been
[20:45:27] <delphine> ;)
[20:45:29] <mav> I'm going to announce that after this meeting
[20:45:35] <delphine> JamesF you're to much in the secret stuff
[20:45:35] * JamesF nods.
[20:45:39] <delphine> *too
[20:45:47] <JamesF> delphine> Yes, but I forget everything. So... :-D
[20:45:49] * cimon (n=cimon@wikipedia/Cimon-avaro) has joined #wikimedia-research
[20:45:58] <JamesF> Hello cimon.
[20:46:06] <mav> actually, it was already annoced after the last board meeting, but we pushed the
date back a week since then
[20:46:07] <JamesF> I think we're sort-of finishing up right now, actually. :-)
[20:46:12] <cimon> hello, my crappy ISP is crap
[20:46:23] <here> would anyone like to comment on the share watchlists idea
[20:46:24] <here> ?
[20:46:24] <JamesF> cimon> Ah.
[20:46:32] <mav> my IRC connection is crap
[20:46:49] <JamesF> here> Isn't that mostly a thing Cormac is interested in? He sent his
apologies...
[20:46:58] <mav> oh god ; shared watchlists? that would kill the servers
[20:47:00] <here> actually i rewrote the whole page
[20:47:04] <here> mostly my deal
[20:47:08] <JamesF> Ah, OK.
[20:47:14] <here> [[m:Share watchlists]]
[20:47:16] <Xirzon> mav: not necessarily
[20:47:30] <JamesF> mav> You want a budget? We could just demand more servers! ;-)
[20:47:36] <here> i think rather than code changes, a script workaround to easily copy users
watchlists to a subpage would be best
[20:47:41] <here> (not a priority)
[20:47:47] <cimon> are you going to post th log?
[20:47:58] <JamesF> cimon> Yes, of course.
[20:48:04] <cimon> nice
[20:48:13] <JamesF> cimon> In-promptu minutes are already up at
http://meta.wikimedia.org/wiki/Wikimedia_Research_Network/Meetings/2005-12-03
[20:48:26] <JamesF> Lagging behind the discussion by about 10 minutes or so, as I get around to it.
[20:48:55] <JamesF> here> Would shared watchlists be part of a multiple-watchlist system?
[20:49:05] <here> not necessarily.
[20:49:10] <JamesF> here> So you'd have a few private and maybe a few public watchlists.
[20:49:15] <JamesF> "High vandalism targets".
[20:49:21] <JamesF> "Policies"
[20:49:26] <here> actually i'd like to see users actual watchlists shared.
[20:49:30] <JamesF> Might be general public lists.
[20:49:34] <here> not necessarily thematically
[20:50:00] <here> see on [[en:User:Here/watchlist]]
[20:50:02] <JamesF> Whereas "stuff I've been editing and/or have randomly watch-listed" would be a
private list (and what we all currently have).
[20:50:11] <Angela> a form of shared watchlists exists at
http://en.wikipedia.org/wiki/Wikipedia:Watch
[20:50:17] <Angela> I don't know if they're widely used though.
[20:50:26] <Xirzon> here: so why have you been watching [[gay midget photography]] exactly?
[20:50:27] <k1v1n> i like the idea of people sharing their watchlist. That would be like
del.icio.us bookmark sharing. That could be interesting. Who else is interested in what
you're interested in.
[20:50:31] <here> "stuff I've been editing and/or have randomly watch-listed" is exactly what I want
to see
[20:50:32] <here> and share
[20:50:34] <nichtich> How about letting each user define the status of his watchlist?
[20:50:40] <JamesF> Xirzon> Certainly, yes, there are privacy issues.
[20:50:46] <here> k1v1n exactly, i think we are missing a huge community builder here
[20:50:49] <k1v1n> who else watches this page...
[20:50:56] <Angela> Sorry, that might be the wrong link. It seems mostly about schools. There was
another one by topic.
[20:50:57] * mav_ (n=chatzill@c-71-56-70-11.hsd1.ga.comcast.net) has joined #wikimedia-research
[20:50:58] <here> it is absolutely opt-in first and foremost
[20:51:14] * Solensean is now known as Solaway
[20:51:21] <cimon> xirzon: and why do you care? :P
[20:51:39] <here> xirzon: because i'm interested, or editted it.
[20:51:42] * mav_ is now known as maveric149
[20:51:56] <Xirzon> cimon: people who are nosy about the interests of others rarely have a
justification
[20:51:58] <here> i think that the privacy issue is way overblown here, and that enough folks would
actually be into this.
[20:52:15] <Xirzon> those who want to share their watchlist can put it on their user page already.
[20:52:19] <Xirzon> and use related changes.
[20:52:19] <here> the results would be a user-user network of watched pages (both friends, enemies,
leaders, etc)
[20:52:28] <here> xirzon, no one does this as it takes too much effort.
[20:52:39] <here> all of these concerns are addressed at [[m:Share watchlists]]
[20:52:46] <here> including summary of an extensive discussion from 2003
[20:53:03] <here> what is needed is an easy script workaround and uses for the resulting data
[20:53:06] <JamesF> here> Multiple lists could allow "non-privacy-invading" sub-lists, perhaps.
[20:53:17] <maveric149> shared whitelists would be a better idea
[20:53:18] <nichtich> It should be able to 1) opt-in to publish your watchlist 2) add/remove single
articles to you public watchlist
[20:53:19] <Angela> I'm not sure that sort of network would be useful. If you look at social
networking sites, a lot of people have the sole aim of getting 1000 people on their
friends list. With shared watchlists, the aim might turn from watching for vandalism to
trying to show who your "friends" are in the hope they watch your page too.
[20:53:31] <here> hmm.. perhaps i really am overestimating individuals willingness to share
everything on their list
[20:53:41] <JamesF> Angela> Yeah, there is that.
[20:53:53] <here> nichtich: that would be nice, but not necessary
[20:54:09] <here> angela: i think that noise would be minimal and/or interesting
[20:54:32] <maveric149> my watchlist is already too big ; why would I want to add somebody else's
watchlist to that?
[20:54:35] <nichtich> Maybe you want to publish your watchlist but not [[gay midget photography]]
[20:54:42] <brian0918> It seems like shared watchlists could end up being used to push agendas in
the same was as AIW or ADW
[20:54:55] <nichtich> what is AIW / ADW?
[20:54:58] <here> there is also the ability to see what others are watching, ideally through an xm
l output, and even see their related changes.
[20:55:09] <Angela> It could also lead to accusations of stalking if you're watching a lot of stuff
that another user edits a lot.
[20:55:09] <brian0918> Association of Inclusionist/Deletionist Wikipedians
[20:55:09] <k1v1n> the reason people opt in on del.icio.us is not to connect with other individuals,
it is to find what other people with similar interests also find interesting. It's a
mechanism to "discover" new and interesting things.
[20:55:31] <brian0918> They point like-minded people's attention to votes on AFD in the hopes to
corral votes in their favor
[20:55:31] * Eskimo (n=Eskimo@wikipedia/Eskimo) Quit (Read error: 104 (Connection reset by peer)?)
[20:55:40] <brian0918> Shared watchlists would probably do the same thing.
[20:55:40] <here> k1v1n: exactly.
[20:55:44] <Angela> we have WikiProjects to discover those things and for people working on similar
topics to "meet"
[20:56:10] <Xirzon> Angela: the thing about WikiProjects is that they exist in the wiki namespace
and are reviewed to be in line with our policy goals
[20:56:13] <here> would anyone present be interested in sharing their entire watchlist (other than
myself?)
[20:56:30] <Xirzon> I'm not sure that would be the case with the way any shared watchlist system is
used that is layered on top of everything else
[20:56:39] <k1v1n> sure.
[20:56:44] <nichtich> Do we speak about sharing or just making it accesible like user contributions?
[20:56:52] <Xirzon> if it's just a software feature rather than integrated into the site properly,
there is the serious risk of it being used to coordinate sock puppetry, GNAA and other
nonsense
[20:56:56] <here> accesible, in that meaning.
[20:57:06] <here> xirzon: how?
[20:57:26] <Xirzon> here: groups forming and using their shared watchlists to coordinate votes,
reverts, etc.
[20:57:38] <nichtich> I'm waiting for a method to get my watchlist as RSS for years - so you can
combine watchlists from several wikis
[20:57:49] <Xirzon> here: don't get me wrong - I think *grouping* is essential, and better tools for
grouping are a good thing to work towards.
[20:57:49] <here> xir: seems like that would be equally possible on the positive side?
[20:57:58] * k1v1n (n=k1v1n@cpe-066-057-113-059.nc.res.rr.com) Quit (Remote closed the connection?)
[20:58:00] <here> nicht: this is highly related to watchlist RSS feeds.
[20:58:17] <Xirzon> here: I'm not necessarily opposed to shared watchlists on principle. I just
think what goes on should be transparent - the groups that form should be public, just as
the watchlists themselves.
[20:58:37] * k1v1n (n=k1v1n@cpe-066-057-113-059.nc.res.rr.com) has joined #wikimedia-research
[20:58:41] <Xirzon> and then I'm not sure if what we have with wikiprojects is not already very
close to those goals.
[20:59:08] * soufron5 (n=soufron@vol75-8-82-233-239-57.fbx.proxad.net) has joined
#wikimedia-research
[20:59:31] <here> I'm just extremely curious what watchlist networks actually look like
[20:59:35] <nichtich> Xirzon: do you think of groups as watchlists that are owned by multiple
people?
[20:59:43] <Xirzon> nichtich: yes
[21:00:03] <Xirzon> that's one example of what could constitute a group
[21:00:05] <here> i think it could provide insight into community, how people use watchlists, and
sharing difficult to find pages -- without much negative
[21:00:11] <Xirzon> but wikiprojects already do this to some extent using topic lists
[21:00:18] <Xirzon> which are then checked using Related changes.
[21:00:36] <Xirzon> you can do this for categories, too.
[21:00:45] <nichtich> Categories are also a way to group articles and make them watchable
[21:00:47] <JamesF> Yes.
[21:00:54] * sannse (i=sannse@wikipedia/Sannse) Quit (Read error: 104 (Connection reset by peer)?)
[21:01:24] <Xirzon> group-forming will be very relevant when it comes to peer review, because then
you have to notify people that an article in an area they're an expert in is under
review.
[21:01:25] <nichtich> but it's much more easier to add/remove an article to a watchlist than to
add/remove a category
[21:01:37] <here> nichtich: precisely.
[21:01:52] <Xirzon> nichtich: maybe it should be made easier to add categories then ;-)
[21:02:02] <nichtich> Xirzon: I agree to yoir sentence before "because"
[21:02:02] <here> again "those random pages on your watchlist" is exactly why this is significantly
different than groups and/or related changes vaults.
[21:02:17] <JamesF> Hmm.
[21:02:26] <here> and why manually copying this info is not enough, b/c no one will do it.
[21:02:44] <nichtich> Xirzon: that also
[21:03:09] <Xirzon> here: I'm sure plenty of the researchers here will be interested to look at
the patterns in watchlist data, yes.
[21:03:30] <Xirzon> that's more of an academic project without an immediate practical benefit we
can predict, I think.
[21:03:50] <here> xirzon: yes, i would agree
[21:04:07] <JamesF> That's another thing, BTW.
[21:04:11] <JamesF> On the agenda.
[21:04:20] <nichtich> Xirzon: but we don't want vandals to know wich pages are not watched
[21:04:33] <JamesF> How to reconcile analysis of watchlist data with our privacy policy.
[21:04:41] <JamesF> But we can skip that for now.
[21:04:43] <Xirzon> nichtich: not sure - if we are faster than the vandals (and we are more) we
can make sure that they are.
[21:04:45] <JamesF> It's quite late.
[21:04:49] <Xirzon> but a thing which could be done fairly quickly
[21:04:52] <JamesF> This meeting is starting to drag a little.
[21:04:59] * mav (n=chatzill@wikipedia/maveric194) Quit (Read error: 110 (Connection timed out)?)
[21:05:06] <Xirzon> is to basically implement an opt-in for sharing watch lists, and then export
all those which are shared, and let them be analyzed by researchers.
[21:05:07] <JamesF> ... as Mav crashes.
[21:05:27] <here> ok, well thanks for the feedback.... i'll try to garner more support and work on
a script workaround.
[21:05:35] <here> any tips or resources for such a script would be appreciated.
[21:05:43] <JamesF> Xirzon> Previous meetings have seem to come the the consensus that opt-in
watchlists are by their very nature inappropriately skewed.
[21:05:55] <here> jamesF: i don't see any other way of doing it.
[21:05:56] <nichtich> Do we agree that some method to opt-in to publish your watchlist is wished
(and maybe more functionality)?
[21:06:10] <here> necessary evil of self selection, same as many systematic bias issues here
[21:06:29] <k1v1n> it probably should be an opt-in for each item on your watchlist that you would
be willing to share.
[21:06:44] <JamesF> here> Just take the data and have the data analysed by a trusted party (one of
the developers, perhaps) running provided scripts and returning only the results?
[21:06:45] <Xirzon> JamesF: Yes, it's probably uselessly skewed if you wanted to do a study on, say,
relations between articles on sexual topics
[21:06:52] <here> k1v1n, perhaps -- but honestly i'd reather encourage full disclosure.
[21:06:54] <Xirzon> JamesF: but that doesn't mean any analysis of it would be worthless.
[21:06:57] <JamesF> Xirzon> And thus high-view topics.
[21:07:03] <nichtich> JamesF: you can better anonymize the data
[21:07:08] <here> if that had been the case upon arrival, it would be the norm, and the network
would be heavily used!
[21:07:09] <k1v1n> you might want to share some (say a research watch page, but not a political
watch page), but not all.
[21:07:11] <JamesF> Xirzon> 'Cos our sex-related topics are well-read.
[21:07:14] <here> (full watchlist sharing)
[21:07:27] <JamesF> nichtich> Not really. Almost users will watchlist their own page.
[21:07:35] <Xirzon> JamesF: and well-illustrated.
[21:07:36] <k1v1n> there are sex pages on Wikipedia? I had no idea.
[21:07:41] <nichtich> damn
[21:07:53] <here> heck, even anonymous sharing would be fine!
[21:07:53] <cimon> just have a statistical breakdown; page watched by x byroucrats, Y admins, and
z logged in users.
[21:07:58] <JamesF> nichtich> Certainly, there will be a very strong correlation between the edits
they make (public information, don't forget), and the pages they watch.
[21:07:59] * soufron (n=soufron@vol75-8-82-233-239-57.fbx.proxad.net) Quit (No route to host?)
[21:08:14] <nichtich> The number of watchers of each page is already very interesting for reasearch
and us
[21:08:19] <JamesF> nichtich> I think that it is /very/ unlikely that you couldn't guess who my
watch list is of, for instance.
[21:08:28] <JamesF> Yes, exactly.
[21:08:36] <nichtich> JamesF: you're right
[21:08:38] <JamesF> But we seemingly can't release the watchlist data publically.
[21:08:45] <k1v1n> true, anonymous would work although a person could probably put two-and-two
together by looking at the edits.
[21:08:54] <JamesF> So we'd have to have a developer (or me, or someone) do that part of it.
[21:08:57] <here> k1v1n: true.
[21:09:04] <JamesF> k1v1n> Exactly.
[21:09:09] <nichtich> But opt-in data with gaps is better than no data
[21:09:11] <Xirzon> oh, one big skewing factor would be people who have the [ ] Watch all my edits
option turned on.
[21:09:17] <JamesF> nichtich> Yes.
[21:09:20] <Xirzon> You'd probably want to filter based on that.
[21:09:23] <JamesF> Xirzon> Yes, like me.
[21:09:35] <here> Xirzon: that data is important.
[21:09:36] <JamesF> Xirzon> Except I switch it off as appropriate.
[21:09:49] <Xirzon> here: yeah, but you want to be able to analyze it separately
[21:09:50] <here> again, how people use them, and "all that random junk"
[21:09:51] <cimon> JamesF: it would be useful to have even the watchlist data of a sampling of pages
,, not all of them...
[21:09:55] <k1v1n> good research there on what people are willing and not willing to share.
[21:09:57] <JamesF> Xirzon> Pages I edit but /don't/ watch is probably quite interesting, actually.
[21:10:07] <JamesF> cimon> Interesting idea.
[21:10:36] <here> Bug 727: Number of watching users per page to be shown on recent changes
[21:10:42] <here> (fixed using Enotif)
[21:11:14] <cimon> maybe not identifying even the article in question but for the category it
belongs to, and/or the length of it and/or the number of incoming and exiting links...
[21:12:00] <cimon> if you obscure the title of the article, and the identity of the persons watching
it, there is no privacy issue
[21:12:22] <here> what data is left?
[21:12:29] <cimon> quite a bit.
[21:12:45] <here> #watchers alongside other individual article metrics?
[21:13:02] <cimon> right, sort of..
[21:13:17] <JamesF> cimon> And it also becomes almost useless.
[21:13:30] <JamesF> Well.
[21:13:36] <cimon> I don't necessarily agree. :)
[21:13:44] <JamesF> Unless you also record number of bytes and number of categories, or whatever.
[21:13:52] <JamesF> At which point the original problem strikes again.
[21:14:05] <JamesF> Because you can work out backwards from that point which article you've got data
for.
[21:14:51] <JamesF> And with a dozen articles it wouldn't be a problem, but with a random 10% of the
database, say, you could see what the anonymous "editor #34576" watches, and guess who
they were.
[21:14:56] <JamesF> Again breaking privacy.
[21:15:00] <JamesF> It's a bugger of a problem.
[21:15:41] <JamesF> OK, well.
[21:15:45] <JamesF> We seem to have stopped.
[21:15:49] <JamesF> Yes?
[21:15:52] <nichtich> yes
[21:15:52] <here> thanks for sticking around for that ;)
[21:15:56] * Solaway is now known as Solensean
[21:16:10] <JamesF> OK, if there's nothing else, I will declare this meeting closed.
[21:16:17] <JamesF> Yes?
[21:16:22] * here nods
[21:16:24] <Xirzon> aye
[21:16:24] <k1v1n> soundg good.
[21:16:27] <JamesF> OK, closed.