User talk:Lustiger seth/archive001
Crosslink
editmoved from user talk:wiki_seth. -- seth 01:40, 13 August 2008 (UTC)
Hi Seth - can you provide a link to prove you own the Lustiger_seth account on dewiki please? Just add something to your userpage like "I own Wiki seth on Meta wiki". Just to make sure you're not an impersonator (which I'm sure you're not!) Thank you. Majorly talk 14:15, 9 August 2008 (UTC)
- you are right. i guess that this is enough. -- seth 23:19, 11 August 2008 (UTC)
- Eu possuo o Wiki seth no Meta wiki Maykel31 (talk) 04:55, 27 October 2024 (UTC)
splitting logs
editWould you split Spam blacklist/LogPre2008 too? Running off to work now, or would do it myself. — Mike.lifeguard | @en.wb 09:57, 25 August 2008 (UTC)
- i started it, but have got to leave now, too. ;-) -- seth 11:42, 25 August 2008 (UTC)
- shall we really join both lists? the wiki-source would be small, but the rendered html-file would be quite large. -- seth 19:15, 25 August 2008 (UTC)
- No, I'd say there is no need to have anything further back than the beginning of 2008 on Spam blacklist/Log. We should make Spam blacklist/Log/2008, Spam blacklist/Log/2007, Spam blacklist/Log/2006 so the log is split up somewhat, I think. — Mike.lifeguard | @en.wb 19:34, 25 August 2008 (UTC)
- shall we really join both lists? the wiki-source would be small, but the rendered html-file would be quite large. -- seth 19:15, 25 August 2008 (UTC)
Senseless entry removal
editThe log entry here doesn't work:
[0-9]+\.[-\w\d]+\.info/?[-\w\d]+[0-9]+[-\w\d]*\] # lustiger_seth # removal; senseless, see request
Can you try to figure out what it was trying to block & see if it can be fixed rather than removed? Thanks. — Mike.lifeguard | talk 18:11, 6 September 2008 (UTC)
- hi!
- see Talk:Spam_blacklist#double.2Fwrong_entries (search for "before i start modifying the list").
- the regexp never did, what it should have done, although it was corrected^Wmodified at least one time. as the original request is more that 2 years old, i guess those domains don't need to be blocked any longer, because otherwise the sbl would be full of such entries. -- seth 19:34, 6 September 2008 (UTC)
- No, domains remain blacklisted until a request to de-list them is accepted. Please try to figure out what that regex should have been and fix it instead of removing it. Thanks. — Mike.lifeguard | talk 22:59, 6 September 2008 (UTC)
Log entry
editThis one doesn't have the oldid specified in the template. Can you try to fix it? — Mike.lifeguard | talk 11:58, 8 September 2008 (UTC)
- oops, thx! -- seth 12:22, 8 September 2008 (UTC)
Temp sysop
editYou have the tools, again. Congratulations. Alex Pereira falaê 17:26, 9 October 2008 (UTC)
- :-) -- seth 10:36, 14 October 2008 (UTC)
non-capturing patterns
editAre we sure that using non-capturing patterns helps efficiency? I don't think it does (asked VVV about it at one point) - but would you ask someone to make sure? If it doesn't make a difference, I'd prefer to leave the ?: out; it makes things easier to read. But if it does help, then "Great!" — Mike.lifeguard | @en.wb 01:42, 14 October 2008 (UTC)
- Hi!
- Although the extension is written in php, I guess it is quite similar to perl's regexp engine. However, I assume that php is not better in this than perl. ;-)
- the php manual keeps shtum about most performance-increasing hacks, but the perl-manual says:
- WARNING: Once Perl sees that you need one of $& , $` , or $' anywhere in the program, it has to provide them for every pattern match. This may substantially slow your program. Perl uses the same mechanism to produce $1, $2, etc, so you also pay a price for each pattern that contains capturing parentheses. (To avoid this cost while retaining the grouping behaviour, use the extended regular expression (?: ... ) instead.) [...] (perldoc perlre)
- so "Great!"? :-) -- seth 10:25, 14 October 2008 (UTC)
- Sounds good to me. — Mike.lifeguard | @en.wb 17:28, 14 October 2008 (UTC)
IRC - regex help
editHi Seth, I seem to have some problems with 'catching links' from diffs, and some regex help would be appreciated. Do you have access to IRC, and if so, could you join us in #wikimedia-external-links or in #wikipedia-spam-t ?? Thanks! --Dirk Beetstra T C (en: U, T) 17:25, 3 December 2008 (UTC)
- I'm there now. -- seth 18:17, 3 December 2008 (UTC)
- ...was... ;-) perhaps it's better to e-mail me. -- seth 19:34, 3 December 2008 (UTC)
- Hmm .. we seem to have different times online. It is nothing urgent, but I'll try and explain the problem.
- The linkwatcher-bots (perl scripts) that I am running retrieve every diff, and extract from that the texts that are added and removed, and clean them from tags. That all goes fine. They then fill arrays using regexes of patterns we want them to catch, one array on the removed part, one on the added part. The 'catches' that are not in the removed part but are in the added part are the things that are stored in a database, evaluated (counts etc.) and reported to IRC (see #wikipedia-en-spam and #cvn-sw-spam). The special cases go to other channels (#wikipedia-spam-t for english alerts; some high level alerts to #wikimedia-alerts, and bot-evaluated spam cases to #wikimedia-external-links) and to reports (via COIBot, another bot I run).
- At this moment I have 4 regexes catching things from the diffs:
(?:http:\/\/)?[^\s\]\[\{\}\\\|^~`<>]+?@\w+(?!\.htm)(?:\.\w+){1,3} (?<![\w\d-])(?<!isbn[\s])\d{3,5}[\s-]\d{2,3}[\s-]\d{4}(?![\d\w-]) ftp://[^\s\]\[\{\}\\\|^~`<>]+ http://[^\s\]\[\{\}\\\|^~`<>]+
- The last two can be combined, and I am thinking about adding more. The problem lies in the first two. Example, applying that regex to this diff gives the following results:
- 415 (14982, 37, 3, 1)
- 839 (14982, 27, 3, 1)
- 6885 (14982, 26, 3, 1)
- 646-227-4900 (AL: \d{3,5}[\s-]\d{2,3}[\s-]\d{4}) (14982, 25, 3, 1)
- http://world.wikia.com/wiki/user:jimbo_wales (RL: \bwikia\.com\b) (14970, 47, 1, 1)
- http://en.wiki (14970, 322, 4, 1)
- (the numbers between the brackets are counts for the specific catches). It should catch 646-227-4900, that looks like a telephone number / social security number. But the three three other numbers are a string "+1 415 839 6885", which should be caught (or at least the 415 839 6885 part of it) by the second regex. I am not sure where the problem is, but I am afraid it is in the brackets around the lookbehind/lookforward. They seem to result in certain cases in 'secondary' catches (the regex is performed as "@array = $added =~ m/$regex/sig;").
- Another problem (for which I coded a workaround) is that links 'http://www.somewhere.com/someone@somehow.sometime' are caught both by the first and fourth rule in the list above (the one for email addresses and the one for http-links). The workaround is now that after regex I check if the a previous regex did not catch the same string. But it would be nice to have it work a bit more strict.
- I wondered if you would be able to help me with this. I can adapt things in some of the channels in runtime (the rules are in a database as well). Thanks already! --Dirk Beetstra T C (en: U, T) 10:45, 4 December 2008 (UTC)
- Adding, I am generally online during worktime (from about 9-10 in the morning until about 5-6 every afternoon; if I guess your timezone correctly, you should add one hour to those times; I am in Cardiff, Wales, UK), and some time around that (but irregular). We may see each other there. Extra info: I can set up the bots (BigWikiLW2) to send de specific 'spam' to an own channel as well (see #wikimedia-external-links). If you are interested, tell me where and I will set it up. --Dirk Beetstra T C (en: U, T) 11:01, 4 December 2008 (UTC)
- Hi!
- 1. (second regexp)
- Your second regexp looks good. I don't see any assertion-problem. the parentheses belong to look-ahead and look-behind assertions and there is no capturing, if that is what you were worrying about.
- You strip the tags from the added text of the diff, and you said "that all goes fine". But what is the result e.g. of the given diff? What is done with {{nowrap|1=<span style="text-align:left">+1 415 839 6885</span>}}? Will it become just "+1 415 839 6885"?
- In that case you can see, that
perl -e "$_='+1 415 839 6885'; my @array = $_ =~/(?<![\w\d-])(?<!isbn[\s])\d{3,5}[\s-]\d{2,3}[\s-]\d{4}(?![\d\w-])/sig; print $array[0];"
(windows-code! in bash you have to invert the " and ')- will result in "415 839 6885" and not just "415". So I guess your bug is not inside the regexp. I guess I need more code to be able to analyse that in a better way.
- 2. (e-mail and web addresses)
- The first regexp shall catch e-mail addresses only? Why do you start with
(?:http:\/\/)?
then? you could use zero-width negative look-behind assertions or you cold build one big regexp from those four regexp, but in reverse order, so that http-addresses will get cought with higher priority than e-mail addresses. - 3. (addtion)
- timezone: you're right, here it's UTC+1 now (CET). but I can't give you regular times, sometimes I'm here, sometimes I'm not. :-) But probably the next 2 weeks I'm here very often after 11:00 (utc) and before 20:00 (utc). -- seth 12:25, 4 December 2008 (UTC)
- concerning 1. Ok, via chat you found out, that the problem was a split by " " after a join by " ". I guess, the workaround by splitting and joining by "_" is not good either, because e-mail and web addresses may contain "_". Perhaps it would be better to use "\" (or one or more signs of '[]{}'), because it will never be a part of your matches. Btw. why don't you allow "~" in web addresses? "~" may be part of urls. -- seth 13:23, 4 December 2008 (UTC)
When blacklisting
editPlease use the template. The SBL stats tool uses it (& makes things easy to parse at-a-glance). Thanks — Mike.lifeguard | @en.wb 02:23, 12 February 2009 (UTC)
- Ok, thx. (I didn't know that.) -- seth 18:10, 12 February 2009 (UTC)
Good catch
editThanks for saving my butt :) — Mike.lifeguard | @en.wb 02:03, 26 February 2009 (UTC)
Adminship
editHello Seth. Please fill in your details at Template:List of administrators. —Anonymous DissidentTalk 10:35, 8 April 2009 (UTC)
Notice of review of adminship
editHello,
In accordance with Meta:Administrators/Removal and because you have made fewer than ten logged actions over the past six months, your adminship is under review at Meta:Administrators/Removal/October 2009. If you would like to retain your adminship, please sign there before 2009-10-9. Kind regards, —Anonymous DissidentTalk 15:10, 2 October 2009 (UTC)
- Hi!
- Thx for that notice. I signed there. -- seth 07:19, 3 October 2009 (UTC)
Spam search
edites
editHi!, I'd like to report you a possible bug on your tool. When I try to search on wp-es this message apppears:
« Content-type: text/html
got an error: not a valid file: http://es.wikipedia.org/wiki/MediaWiki:Spam-blacklist »
Best regards Dferg (T-ES) 12:14, 3 January 2009 (UTC)
- thx! should be fixed now.
- reason was: es-wiki doesn't use
<pre>
, but uses<pre class="...">
in its lists. -- seth 16:21, 3 January 2009 (UTC)- Thank you very much Dferg (T-ES) 21:13, 3 January 2009 (UTC)
- Sorry, the tool still doesn't work. Dferg (T-ES) 11:50, 8 January 2009 (UTC)
- Thanks again. You are right, every entry below
-nude\.blogspot\.com
is ignored. I'll have a look at that in a few hours. -- seth 12:25, 8 January 2009 (UTC)- es:MediaWiki talk:Spam-blacklist#wrong_syntax. I could write a work-around. But I guess, a simple revert is much more comfortable for all. -- seth 15:45, 8 January 2009 (UTC)
- Thanks again. You are right, every entry below
- Sorry, the tool still doesn't work. Dferg (T-ES) 11:50, 8 January 2009 (UTC)
- Thank you very much Dferg (T-ES) 21:13, 3 January 2009 (UTC)
- Done Dferg (T-ES) 16:12, 8 January 2009 (UTC)
ru
editI got an error trying to check ruwiki:
got an error: Reference to nonexistent group in regex; marked by <-- HERE in m/https?://+[a-z0-9_.-]*(?:\1 <-- HERE host\.in/st\.exe)/ at ./grep_regexp_from_url.cgi line 280.
— Mike.lifeguard | @en.wb 00:25, 8 January 2009 (UTC)
default=meta
editAny chance you can change the default to check only Meta? Normally that's all the data we need - one can of course request further data from the tool as required. Thanks. — Mike.lifeguard | @en.wb 02:07, 12 January 2009 (UTC)
- At the moment the default is the only way to get some info about XBotLink. SO I better attach XBotLink to one of the languages/projects. Should I add it to "meta" or to "w:en"? -- seth 09:55, 12 January 2009 (UTC)
- It's part of enwiki, I'd say. — Mike.lifeguard | @en.wb 05:46, 29 January 2009 (UTC)
- done. -- seth 14:29, 29 January 2009 (UTC)
- It's part of enwiki, I'd say. — Mike.lifeguard | @en.wb 05:46, 29 January 2009 (UTC)
caching
editAnother: Can you have the tool cache any pages it fetches live (when a user requests a "purge")? Then, when replicated data is older than the cached version, you can use the cached data (and still offer to fetch newer data). When replicated data is newer than the cache, throw it away, since it's useless. This avoids fetching live data on Meta's spam blacklist multiple times unnecessarily.
Just thinking now... the toolserver replicated database doesn't contain page text, I thought... so where are you getting the contents of the page? — Mike.lifeguard | @en.wb 20:43, 9 February 2009 (UTC)
- I'm caching all data already. I use the mirror function of LWP::UserAgent, see [2] (search for "mirror").
- I'm using the SBL-urls like http://meta.wikimedia.org/wiki/Spam_blacklist. -- seth 02:27, 10 February 2009 (UTC)
trim
editCould you have it trim leading/trailing whitespace from input? — Mike.lifeguard | @en.wb 15:56, 20 March 2009 (UTC)
- done. -- seth 02:10, 21 March 2009 (UTC)
Encoding
editWhen checking your tool against ruwiki ([3]), I get something like "Ñпам в ÑÑ‚. ÐÑка ЛÑнгли Сорью, Ñайт Ñодержит материалы, нарушающие ÐП. altes" - could you double-check you're reading the page text with the right encoding? — Mike.lifeguard | @en.wb 23:59, 31 March 2009 (UTC)
- Should be fixed now:
print $cgi->header();
- →
print $cgi->header(-charset=>'utf-8');
- -- seth 21:51, 1 April 2009 (UTC)
proper escaping?
editAt [4], it seems there may be missing escaping or something - the regex fragment isn't grabbed from the blacklist properly (or isn't shown properly at any rate). — Mike.lifeguard | @en.wb 19:56, 2 June 2009 (UTC)
- should be fixed now. -- seth 20:38, 4 June 2009 (UTC)
XLinkBot log not defined
editWhile your spamlists search does include listings on http://en.wikipedia.org/wiki/User:XLinkBot/RevertList, it does not show the log entries. I get this;
- "no log defined for this project. ask there if you want this script to use the log. "
The log is located here. I don't know what we did before the spamlists search tool. I've placed a Note on XLinkBot here. thanks seth.--Hu12 15:42, 28 August 2009 (UTC)
- thx for that hint. i'll cope with that in perhaps 5 days, because i'll be away from my code for that time. meanwhile perhaps you could "clean up" the log so that it looks similar to the other logs. otherwise i'll do that, too. -- seth 18:12, 28 August 2009 (UTC)
- Cleaned up the log...If more need to be done let me know. thanks--Hu12 16:36, 29 August 2009 (UTC)
- thx! should be fixed now. -- seth 20:15, 2 September 2009 (UTC)
- Cleaned up the log...If more need to be done let me know. thanks--Hu12 16:36, 29 August 2009 (UTC)
protocol relative links from grep_regexp_from_url.cgi
editHi Lustiger seth,
With the tool when I am logged in with secure url for https://toolserver.org to assist with my secure login for WMF, I trip across protocol relative issues with the output from the regex tool
- the purge link is fine
- the links to lists: wikiname all give http:// rather than https:
- when I get a positive hit for meta when it displays the log links it shows a problematic double url of
which I don't see if I get a hit on enWP
list: meta blacklist dogsex(?!posed) log entry: dogsex(?!posed) #: # lustiger_seth # modification; see request
Thanks. billinghurst sDrewth 01:41, 6 January 2012 (UTC)
- Hi! Thanks for reporting. Should be fixed now. -- seth 09:53, 6 January 2012 (UTC)
Possible XSS vulnerability
editOne of your tools may have an XSS vulnerability: https://toolserver.org/~seth/grep_regexp_from_url.cgi?userdeflang=%3Cbig%3E%3Cbig%3E%3Cbig%3E%3Cbig%3EXSS?%3C/big%3E%3C/big%3E%3C/big%3E%3C/big%3E&url=bit.ly πr2 (t • c) 03:07, 28 March 2012 (UTC)
- Hi!
- Many thanks! Should be fixed now. -- seth (talk) 20:09, 28 March 2012 (UTC)
de - Spamfilter schlägt an - warum?
editHallo Lustiger seth, hast du eine Idee, weshalb auf meiner BNR-Seite zur Großen Magellanschen Wolke die Url www.wissenschaft.de/allgemein/neues-vom-nachbarn/ nicht durch den Spamfilter geht? Magst du dir das einmal ansehen? Danke! ----Watzmann (talk) 20:33, 19 February 2021 (UTC)
- gudn tach Watzmann!
- blockiert wurde dein versuch, den artikel
- in dewiki zu verlinken.
- hintergrund fuer die blockade ist die diskussion
- hilft das? -- seth (talk) 22:21, 19 February 2021 (UTC)
- Du hast recht, da hatte ich anscheinend die falsche URL kopiert, sorry. Das erklärt es wenigstens. Danke! --Watzmann (talk) 23:30, 19 February 2021 (UTC)
sbl
editCould you take a look - affects dewiki mainly. — Mike.lifeguard | @en.wb 00:08, 23 March 2009 (UTC)
- Yes, it's already been discussed at de:WP:SBL#mountainzones.com. It's not considered as spam at de-wiki. Well, actually I considered it to be spam, but some reputable users requested un-blacklisting. -- seth 12:57, 27 March 2009 (UTC)
Blacklisting
editDear Lustiger Seth,
It was brought to our attention that all the references to our websites have vanished from Wikipedia. A short check has shown that our domain, 888.com, was blacklisted due to conflict of interest issues. We wish to emphasize that we never had any intension of doing something which is against Wikipedia’s guidelines. We are a worldwide, leading company in our field, which has been operating for over a decade. We will appreciate any feedback that you could give us, as well as steps we need to take in order to improve our status and be removed from the blacklist. We would like to make sure this situation does not repeat itself, in the interest of all parties involved.
Looking forward for your reply.
Thank you and best regards, Oris 12:21, 25 March 2009 (UTC)
- Hi!
- Links to that domain were added by different users, see m:user:COIBot/XWiki/888.com. Afaics those links are no advantage to wikipedia articles in sense of en:WP:EL (or de:WP:EL a.s.o.). Is there any article where a link to that domain would be useful? -- seth 15:54, 27 March 2009 (UTC)
- Hi Seth,
- Thank you for your prompt reply! We carefully read the instructions written in en:WP:EL and went over the list of references to 888.com m:user:COIBot/XWiki/888.com. In this list we found several references which are very relevant to the articles and can definitely enrich their content. For instance, it is only natural and straightforward to add external links to the company’s main websites which are mentioned within the article on 888 Holdings.
- Furthermore, in some cases the external links to 888.com were inserted as we are the official sponsors of personas and teams such as Shane Warne (see au.888.com/shane-warne/), Michael Keiner (see de.888.com/sponsorship/de/michael_keiner.htm), Jeff Fenech (see www.888.com/sponsorship/en/jeff-fenech.htm) & Sevilla FC (see es.888.com/sponsorship/es/sevilla.htm). In each of these cases, relevant content which contributes to the article’s theme, was added. Though I agree with you, It might have been even more appropriate to reference these pages under “References” or “Notes”.
- We will greatly appreciate any feedback you may have on the cases noted above or on any other steps we should take in order to follow Wikipedia’s guidelines.
- Best Regards, Oris 09:41, 31 March 2009 (UTC)
- At 888 Holdings there is a link to the company already.
- I'm not sure about the sponsor links. I guess, it is better, if you choose one of those articles and ask on its talk page, whether a link to 888.com is wanted or unwanted. -- seth 10:20, 31 March 2009 (UTC)
- Dear Seth,
- We contacted you several months ago regarding the blacklisting of 888.com in Wikipedia. As a result we took some measures regarding the guidelines and instructions you provided us with. We now have only one focal point which is allowed to suggest content to Wikipedia according to its guidelines and in an effort to really enrich it with high quality, relevant content. We will be grateful if you could reconsider our removal from Wikipedia blacklist.
- According to your recommendation, our focal point tried to add some relevant content within one of the talk pages but it seems like the editor relied more on the fact 888.com being blacklisted than on relevance of the content itself. See here.
- We are currently in a process of uploading valuable, interesting content to our websites. We feel that Wikipedia readers might find this content attractive and relevant. You are more than welcome to review it yourself at: de.888.com/magazine/de/ or de.888.com/magazine/en/.
- Best regards,Oris 14:15, 25 August 2009 (UTC)
- Hi!
- It's not true, that the blacklisting itself was the main reason, see [5].
- You can tell me a specific 888 web page and a specific wikipedia article, and I'll will ask on the article's talk page, whether the authors would like to add the link there. (Of course you can ask by yourself, if your German is good enough or if you want to ask in English there.) In the English wikipedia there was one answerer only, user:2005, and seems to have a similar opinion as I have, i.e., one link to the main page should be enough. -- seth 20:12, 26 August 2009 (UTC)
- Hi Seth,
- Thanks for your patience and willingness to help!
- 888.com sponsors Dr. Michael Keiner (a professional German poker player) and keeps track on all of his main achievements in the following URL: de.888.com/sponsorship/de/michael_keiner.htm. I have noticed that Dr. Michael Keiner's Wikipedia page doesn't have any updates on his more recent achievements in the 2009 WSOP.
- If you too believe that this information is relevant and contributes to the Wiki's article, I would be more than happy to get your help in addressing the relevant Wiki editor.
- Thanks!! Oris 13:17, 22 September 2009 (UTC)
- Hi!
- I asked at de:talk:Michael_Keiner for some opinions concerning the link. Let's wait a few days for some answers. -- seth 11:49, 23 September 2009 (UTC)
- Thanks! Oris 05:48, 24 September 2009 (UTC)
- Hi Seth,
- I have read the responses at de:talk:Michael_Keiner and they seem very peculiar to me since I believe that Wikipedia has a need for a good and reliable information with relevant and trusted references on and off the web. Maybe I didn't understand the editors' remarks correctly and so I would appreciate it a lot if you could help me understand this issue better.
- I also wanted to thank you a lot for all your efforts - I really appreciate your help and it is not taken for granted!! Oris 08:04, 28 October 2009 (UTC)
- Hi!
- I'll try to translate and summarize the comments:
- I asked at de:talk:Michael_Keiner: The article doesn't say anything about Keiner's achievements at 2009 WSOP. His sponsor's website gives some information. That domain is on our blacklist right now, so I'm interested in opinions on that link, for making a dicision about whitelisting it.
- User GiordanoBruno doesn't see any reason for linking the webpage. He says, it would be better to copy the information to the wikipedia article, if the information was important enough. There wouldn't be anything special about that page.
- User He3nry agrees with GiordanoBruno.
- 217.86.39.170 sees the Webpage as a reliable source and adds that every evidence is better than an unreferenced information. 217.86.39.170 claims that the wikipedia article contained a lot of numbers which should be proved by references. So in this user's opinion the link is good as a reference, but not as a regular external link (in that point 217.86.39.170 agrees with GiordanoBruno).
- User Oberfoerster doesn't see the benefit of this external link as a reference, because he could not find the numbers from the wikipedia article in the 888-page.
- 217.86.39.170 sees the Webpage as a reliable source and adds that every evidence is better than an unreferenced information. 217.86.39.170 claims that the wikipedia article contained a lot of numbers which should be proved by references. So in this user's opinion the link is good as a reference, but not as a regular external link (in that point 217.86.39.170 agrees with GiordanoBruno).
- User He3nry agrees with GiordanoBruno.
- User GiordanoBruno doesn't see any reason for linking the webpage. He says, it would be better to copy the information to the wikipedia article, if the information was important enough. There wouldn't be anything special about that page.
- My English is not very good, but I guess, I could outline the main points.
- All in all I don't see that the link really is wanted there. -- seth 21:59, 28 October 2009 (UTC)
Lofoten.info
editHi, in case you haven't noticed, Talk:Spam_blacklist#lofoten.info. As I am writing there, I don't quite understand why this was added to global blacklist, but I guess I'm missing something ;) Best regards, Finn Rindahl 19:04, 8 November 2009 (UTC)
- Thx! I will answer there. -- seth 23:26, 8 November 2009 (UTC)
google spam
editHi Lustiger seth, could I ask you to have a look at Talk:Spam_blacklist#Google_redirect_spam - a lot of Google needs to be blocked as it can be used as a redirect site. Thanks. --Dirk Beetstra T C (en: U, T) 13:38, 24 October 2011 (UTC)
- Could I ask you to have another look at the request, I am not happy with the current rule, which may have false positives (though I agree, if the link redirected to is blacklisted, it should already not work - there may be a bug there). Thanks! --Dirk Beetstra T C (en: U, T) 15:57, 1 November 2011 (UTC)
- Sorry for my late replies. For the last two months I couldn't spend much time in wikipedia. I hope this will get a bit better now. -- seth 20:20, 1 November 2011 (UTC)
regex
editHi Seth, I need some we have a regex that kills \bxairforces\.(com|net|org)\b, but I would also like to include xairforce (without the s). I figured xairforce[s] would do the trick, but apparently not. How do I make this simple thing work? EdBever 19:46, 4 December 2011 (UTC)
- Hi!
\bxairforces\.(?:com|net|org)\b
→ xairforces.com, xairforces.net and xairforces.org\bxairforce[s]\.(?:com|net|org)\b
→ same as above\bxairforces?\.(?:com|net|org)\b
→ xairforces.com, xairforces.net, xairforces.org, xairforce.com, xairforce.net, and xairforce.org- -- seth 20:47, 4 December 2011 (UTC)
- OK, thanks! This guy keeps spamming the different domains. If he finds another TLD, I'll block \bxairforces?\b EdBever 20:53, 4 December 2011 (UTC)
- Hi!
- In that case maybe
\bxairforces?\.
would be better (less general). -- seth 21:46, 4 December 2011 (UTC)
- OK, thanks! This guy keeps spamming the different domains. If he finds another TLD, I'll block \bxairforces?\b EdBever 20:53, 4 December 2011 (UTC)
Regexp check
editHi seth. I added this to the SBL based on Talk:Spam_blacklist#Cross-wiki_spammer. Is the pro-* regexp correct? Thanks in advance. Best regards. -- MarcoAurelio (talk) 15:07, 16 October 2012 (UTC)
- I have changed it to
\bpro-.*?\.ru\b
. Thanks. -- MarcoAurelio (talk) 15:17, 16 October 2012 (UTC)
- Hi! I'm using unsafe computers/connections right now, so I'm not logged in.
pro-*
matches any string (or to be more precise: domain) that contains "pro", "pro-", "pro--", "pro---" snd so on.\bpro-.*?\.ru\b
matches any domain that contains "pro-" (where "pro-" must not be prefixed by any alphanumerical character) and that contains somewhere behind that an occurence of ".ru" (where ".ru" must not be followed by ny alphanumerical character), e.g., "www.pro-wikipedia.ru", "pro-.ru", "sub.some-pro-domain.ru". -- 41.182.45.198 09:34, 17 October 2012 (UTC) (not logged in -- seth (talk) 21:25, 2 November 2012 (UTC))
- Thank you very much for your advice. -- MarcoAurelio (talk) 22:18, 2 November 2012 (UTC)
klassik-resampled.de und s-fahl.de
edit- Falsche und unberechtigte Einträge in Spamlist "klassik-resampled.de" "s-fahl.de" verhindern Korrektur eines bestehenden und gesichteten Links!!
Letztes Jahr hat der Wikipedia Spamlinkjäger Hu12 ohne jede Rücksicht auf irgendwelche Inhalte sämtliche zum überwiegenden Teil seit Jahren gesichteten Links zu einer Reihe an vollkommen unkommerziellen Klangbeispielen zu ansonsten selten oder überhaupt nicht anderweitig verfügbaren Aufnahmen vor allem unbekannter Komponisten auf die Blacklist gesetzt. Lediglich die Tatsache, dass über 1000 Aufnahmen von 100 verschiedenen Komponistenb aus vier Jahrhunderten tatsächlich unabhängig von der Sprachversion in vielen verschiedenen Artikeln bereichernde Links zu entsprechenden Klangbeispielen bieten können. Nach eingehender Diskussion im Musikbereich des deutschen Wikipedia hat die Administratorin density die Links im Artikel über Jean Louis Nicodé wieder hergestellt. Diesen kann ich aber wegen der irrigen Spamlisteinträge "klassik-resampled.de" und "s-fahl.de" nicht korrigieren! Auch der Nicodé-chor empfiehlt übrigens diese Aufnahmen auf seiner offiziellen Homepage Ähnliche Versuche die Links wegen ihrer inhaltlichen Berechtigung wiederherzustellen hatte es z.B. im Artikel Tzigane des Niederländischen wikipedia wurden ohne jede inhaltliche Diskussion mit dem Verweis auf diese Blacklisteinträge kassiert. Ob diese Einträge in der Blacklist inhaltlich auch nur im geringsten gerechtfertigt wären konnte nirgendwo aufgezeigt werden. Tatsächlich rechtfertigt die Seite klassik-resampled in keiner Weise irgend einen einzigen Aspekt der den Eintrag in einer Spamblacklist rechtfertigt. Vielmehr bietet sie mit zahlreichen frei verfügbaren und vollständig unkommerziellen Erstaufnahmen, Aufnahmen selten gespielter Komponisten und Gesamtaufnahmen von Standardrepertoir in vielen Artikeln eine inhaltliche Bereicherung durch vollständige Musikbeispiele, die nach der Löschung der Links in der Regel durch nichts ersetzt werden konnten. Ich empfehle dringend diese Einträge endlich und umgehend von der Blacklist zu nehmen. --Fahl5 (talk) 16:14, 25 September 2013 (UTC)
- gudn tach!
- danke fuer den hinweis. ich werde mit der musik-redaktion noch mal ruecksprache halten. wenn die dort einverstanden sind, gebe ich den link zunaechst in der deutschsprachigen wikipedia frei. eine globale entsperrung hier auf meta waere dann der schritt danach. -- seth (talk) 21:48, 25 September 2013 (UTC)
Crosspost_from_en.wikipedia
editHello Seth. Maybe you're interested in this thread. Regards, --dferg ☎ talk 19:50, 6 September 2010 (UTC)
- thx. :-) -- seth 22:12, 6 September 2010 (UTC)
- Hi,
- I had to edit the title blacklist - diff. I wish to block the word "Peidar" with as many variants as we can from any new account. It is portuguese strong language and it is not needed anywere. That word, along with others, are being user by an idiot vandalizing all PT projects. I would like to be sure the regexp I've added is not causing massive harm, etc. Thanks,
- --dferg ☎ talk 22:42, 7 October 2010 (UTC)
- Hi!
- Sorry for late answer. Is the meta abuse filter active by now? -- seth 20:40, 11 October 2010 (UTC)
- Hi. On meta we have activated local abusefilter, but I do not know how it will help on this case since that vandal only hits pt.wiktionary/wikiquote/wikipedia, etc. projects. Regards, --dferg ☎ talk 07:33, 12 October 2010 (UTC)
- The way you did it (P[eèéêë][iìíïî]d[aàáâä]r) was actually ok. There is no really better possibility, cause there are no character classes like "characters that look like e". And even if there were such classes, the regexp could be easily circumvented by using context-sensitive whitespace-similar characters like "P_e_i_d_a_r", "PxExIxDxAxR" a.s.o. (afaik the last example would even trick the abuse filters normalize function)
- So I guess that you have to modify the regexps "on demand", i.e., every time when the vandal modifies his behaviour. -- seth 09:11, 12 October 2010 (UTC)
- Thank you for your advice. Since my knowledge of regexp is still very basic I prefered to ask you. I am sorry for any inconvenients I might have caused. Regards, --dferg ☎ talk 06:36, 13 October 2010 (UTC)
- Your questions are not inconvenient at all. :-) -- seth 20:30, 13 October 2010 (UTC)
- Thank you for your advice. Since my knowledge of regexp is still very basic I prefered to ask you. I am sorry for any inconvenients I might have caused. Regards, --dferg ☎ talk 06:36, 13 October 2010 (UTC)
- Hi. On meta we have activated local abusefilter, but I do not know how it will help on this case since that vandal only hits pt.wiktionary/wikiquote/wikipedia, etc. projects. Regards, --dferg ☎ talk 07:33, 12 October 2010 (UTC)
- Hello again Seth. I'd like to ask if the proposed regexp in this thread is OK. Feel free to handle it if you want to. Cheers, -- Dferg ☎ talk 21:34, 29 July 2011 (UTC)
Spam Filter schlägt zu bei Quelleneinbindung und Konvertertool versagt
editHilfe.
Ich versuche gerade eine Quelle der spanischen Nationalbibliothek (Ergebniseintrag 1 und 10) zu verlinken, da versagt mir die Software die Nutzung des Links. Ich wollte eigentlich die zwei Links hier direkt einhängen, aber selbst das wird verweigert. :-(
Leider verwendet die Nationalbibliothek selber Google, sodass kein reiner Link existiert (Dann macht ein Blacklisteintrag wenig Sinn)!!! Und auch das Convertertool verweigert den Dienst bei den Links.
Kannst du mir helfen? Danke im voraus --Catchapa (talk) 07:19, 5 September 2014 (UTC)
- gudn tach!
- danke fuer den hinweis, ich hab das convertier-script http://tools.wmflabs.org/url-converter dahingehend abgeaendert, dass es nun auch mit solchen urls zurechtkommen sollte. klappt's damit? -- seth (talk) 23:04, 5 September 2014 (UTC)
- Danke, immerhin das Tool funktioniert wieder, Super. --Catchapa (talk) 08:51, 8 September 2014 (UTC)
Spam blacklist
editHi Lustiger seth. I wanted to use this tool to check why a url was added, but it's not working... Can you help please? Trijnsteltalk 14:34, 10 March 2015 (UTC)
- Hi!
- Thanks for reporting. I don't know, what went wrong, I restarted the webservice and now it seems to work fine again. -- seth (talk) 23:23, 10 March 2015 (UTC)
- Yep. It does. Thanks for the quick response! Trijnsteltalk 23:34, 10 March 2015 (UTC)
SBL search problem encountered
editHi seth!. I was searching for a domain at eswiki's lists and got the following error:
- got an error: Reference to invalid group 0 in regex; marked by <-- HERE in m/https?://+[a-z0-9_.-]*(?:\g <-- HERE unbound-cash-gratis\.es\.tl)/ at /data/project/searchsbl/public_html//index.pl line 442.
Is the tool or the eswiki bl/wl who is not working? Best regards. —MarcoAurelio 18:44, 3 August 2015 (UTC)
- Hi!
- Syntax error in eswiki: es:MediaWiki_discusión:Spam-blacklist#syntax_error_in_blacklist. -- seth (talk) 20:36, 3 August 2015 (UTC)
Spam searcher - only looking at the blacklist not the logs
editGuten morgen, Lustiger. I was perusing your script, https://tools.wmflabs.org/searchsbl on a site to find the entry when it was added. It then nicely gives both the rule on the spam-blacklist, and it's removal. I decided to remove the rule (but with a thought that things could be added back), so I clicked the link again: https://tools.wmflabs.org/searchsbl/?url=supercars.com .. it now obviously does not show the rule on the blacklist (I removed it), but I think it should now find two log entries. Would it be possible to have the script go through the appropriate logs, even when the link is not on the blacklist anymore? --Dirk Beetstra T C (en: U, T) 05:38, 17 July 2016 (UTC)
- Hi Beetstra!
- Sorry for the late answer. Well, actually it's not a real answer yet. In the end of December I might have some time to have a closer look at the old code and then I'm able to answer your request. -- seth (talk) 20:48, 11 October 2016 (UTC)
Youtu.be
editHallo Seth. URLs der Form https://youtu[.]be/foobar sind gesperrt. Ich weiss nicht genau warum... weder in Spam blacklist noch MediaWiki:Spam-blacklist kommt diese URL vor. Kannst du sagen wo / warum die URLs gesperrt sind? Afaik leiten die ja nur auf Youtube und nicht auf beliebige Domains weiter um. Falls die Sperre gewollt ist, koenntest du youtu.be in dein "convert redirects to original url" Tool aufnehmen? Vielen Dank! Tobias talk · contrib 18:07, 19 December 2018 (UTC)
- gudn tach Tobias!
- via https://tools.wmflabs.org/searchsbl kannst du (normalerweise und auch in diesem fall) den eintrag finden.
- hier ist es die meta-blacklist: [6] -- seth (talk) 21:46, 19 December 2018 (UTC)
- ach so, https://tools.wmflabs.org/url-converter/ sollte jetzt auch youtu.be koennen. -- seth (talk) 22:47, 19 December 2018 (UTC)
- Super, vielen Dank! --Tobias talk · contrib 22:13, 20 December 2018 (UTC)
Question
editIs it abusive?. Thanks, ---Dferg 22:12, 22 October 2009 (UTC)
- Hi!
- "Schwanz in Auge" just means "dick (penis) in eye". I would not block somebody with this name, if that user would be helpful. But there maybe admins at w:de which would block that user just because of the name. -- seth 18:54, 23 October 2009 (UTC)
labs/toolserver
editRückmeldung bei Fehlfunktion erwünscht
editHallo, Lustiger seth, folgender ziemlich langer Link erzeugt auf der Umwandlungsseite für Google-Suchergebnisse die Meldung: "results some error occurred."
Hier der Link: [http://www.google.de/imgres?newwindow=1&safe=off&client=firefox-a&hs=rYv&rls=org.mozilla:de:official&biw=1061&bih=559&tbm=isch&tbnid=ohyTAaScbJH4EM:&imgrefurl=http://www.designboom.com/art/maurizio-cattelans-5-horses-at-fondation-beyeler/&docid=95wQ9B6x7-0XNM&imgurl=http://www.designboom.com/wp-content/uploads/2013/06/maurizio-cattelan-fondazione-beyeler-designboom.jpg&w=818&h=471&ei=xNe9UZkbhuWzBvnzgbAB&zoom=1&iact=hc&vpx=4&vpy=137&dur=3651&hovh=170&hovw=296&tx=116&ty=193&page=1&tbnh=135&tbnw=224&start=0&ndsp=15&ved=1t:429,r:0,s:0,i:79 ] --Areta87 (talk) 15:33, 16 June 2013 (UTC)
- gudn tach!
- ja, bisher war der converter nur fuer normale links zu webseiten. hab's jetzt dahingehend erweitert, dass auch imgurls beruecksichtigt werden. -- seth (talk) 16:14, 16 June 2013 (UTC)
webcache von google
editProblem mit https://webcache.googleusercontent.com/search?q=cache:https://www.ft.com/content/6b810e5e-ba89-11e3-8b15-00144feabdc0%23axzz3pOVzTlau. Der Link führt auf eine ft-Seite. Die existiert aktuell, befindet sich aber hinter einer Zahlschranke, nicht jedoch via Google. Leider ist der Link gesperrt und eine Auslösung mit Deinem Tool funktioniert nicht. -- Dietrich (talk) 17:31, 14 October 2016 (UTC)
- gudn tach Dietrich!
- fuer webcache-links ist das tool nicht gedacht. anscheinend wird bei dem url
https://www.ft.com/content/6b810e5e-ba89-11e3-8b15-00144feabdc0
der referrer abgefragt, denn wenn man ueber die google-suchergebnisse dorthin gelangt, dann kommt man direkt auf den artikel, -- ohne zahlungsaufforderung. - eine technische loesung waere hier eher murks, deswegen waere meiner ansicht nach das beste, wenn du einen alternativen beleg verwendest. waere z.b. einer der folgenden geeignet: [7] oder [8]? -- seth (talk) 19:29, 14 October 2016 (UTC)
- Danke! - Dietrich (talk) 20:38, 14 October 2016 (UTC)
Kurzlinks von archiv.is
editHallo! Bei den Kurzlinks von archiv.is bekomme ich jetzt öfters den Fehler "could not get information about original url from 'https://archive.is/qwPBf'.some error occurred" "Früher" hab ich da nie Probleme gehabt.Gruß, --Hiddenhauser (talk) 15:58, 19 January 2017 (UTC)
- gudn tach Hiddenhauser!
- hmm, die von archive.(ie|today) haben das anscheinend geaendert. hab's jetzt anders geloest. klappt's jetzt? -- seth (talk) 10:36, 22 January 2017 (UTC)
- Ich habe gerade mal ein Dutzend probiert. Kein Fehler, scheint also zu funktionieren. Danke!--Hiddenhauser (talk) 14:23, 22 January 2017 (UTC)
- gudn tach!
- ok, prima, danke fuers testen. -- seth (talk) 17:25, 22 January 2017 (UTC)
- Kleiner Nachtrag: Bei manchen links muss man per hand "http://" nachtragen um die archiv.is Vorlage nutzen zu können.--Hiddenhauser (talk) 16:12, 26 January 2017 (UTC)
- gudn tach Hiddenhauser!
- sorry fuer mein langsames antworten.
- kannst du mir ein beispiel-url angeben, damit ich besser sehen kann, was du meinst? -- seth (talk) 19:52, 5 February 2017 (UTC)
- Also aus https://archive.is/dQXOP wird
- http://archive.today/20131120224629/www.eastbengaltherealpower.com/fans/news/449/Sunanda-Dhar-declared-as-the-AIFF-I-League-CEO-|-The-All-India-Football-Federation-(AIFF)-has-appointed-Sunando-Dhar-as-the-new-CEO-of-the-I-League- statt
- http://archive.today/20131120224629/http://www.eastbengaltherealpower.com/fans/news/449/Sunanda-Dhar-declared-as-the-AIFF-I-League-CEO-|-The-All-India-Football-Federation-(AIFF)-has-appointed-Sunando-Dhar-as-the-new-CEO-of-the-I-League- --Hiddenhauser (talk) 10:55, 6 February 2017 (UTC)
- gudn tach Hiddenhauser!
- ich geb jetzt einfach beides aus, so ok? -- seth (talk) 23:16, 6 February 2017 (UTC)
- Also aus https://archive.is/dQXOP wird
- Kleiner Nachtrag: Bei manchen links muss man per hand "http://" nachtragen um die archiv.is Vorlage nutzen zu können.--Hiddenhauser (talk) 16:12, 26 January 2017 (UTC)
- Ich habe gerade mal ein Dutzend probiert. Kein Fehler, scheint also zu funktionieren. Danke!--Hiddenhauser (talk) 14:23, 22 January 2017 (UTC)
Editor @ ar.wiki
editHello. I would like to inform you that I have granted you editor flag at the Arabic Wikipedia, all your edits there will be automatically marked as patrolled. Best regards.--Avocato (talk) 06:43, 7 December 2013 (UTC)
- thanks. :-) -- seth (talk) 18:13, 24 May 2014 (UTC)
admin removals/renewals
editNotice of removal of adminship
editHello,
I regret to inform you that, in accordance with Meta:Administrators/Removal and as a result of your inactivity, administrator rights have been removed from your account. Please see Meta:Administrators/Removal/April 2014 for details. Kind regards, Barras talk 09:53, 1 April 2014 (UTC)
- hi!
- what do I have to do to get the rights back? In fact, I won't be here very much, maybe just a few times per year. But I'm not dead and my edits, alhough they occur selden, should be helpful to the wikipedias, so I don't think that it's useful not being admin here. -- seth (talk) 18:16, 24 May 2014 (UTC), 08:33, 31 May 2014 (UTC)
- @user:Barras: ping -- seth (talk) 08:33, 31 May 2014 (UTC)
- Hello. If you want the right back, you will have to pass another RFA on meta. -Barras talk 11:52, 1 June 2014 (UTC)
Congratulations
editI've closed your request for adminship as successful. Happy button pushing, Tiptoety talk 05:36, 11 June 2014 (UTC)
- thanks! :-) -- seth (talk) 15:08, 15 June 2014 (UTC)
Notice of review of adminship
editHello Lustiger seth. In accordance with Meta:Administrators/Removal and because you have made fewer than ten logged administrator actions over the past six months, your adminship is under review at Meta:Administrators/Removal/October 2014. If you would like to retain your adminship, please sign there before October 08, 2014. Kind regards, Barras talk 14:18, 1 October 2014 (UTC)
Notice of review of adminship
editHello Lustiger seth. In accordance with Meta:Administrators/Removal and because you have made fewer than ten logged administrator actions over the past six months, your adminship is under review at Meta:Administrators/Removal/April 2015. If you would like to retain your adminship, please sign there before April 08, 2015. Kind regards, Matanya (talk) 13:57, 1 April 2015 (UTC)
Notice of review of adminship
editHello Lustiger seth. In accordance with Meta:Administrators/Removal and because you have made fewer than ten logged administrator actions over the past six months, your adminship is under review at Meta:Administrators/Removal/October 2015. If you would like to retain your adminship, please sign there before October 09, 2015. Kind regards, MF-W 03:58, 2 October 2015 (UTC)
Notice of review of adminship
editHello Lustiger seth. In accordance with Meta:Administrators/Removal and because you have made fewer than ten logged administrator actions over the past six months, your adminship is under review at Meta:Administrators/Removal/April 2016. If you would like to retain your adminship, please sign there before April 08, 2016. Kind regards, Barras talk 15:23, 1 April 2016 (UTC)
Notice of removal of adminship
editHello,
I regret to inform you that, in accordance with Meta:Administrators/Removal and as a result of your inactivity, administrator rights have been removed from your account. Please see Meta:Administrators/Removal/October 2016 for details. Kind regards, MF-W 01:12, 11 October 2016 (UTC)
Your RfA
editHello. Please see Meta talk:Requests for adminship/lustiger seth4. We wonder if you'd accept a restricted adminship for spam and titleblacklist tasks only, given the result of your RfA. Please advice. Thank you. —MarcoAurelio 15:14, 20 October 2016 (UTC)
- Hi,
- I've granted you admin rights based on your request, but only for spam blacklist management and viewing abusefilters. Moreover, you'll be required to sign on each October inactivity run if you wish to keep your admin rights for another yearly term. I hope that this is good for you. Thank you for your work on the blacklists. I really appreciate it.
- MfG, —MarcoAurelio 14:24, 26 October 2016 (UTC)
- Ok, thanks. -- seth (talk) 16:48, 27 October 2016 (UTC)
Review of adminship
editHi. If you would like to stay an admin on Meta, please say so on Meta:Administrators/Removal/October 2017. Best regards --MF-W 13:39, 1 October 2017 (UTC)
Notice of review of adminship
editHello,
Per your RfA. If you want to retain your adminship, you'll need to sign at Meta:Administrators/Removal/October 2018 before November 09, 2018. Kind regards, Matiia (talk) 02:17, 2 November 2018 (UTC)
Notice of review of adminship
editHello,
As usual, if you want to keep your tools, you'll have to sign on Meta:Administrators/Removal/October 2019, Kind regards. Matiia (talk) 23:12, 7 October 2019 (UTC)
Invitation to the CentralNotice-admins list
editHi! This bulk email is to let you know about a mailing list used to communicate bug reports and new features in CentralNotice, and to facilitate conversations between the admins. This message is being sent to you because you have the privileges to use the CentralNotice admin interface.
If you use CentralNotice to post or modify notices, please consider joining the list by visiting this page and subscribing yourself:
https://lists.wikimedia.org/mailman/listinfo/centralnotice-admins
Thanks,
00:23, 30 October 2014 (UTC)
Fundraising Tech,
Wikimedia Foundation
Global CSS/JS migration
editHello Lustiger seth. You have global scripts in User:Lustiger seth/global.js, which you import using your local JS pages. Since August 2014, your global.js and global.css pages are loaded automatically on all wikis. Since you already import them yourself, you may experience script errors or tools being added twice. Do you want me to remove the global imports from your local pages using Synchbot (without changing any other content)? —Pathoschild 18:52, 28 June 2015 (UTC)
- Hi!
- Thanks for this notice. I think I updated/deleted all local scripts now. -- seth (talk) 19:09, 28 June 2015 (UTC)
- Yep, you're all good now. :) —Pathoschild 19:11, 28 June 2015 (UTC)
The Community Wishlist Survey
editHi,
You get this message because you’ve previously participated in the Community Wishlist Survey. I just wanted to let you know that this year’s survey is now open for proposals. You can suggest technical changes until 11 November: Community Wishlist Survey 2019.
You can vote from November 16 to November 30. To keep the number of messages at a reasonable level, I won’t send out a separate reminder to you about that. /Johan (WMF) 11:24, 30 October 2018 (UTC)
Blocklist
editWhat's this kind of discussion meaning if may ask Markus7819 (talk) 01:41, 14 April 2019 (UTC)
- Hi!
- This project's main goal is the creation of an encyclopedia. If you want to help in this project, you'll get help from other wikipedians. Up till now I don't see where you want(ed) to improve the Wikipedia. -- seth (talk) 07:46, 14 April 2019 (UTC)
Blockierte Links
editImmer wieder kommt es vor, dass ich Links zu Webseiten nicht einfügen kann. Heute zum Beispiel auf https://glossar.wein-plus.eu/grundwein Was ist den falsch an dieser Internetseite, dass sie blockiert werden muss? --Wagner67 (talk) 17:18, 24 November 2019 (UTC)
- gudn tach!
- die begruendung war damals masseneintragungen in Nichtartikels via ipv6 / paywall.
- ist ok fuer dich, wenn ich diesen thread nach w:de:WP:SBL verschiebe? -- seth (talk) 23:35, 24 November 2019 (UTC)
- Hallo Seth! Ich kenne mich nicht aus mit diesen Filtern. Ich verstehe dich so, dass diese Seite sich dann um das Entsperren kümmern soll? Dann nicht verschieben, weil das nur ein Beispiel war. Es erschreckt mich jedes Mal, wenn so ein Filter erscheint, als hätte ich etwas Böses getan. Mir ging es eher darum, was das für Konsequenzen für mich hat, wenn ich solche Filter auslöse (obwohl keine böse Absichten dahinter stecken). --Wagner67 (talk) 19:21, 27 November 2019 (UTC)
- gudn tach!
- vorab: es gibt die spam-blacklist und es gibt das edit filter, die grundsaetzlich unterschiedlich funktionieren. die spam-blacklist soll das hinzufuegen von bestimmten links verhindern. im edit filter koennen komplexe regeln angelegt werden, die z.b. bestimmten usern bestimmte edits in bestimmten artikeln verbieten.
- du beziehst dich hier auf die spam-blacklist. wenn du einen url hinzufuegen moechtest, der durch diese liste verhindert wird, erscheint die meldung w:de:MediaWiki:Spamprotectiontext. gibt's daran eine bestimmte formulierung, bei der du verbesserungspotenzial siehst? vorschlaege sind willkommen! :-)
- konsequenzen: eigentlich keine. zwar wird jeder blockierte versuch mitgeloggt, siehe [9], aber das ist nix schlimmes. das log wird vor allem fuer statistische auswertungen benutzt bzw. um zu schauen, ob man z.b. ncah 10 jahren einen eintrag von der spam blacklist wieder loeschen kann.
- das meiste, was bei dir blockiert wurde und deshalb im log steht, waren google-links. die stehen aus zwei gruenden auf der blacklist, 1. weil wir google nicht so viele daten unserer user schenken wollen; 2. weil man google als sperrumgehung verwenden kann. bei diesen links sollte eigentlich immer der original-url extrahiert und stattdessen verwendet. bei dem genannten protectiontext ganz unten ist ein tool angegeben, mit dem die original-urls aus den google-urls herausgekratzt werden koennen.
- hilft dir das weiter? -- seth (talk) 22:18, 27 November 2019 (UTC)
- Ja, zum Teil hatte ich mir das auch so gedacht. Neu ist für mich dass man google "nicht so viele daten unserer user schenken" will. Das mit dem protectiontext schau ich mir noch genauer an. Danke dir.--Wagner67 (talk) 06:28, 28 November 2019 (UTC)
searchsbl: searching for regexes
editHi. Could you please indicate if there is a means to search for bopindia.com
when it is caught by the regex \bopindia\.(?:com|in)\b
. I am not expecting there to be a means, just if there is, it is not evident to me, so asking rather than presuming. Thanks. — billinghurst sDrewth 22:29, 3 May 2020 (UTC)
- gudn tach user:billinghurst!
\bopindia\.(?:com|in)
matches opindia.com and opindia.in (and this-opindia.com or that-opindia.in), but neither anythingopindia.com nor anythingelseopindia.in.\b
means word boundary.- so bopindia.com is not blocked by that regexp. in fact is is not blocked at all: http://bopindia.com.
- if you wanted to block bopindia.com, you could add the sbl entry
\bbopindia\.com
. - does this answer you question? (i'm not sure, whether i understood the question correctly.) -- seth (talk) 22:36, 3 May 2020 (UTC)
- Ack, bad copy and paste by me, that was meant to be
opindia.com
per
- Ack, bad copy and paste by me, that was meant to be
opindia.com
— billinghurst sDrewth 22:47, 3 May 2020 (UTC)
- ok, that seems to work: blocked at en. -- seth (talk) 22:53, 3 May 2020 (UTC)
Non-hitting entries
editIn this edit from several years ago, you cleared a number of entries at "non-hitting." What exactly does that mean? That the url doesn't resolved, or that no one has attempted to add a link from one of those sites in a certain frame of time? Ohnoitsjamie (talk) 14:05, 9 July 2020 (UTC)
- hi!
- the corresponding post to my edit was [10] in the thread Talk:Spam_blacklist/Archives/2015-01#Expert_maintenance.
- so non-hitting entries were "entries that never triggered the sbl during 2013-sep and 2015-feb anywhere" (i.e. the second of your options). -- seth (talk) 21:00, 9 July 2020 (UTC)
To note this phabricator ticket. Not wanting to advertise it on the other talk page as it is stalked by one of the abusers. — billinghurst sDrewth 14:26, 30 August 2020 (UTC)
adminship renwal
editHello seth. I hope that you're doing okay. If you want your limited adminship extended for another year term, please sign at Meta:Administrators/Removal/October 2020 within one week. Best regards, —MarcoAurelio (talk) 11:38, 4 October 2020 (UTC)
- hi!
- thanks! done. -- seth (talk) 12:17, 4 October 2020 (UTC)
- Thank you! —MarcoAurelio (talk) 12:22, 4 October 2020 (UTC)
Kleiner Fehler in Spam-Blacklist?
editHallo Seth,
im Artikel AliExpress der deutschen Wikipedia ist die URL untypisch für einen Website-Artikel ausgeklammert worden. Das hatte mich gewundert, und wollte das wieder in den Artikel einfügen, bekam dann aber eine Fehlermeldung, weil die URL wohl auf der Spam-Blacklist liegen würde. Das habe ich durch das Tool überprüft, dort wurde dann angezeigt, dass "\baliexpress\.com\b" blockiert wird. Sollte also nur "Baliexpress.com" blockiert werden und ist "AliExpress.com" da fehlerhaft mit blockiert worden, oder ist die Website von AliExpress zurecht wegen Spam auf der Blacklist gelandet? Ich habe dafür gerade keine Begründung gefunden mit dem Tool, ich kann natürlich auch was übersehen haben. Irritierend finde ich auch, dass im Spam-Tool steht, dass die URL global für alle Wikis blockiert ist, er in der englischen Wikipedia allerdings ganz normal in der Infobox steht. Wenn du da mehr weißt, wäre ich über eine Antwort sehr dankbar, auch wenn es wirklich nur eine eher unwichtige Kleinigkeit ist. --Max221B (talk) 20:38, 4 January 2021 (UTC)
- gudn tach Max221B!
\b
steht fuer wortgrenze. es ist also die komplette domain "aliexpress.com" gesperrt. grund war wohl spam, der auf User:COIBot/XWiki/aliexpress.com teilweise dokumentiert ist.- es gibt leider keine nachhaltige loesung, um domains von bestimmten seiten fernzuhalten (technisch ist es mittels edit filter zwar moeglich, aber regeln im edit filter sind nicht so schoen wartbar.)
- ich werde es deshalb so machen, dass ich die domain kurz von der globalen blacklist nehme, dann den link einbaue, und dann den link wieder sperre.
- sicherheitshalber werde ich das auf Talk:Spam blacklist ansprechen. vielleicht hat jemand einen besseren vorschlag. -- seth (talk) 08:21, 5 January 2021 (UTC)
- Hi, dein Vorschlag klingt doch ganz gut, der Link hat ja außerhalb des Artikels auch nirgends etwas verloren, muss also nur da drinstehen. Und danke für die Erklärung mit dem "b". --Max221B (talk) 11:05, 6 January 2021 (UTC)
searchbsl chokes on nowiki and syntaxhighlight tagged pages
editHi. When blacklist and whitelist pages use alternate means to present their pages as mono typefaces, eg, use of <nowiki> or <syntaxhighlight> the tool https://searchsbl.toolforge.org/index.pl chokes on the pages. It seems that the tool only likes pages where <pre> is used.
s:mediawiki:spam-blacklist and s:mediawiki:spam-whitelist both use <syntaxhighlight> and fail. To test https://searchsbl.toolforge.org/index.pl?userdeflang=en&userdefproj=s&url=signal9.com
If something is possible, that would be great. I try to avoid "pre". — billinghurst sDrewth 23:50, 23 January 2021 (UTC)
- hi user:billinghurst!
- actually both lines you changed say "leave this line exactly as it is". as this is defined in the original source code, both of these lines should actually be a constant across all ~900 wikis. the lines before the beginning pre tag may be formatted in an almost arbritrary way. thus the most reliable way to extract the actual sbl entries is to use the pre tags.
- and i don't know whether searchsbl is the only tool that relies on that.
- if you want to change that lines, then imho you should change them in all wikis uniformly. then i could adapt the script. but adapting the script for each single wiki would be to expensive. so i'm afraid i'd rather skip this feature request. or is there a crucial benefit of your local change? -- seth (talk) 09:39, 24 January 2021 (UTC)
- (@Billinghurst:) I have to check COIBot's code, but I think that I also select the part between the "pre"s and then parse the rest. This is a wiki man, everything should be uniform and the same: spaces after and before the '==' in section headers, everything properly templated, https for every external link, and tags uniformly applied :-) --Dirk Beetstra T C (en: U, T) 12:31, 24 January 2021 (UTC)
- Ugh, okay, so 2005! — billinghurst sDrewth 12:33, 24 January 2021 (UTC)
Link question
editHey, this link is being flagged https://www.roddure.com/bio/plant/medicinal-plants-list-bd/ Is it talked as an unreliable source? Or just a false positive? --Greatder (talk) 05:13, 15 August 2021 (UTC)
- hi user:Greatder!
- the domain was transferred from a list of unreliable sources to the blacklist one year ago by user:NahidSultan.
- if you need further information or want to request a removal from the blacklist, please consider asking at bn:MediaWiki talk:Spam-blacklist. (i'm not an admin there.) -- seth (talk) 08:47, 15 August 2021 (UTC)
URL-Converter mit archive.li links funktioniert nicht
editIch wurde gerade auf meiner (temporären) disk darauf hingewiesen, daß Archivlinks möglichst nicht in Kurzformat angegeben werden sollen. Ist ja auch verständlich erklärt, warum das besser ist. Das würde ich auch gerne machen. Mein Problem ist: Ich finde die zugehörige lange URL nicht, bzw. weiß nicht, wie ich danach suchen könnte. Der Bot (CamelBot), der mich informiert hat verweist dafür auf Dein Tool URL-converter .
Leider kommt der mit der Archiv-URL offenbar nicht zurecht:
Archivlink https://archive.li/wip/3bRb7
- results
- sorry, i don't know how to convert this url.
Der Archivlink wurde mir von »Robustlinks Mementoweb« so ausgegeben, nachdem der Beleglink „robustifiziert” wurde, und er funktioniert auch. Kannst Du deinem Tool beibringen, die Langform der Kurzlinks auch für diese Archive zu ermitteln? Vielen Dank! Liebe Grüße --92.73.146.168 22:19, 20 August 2021 (UTC)
- gudn tach!
- vielen dank fuer den hinweis! :-)
- hab's korrigiert. -- seth (talk) 22:50, 20 August 2021 (UTC)
- Prima, jetzt funktioniert das. Ein klasse Werkzeug! Vielen Dank, daß Du das so schnell hinbekommen hast. A guats nächtle. --92.73.146.168 23:34, 20 August 2021 (UTC)
- Es funktioniert wahrscheinlich wieder nicht. Ich versuche es seit einigen Tagen vergeblich. Es kommt die gleiche Meldung wie oben bereits beschrieben. Pintsknife (talk) 12:51, 10 August 2023 (UTC)
- Gudn Tach!
- Ja, sorry, aktuell ist der Wurm drin. Siehe auch w:de:user talk:lustiger_seth#URL-Converter_Probleme? bzw. w:de:WP:Technik/Werkstatt#archive.today_funzt_nur_im_Browser_aber_nicht_per_curl. -- seth (talk) 15:32, 10 August 2023 (UTC)
- Es funktioniert wahrscheinlich wieder nicht. Ich versuche es seit einigen Tagen vergeblich. Es kommt die gleiche Meldung wie oben bereits beschrieben. Pintsknife (talk) 12:51, 10 August 2023 (UTC)
- Prima, jetzt funktioniert das. Ein klasse Werkzeug! Vielen Dank, daß Du das so schnell hinbekommen hast. A guats nächtle. --92.73.146.168 23:34, 20 August 2021 (UTC)
Special adminship renewal - October 2021
editHello lustiger_seth. I hope that you're doing okay. Pursuant to the special conditions of your last RfA, please kindly sign at Meta:Administrators/Removal/October 2021 if you'd like to keep your adminship at Meta. Best regards, —MarcoAurelio (talk) 11:11, 9 October 2021 (UTC)
Notice of review of adminship
editHello Lustiger seth. In accordance with Meta:Administrators/Removal and because you have made fewer than ten logged administrator actions over the past six months, your adminship is under review at Meta:Administrators/Removal/October 2022. If you would like to retain your adminship, please sign there before October 08, 2022. Kind regards, MF-W 11:57, 1 October 2022 (UTC)
- gudn tach!
- thanks for the notification. i signed there. -- seth (talk) 17:25, 1 October 2022 (UTC)