Talk:CopyPatrol
This page is for discussions related to the CopyPatrol page. Please remember to:
|
BugEdit
For whatever reason the comparison between text was showing {{Unruto=yes
and missing any proper comparisons. The link to the task is here here. Thanks-CAPTCHA Security check for adding an external link? I'm offended, Pabsoluterince (talk) 08:09, 12 May 2022 (UTC)
Feedback requested from CopyPatrol usersEdit
Hello CopyPatrol users! Turnitin, the company behind IThenticate which powers CopyPatrol, has asked us to collect feedback from our users. This is to help build our partnership with them and ensure the long-term stability of CopyPatrol, so your input is very much appreciated! The questions may seem a bit broad, but if you are able to elaborate on any of them please do. For all intents and purposes, "iThenticate" in this context can be viewed as CopyPatrol, since the reports you see surfaced there come from iThenticate. Some of you use the "iThenticate report" link as well; if you do, please describe your workflow in Q3. The questions are as follows:
- How does iThenticate help you in your work of keeping Wikipedia plagiarism-free?
- How would you describe the main benefit of using iThenticate? (e.g. report accuracy? Time saving?)
- What do you do when you identify text similarities in the article you are reviewing? Could you please describe the process of working with the detected text matches?
- How does iThenticate help you prevent copyright violations?
Thank you for taking the time to answer, and also your time and energy spent helping keep Wikipedia clean of copyright violations! I am pinging a few of our English-speaking power users, but anyone should feel free to respond: @Diannaa, @DanCherek, @L3X1, @Sphilbrick, @Ymblanter. Warm regards, MusikAnimal (WMF) (talk) 20:25, 24 August 2022 (UTC)
Responses from DanCherek |
---|
|
Responses from Diannaa |
---|
|
Responses from Sennecaster |
---|
|
Responses from Moneytrees |
---|
The above responses have basically covered anything of substance I would say, so I will provide briefer answers:
|
I don't have much to add to the above - except that wider coverage is still needed. False negatives due to unavailable sources are still too frequent. MER-C 18:04, 13 September 2022 (UTC)
Response from L3X1 |
---|
Without the tools made by ithenticate it would be functionally impossible for me to do anything about plagiarism. Having a program to detect possible violations and format in the queue that I can easily interact with and delivers the information I need right at my fingertips is irreplaceable. enL3X1 ¡‹delayed reaction›¡ 22:16, 18 September 2022 (UTC) |
- @MER-C, Moneytrees, Sennecaster, DanCherek, Diannaa, and L3X1: A belated but sincere THANK YOU for your well-articulated and thorough replies! :) I realized I failed to mention that this was for a case study. The hope is to publish a blog post (authored by Turnitin) on Wikimedia Diff. We saw the draft today and they are using direct quotes from some of you and linking to your user page. I wanted to make sure you were okay with this? I assume so since your words are already in the public eye here. I'm not sure when the post will go live but I will certainly let you know. Thanks for helping us build up our partnership with Turnitin! Best, MusikAnimal (WMF) (talk) 02:09, 2 December 2022 (UTC)
- No issues for me. Thanks, DanCherek (talk) 02:19, 2 December 2022 (UTC)
- I am okay with this and would be pleased to see the resulting blog post. Diannaa (talk) 03:21, 2 December 2022 (UTC)
- Fine with me :) Sennecaster (talk) 12:55, 2 December 2022 (UTC)
- That's cool, I am fine if my words are used. Moneytrees (talk) 20:16, 4 December 2022 (UTC)
- yes I am fine with being quote and/or linked to. thanks for reaching out. enL3X1 ¡‹delayed reaction›¡ 20:44, 5 December 2022 (UTC)
- @Moneytrees, DanCherek, Diannaa, and Sennecaster: The blog post was published and I guess I wasn't notified, but anyway here it is should you want to read it. The rest of you I didn't ping were not mentioned. Thanks again to all of you for your feedback and participation in this PR push! Thanks to you, we should soon hopefully have enough credits secured for CopyPatrol to last for many more years. Warm regards, MusikAnimal (WMF) (talk) 02:13, 18 January 2023 (UTC)
What spaces are included?Edit
Hi! I don't immediately see anything about which project spaces are covered, and which (if any?) are not. The reason I ask is that a fairly serious copyvio problem in Portal space has come to light on en.wp, and that made me wonder how it got past this tool and the heroes who monitor it. Thanks, Justlettersandnumbers (talk) 20:31, 15 September 2022 (UTC)
Loading issueEdit
Starting to see this crop up a lot "No text could be found in the given URL (note that only HTML and plain text pages are supported, and content generated by JavaScript or found inside iframes is ignored)" is this to be expected? 04:10, 22 October 2022 (UTC) enL3X1 ¡‹delayed reaction›¡ 04:10, 22 October 2022 (UTC)
- @L3X1 Sorry for the late reply! Is this still happening, and if so, could you link to an example? I'm assuming you're talking about the "Compare" buttons, which relies on toolforge:copyvios, so the issue may actually be there. MusikAnimal (WMF) (talk) 01:51, 2 December 2022 (UTC)
- yes, it appeared when I clicked the compare button . I will keep an eye out for its return enL3X1 ¡‹delayed reaction›¡ 20:57, 5 December 2022 (UTC)
- @L3X1 I just realized, this was probably because the external URL is behind a paywall. This is why we also give you the "iThenticate report" link, which will show the original text from the website. If that isn't the issue, then it's something else with toolforge:copyvios. You can contact The Earwig to report those issues, but we're happy to pass along the message for you if you'd rather post here :) MusikAnimal (WMF) (talk) 00:41, 6 December 2022 (UTC)
- Hello @User:MusikAnimal (WMF) I found a recent instance: https://copypatrol.toolforge.org/en/?id=93197888 . It appears that the external URL is of a PDF of a scan of a book. I tried to page search en-browser a portion of the new text from the wiki-diff, but the pdf is so large my browser cannot search it. hope this helps, thanks enL3X1 ¡‹delayed reaction›¡ 22:29, 6 December 2022 (UTC)
- @L3X1 I just realized, this was probably because the external URL is behind a paywall. This is why we also give you the "iThenticate report" link, which will show the original text from the website. If that isn't the issue, then it's something else with toolforge:copyvios. You can contact The Earwig to report those issues, but we're happy to pass along the message for you if you'd rather post here :) MusikAnimal (WMF) (talk) 00:41, 6 December 2022 (UTC)
- yes, it appeared when I clicked the compare button . I will keep an eye out for its return enL3X1 ¡‹delayed reaction›¡ 20:57, 5 December 2022 (UTC)
Diannaa stepping back, new featuresEdit
@MusikAnimal (WMF) (Also ping @DanCherek, MER-C, and Diannaa-- please add anything here that you think might also be useful) If you haven't seen, Diannaa is going to be doing less work at copypatrol moving forward. This is a good a time as any to address some long running issues around how work is structured. It's no efficent, healthy, or fair for two or three people to be doing the lion shares of the work. We need to make patrolling and dealing with copyright violations like recent change patrolling, in that the majority of editors have a baseline knowledge of what to do. We need to update the processes around dealing with copyright violations to account for this, and I have two ideas in particular:
- We should have a feature that allows you to search through all the times a specific editor has been flagged at copypatrol.
- We should have a feature that allows you to look through all the reviews someone has done.
If for whatever reason these cannot be used by the general editing group at copypatrol, would it be possible to add an "admin" role at copypatrol that could do this? If this isn't the correct venue to request these features, what would be? Thank you, Moneytrees (talk) 21:48, 28 January 2023 (UTC)
- Hi Moneytrees, I never got a ping notification for this discussion, and noticed only by chance. You might like to notify the others of its existence via some other method in case they never got pinged either. Thanks. Diannaa (talk) 15:11, 2 February 2023 (UTC)
- Hey @Moneytrees! I didn't get this ping either, but I did get your message on my talk page (at the time I was on holiday). I'm sad to hear the mighty @Diannaa will be taking a break! She indeed has done the heavy lifting for some years now.
- Myself and CommTech are happy to look into streamlining CopyPatrol however you think it will help. In my opinion though, the main issue is lack of enough interested patrollers. I think running some sort of campaign to get more folks involved on enwiki is probably going to yield the best results.
- Now, looking at your two specific suggestions:
- a feature that allows you to search through all the times a specific editor has been flagged at copypatrol
- Partially doable, and would be very slow. We don't store any data about the editor in the CopyPatrol database, only the revision ID. While we can do a query to find all revisions by a given editor that exist in CopyPatrol, this wouldn't work if the revisions no longer exist. I.e. see the reviewed cases and you'll notice that now-deleted pages don't have editor info (example). We could change CopyPatrol to start storing user data, but this would be expensive and costly for the benefit it provides, I'm afraid.
- a feature that allows you to look through all the reviews someone has done
- This is doable and quite easily at that! If you can file a Phabricator task with the CopyPatrol task, we'll get it triaged in the next meeting. Or I can write a task when I find the time.
- a feature that allows you to search through all the times a specific editor has been flagged at copypatrol
- Best, MusikAnimal (WMF) (talk) 21:55, 28 February 2023 (UTC)
- @MusikAnimal (WMF) I'm planning on writing a sort of "guide to copypatrol" and some increased community activity for when I have the time. I've created a task on Phab related to the searching reviews feature, let me know if I did it wrong. Moneytrees (talk) 04:00, 5 March 2023 (UTC)
- Hi, @MusikAnimal (WMF)! I saw the task open up at Phabricator and wanted to take a stab at it since I had free time. I forgot to read this thread and didn't notice that you had plans to get it done. The PR can be found here; please feel free to close if it's an overreach. Chlod (say hi!) 06:35, 5 March 2023 (UTC)
- Not an overreach at all! We had not starting working on this. Thank you very much creating a PR :) I'll get to reviewing it soon. MusikAnimal (WMF) (talk) 18:52, 6 March 2023 (UTC)