From Wikipedia, the free encyclopedia

New to bots on Wikipedia? Read these primers!

Operator: JJMC89 ( talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)

Time filed: 02:39, Monday, February 19, 2024 ( UTC)

Function overview: Report potential copyright issues from CopyPatrol to PageTriage

Automatic, Supervised, or Manual: Automatic

Programming language(s): Python

Source code available: copypatrol-backend on GitHub

Links to relevant discussions (where appropriate): Feedback about changes to CopyPatrol was solicited at m:Special:PermaLink/25725587#New backend coming soon.

Active period(s): Continuous

Estimated number of pages affected: n/a

Namespace(s): Main/Article, Draft

Exclusion compliant: n/a

Copyright violation bot: Yes (required for trial)

Function details: This is a replacement for EranBot ( BRFA 3) to report potential copyright issues found in articles and drafts from CopyPatrol to PageTriage (shown in Special:NewPagesFeed). When a report is added to CopyPatrol, check if the page is in PageTriage (action=pagetriagelist). If so, report the revision (action=pagetriagetagcopyvio). The bot does not edit.

Notes:

  • The code has been completely rewritten, migrating from Python 2 to Python 3.11 and from the XML-RPC-based API that EranBot uses to the new Turnitin Core API (TCA). CopyPatrol is also being migrated from Toolforge tools (EranBot is running on the deprecated Grid Engine.) and database to WMCS Cloud VPS instances.
  • Once this bot starts running, even for a trial, EranBot will be disabled.
  • We're currently waiting for the WMF to sign an agreement with Turnitin before we receive production TCA access. (required before trial)

Discussion

  •  On hold. pending the TCA access. Otherwise looks fine to me, replacing functionality of existing bot. Can you drop a note on WT:NPP/R and WT:AFC please, alerting them of this BRFA. Also, is this handover being coordinated with the operator of EranBot? ProcrastinatingReader ( talk) 11:30, 20 February 2024 (UTC) reply
    The handover was authorized by User:ערן at phab:T334265. JJMC89 and myself already have access to the eranbot account on Toolforge and will be able to stop/restart it as needed. MusikAnimal talk 15:59, 20 February 2024 (UTC) reply
    Yes the handover is co-ordindated. Note that EranBot bot is used for multiple tasks beside copyright violations (outside enwiki), and the handover is specifically for copyvio aspect. Having a dedicated user for copyright violation bot seems to be more correct than a general bot, and thanks for JJMC89 and MusikAnimal drive the migration and the rewrite effort. Eran ( talk) 16:31, 20 February 2024 (UTC) reply
    @ ProcrastinatingReader: I dropped pointers to this BRFA on those two pages. It would be helpful to have trial approval and copyviobot in advance of getting TCA access to cut down on any wait or downtime. (We're coming up on the Grid Engine shutdown soon.) —  JJMC89( T· C) 23:33, 5 March 2024 (UTC) reply
    JJMC89, I think PR wants you to have access before granting a trial. Primefac ( talk) 20:58, 10 March 2024 (UTC) reply
    ProcrastinatingReader, please comment. —  JJMC89( T· C) 21:13, 11 March 2024 (UTC) reply
    Yeah, as Primefac said, I was waiting for this to be ready to begin before approving and requesting a crat give the bot the appropriate perms.
    Not that it matters too much either way, but it seems like the delay time on TCA access is much greater than BAG delay time, and it's better practice not to assign userrights to bots-in-trial until they're ready to commence trial imminently. In this case, it seems like you're blocked on other things for an indeterminate period of time, before you're able to begin trial.
    If you really want it sooner and Primefac is happy with that, then sure. Otherwise yeah, since there's no objections from users working in this area and there is the consent of the existing bot operator, I'm happy to tick this when you're ready to begin. ProcrastinatingReader ( talk) 15:26, 12 March 2024 (UTC) reply
Also a poke to @ Diannaa and Moneytrees: as some people involved in copyright issues, in case you have any comments, or know of anyone who might. ProcrastinatingReader ( talk) 11:34, 20 February 2024 (UTC) reply
Approved for trial (30 days). Please provide a link to the relevant contributions and/or diffs when the trial is complete. Please report back after this period on the bot's progress. Primefac ( talk) 12:42, 13 March 2024 (UTC) reply
A user has requested the attention of the operator. Once the operator has seen this message and replied, please deactivate this tag. (user notified) Primefac ( talk) 12:00, 30 April 2024 (UTC) reply
Not the Bot Operator but this link is what I think you mean. Nobody ( talk) 12:43, 30 April 2024 (UTC) reply
From Wikipedia, the free encyclopedia

New to bots on Wikipedia? Read these primers!

Operator: JJMC89 ( talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)

Time filed: 02:39, Monday, February 19, 2024 ( UTC)

Function overview: Report potential copyright issues from CopyPatrol to PageTriage

Automatic, Supervised, or Manual: Automatic

Programming language(s): Python

Source code available: copypatrol-backend on GitHub

Links to relevant discussions (where appropriate): Feedback about changes to CopyPatrol was solicited at m:Special:PermaLink/25725587#New backend coming soon.

Active period(s): Continuous

Estimated number of pages affected: n/a

Namespace(s): Main/Article, Draft

Exclusion compliant: n/a

Copyright violation bot: Yes (required for trial)

Function details: This is a replacement for EranBot ( BRFA 3) to report potential copyright issues found in articles and drafts from CopyPatrol to PageTriage (shown in Special:NewPagesFeed). When a report is added to CopyPatrol, check if the page is in PageTriage (action=pagetriagelist). If so, report the revision (action=pagetriagetagcopyvio). The bot does not edit.

Notes:

  • The code has been completely rewritten, migrating from Python 2 to Python 3.11 and from the XML-RPC-based API that EranBot uses to the new Turnitin Core API (TCA). CopyPatrol is also being migrated from Toolforge tools (EranBot is running on the deprecated Grid Engine.) and database to WMCS Cloud VPS instances.
  • Once this bot starts running, even for a trial, EranBot will be disabled.
  • We're currently waiting for the WMF to sign an agreement with Turnitin before we receive production TCA access. (required before trial)

Discussion

  •  On hold. pending the TCA access. Otherwise looks fine to me, replacing functionality of existing bot. Can you drop a note on WT:NPP/R and WT:AFC please, alerting them of this BRFA. Also, is this handover being coordinated with the operator of EranBot? ProcrastinatingReader ( talk) 11:30, 20 February 2024 (UTC) reply
    The handover was authorized by User:ערן at phab:T334265. JJMC89 and myself already have access to the eranbot account on Toolforge and will be able to stop/restart it as needed. MusikAnimal talk 15:59, 20 February 2024 (UTC) reply
    Yes the handover is co-ordindated. Note that EranBot bot is used for multiple tasks beside copyright violations (outside enwiki), and the handover is specifically for copyvio aspect. Having a dedicated user for copyright violation bot seems to be more correct than a general bot, and thanks for JJMC89 and MusikAnimal drive the migration and the rewrite effort. Eran ( talk) 16:31, 20 February 2024 (UTC) reply
    @ ProcrastinatingReader: I dropped pointers to this BRFA on those two pages. It would be helpful to have trial approval and copyviobot in advance of getting TCA access to cut down on any wait or downtime. (We're coming up on the Grid Engine shutdown soon.) —  JJMC89( T· C) 23:33, 5 March 2024 (UTC) reply
    JJMC89, I think PR wants you to have access before granting a trial. Primefac ( talk) 20:58, 10 March 2024 (UTC) reply
    ProcrastinatingReader, please comment. —  JJMC89( T· C) 21:13, 11 March 2024 (UTC) reply
    Yeah, as Primefac said, I was waiting for this to be ready to begin before approving and requesting a crat give the bot the appropriate perms.
    Not that it matters too much either way, but it seems like the delay time on TCA access is much greater than BAG delay time, and it's better practice not to assign userrights to bots-in-trial until they're ready to commence trial imminently. In this case, it seems like you're blocked on other things for an indeterminate period of time, before you're able to begin trial.
    If you really want it sooner and Primefac is happy with that, then sure. Otherwise yeah, since there's no objections from users working in this area and there is the consent of the existing bot operator, I'm happy to tick this when you're ready to begin. ProcrastinatingReader ( talk) 15:26, 12 March 2024 (UTC) reply
Also a poke to @ Diannaa and Moneytrees: as some people involved in copyright issues, in case you have any comments, or know of anyone who might. ProcrastinatingReader ( talk) 11:34, 20 February 2024 (UTC) reply
Approved for trial (30 days). Please provide a link to the relevant contributions and/or diffs when the trial is complete. Please report back after this period on the bot's progress. Primefac ( talk) 12:42, 13 March 2024 (UTC) reply
A user has requested the attention of the operator. Once the operator has seen this message and replied, please deactivate this tag. (user notified) Primefac ( talk) 12:00, 30 April 2024 (UTC) reply
Not the Bot Operator but this link is what I think you mean. Nobody ( talk) 12:43, 30 April 2024 (UTC) reply

Videos

Youtube | Vimeo | Bing

Websites

Google | Yahoo | Bing

Encyclopedia

Google | Yahoo | Bing

Facebook