Operator: Usernamekiran ( talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 16:51, Monday, January 9, 2023 (UTC)
Automatic, Supervised, or Manual: supervised
Programming language(s): AWB
Source code available: AWB's custom module using regex, will upload in my userspace soon
Function overview: remove references/links en masse (expired/hijacked domains)
Links to relevant discussions (where appropriate): special:permalink/1132589552#pakrail.com at WP:COIN
Edit period(s): mostly a one-time run per request (removing a spammy link)
Estimated number of pages affected: around 1000 for current request
Exclusion compliant (Yes/No): No
Already has a bot flag (Yes/No): Yes
Function details: Currently, pakrail.com redirects to an online casino website. It has been used in around 1170 railway-related articles. I created a regex that finds each instance of pakrail.com and removes the enclosing <ref> ... pakrail.com ... </ref>.
I made around 50 edits through my alt account, Usernamekiran (AWB), using that regex. Currently it removes the link only if it is inside a referencing template.
There is no scope for mistake, so I would like approval to save the edits automatically.
Currently it does not remove plain links from the "External links" section (e.g. * [http://pakrail.com Pakistan Railways official site]). I will remove these links using some other method from AWB, and I will perfect the method soon.
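The operator's AWB module has not been posted, so the following is only a minimal Python sketch of the kind of regex described above, not the actual code used. The names `REF_WITH_DOMAIN` and `strip_domain_refs` are hypothetical. It removes a complete `<ref>...</ref>` pair whose body mentions the domain, and it assumes the opening tag contains no `/` character (so self-closing reuses such as `<ref name="x" />` are left alone).

```python
import re

DOMAIN = "pakrail.com"  # the hijacked domain named in this request

# One complete <ref ...>...</ref> pair whose body mentions DOMAIN.
# The lookahead (?!</?ref\b) stops the body from crossing into another
# <ref> tag, so a neighbouring reference is never swallowed.
REF_WITH_DOMAIN = re.compile(
    r"<ref\b[^>/]*>(?:(?!</?ref\b).)*?"
    + re.escape(DOMAIN)
    + r"(?:(?!</?ref\b).)*?</ref\s*>",
    re.IGNORECASE | re.DOTALL,
)


def strip_domain_refs(wikitext: str) -> str:
    """Return wikitext with every <ref> that cites DOMAIN removed."""
    return REF_WITH_DOMAIN.sub("", wikitext)


sample = (
    "Built in 1900.<ref>{{cite web |url=http://pakrail.com/x |title=T}}</ref>"
    " Rebuilt in 1950.<ref>{{cite book |title=Rail}}</ref>"
)
cleaned = strip_domain_refs(sample)
```

In this sketch the second reference survives because its body never mentions the domain. Plain bullet links in an "External links" section are deliberately out of scope here, matching the limitation stated above.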
PS: Previous BRFAs were filed under the bot's old username, UsernamekiranBOT. —usernamekiran (talk) 16:54, 9 January 2023 (UTC)
PPS: pakrail.com was never the official website. —usernamekiran (talk) 17:13, 9 January 2023 (UTC)
Is there some reason you don't just let GreenC's bot (see Wikipedia:Link rot/URL change requests) do this? * Pppery * it has begun... 16:57, 9 January 2023 (UTC)
{{usurped}} in others, etc. It's a complex process. See WP:USURPURL. Code is already in place to handle it. - GreenC 18:18, 9 January 2023 (UTC)
In Special:Diff/1132588299 you left behind an orphaned ref. It worked out in the end: after AnomieBOT rescued it, you took care of that copy too. But it would have been better not to leave the orphan in the first place. Anomie ⚔ 04:38, 10 January 2023 (UTC)
Now it does that as well. I was referring to the defined references, like the first diff you provided, where a fragment was left behind. Now it handles that format as well. —usernamekiran (talk) 12:38, 10 January 2023 (UTC)
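One reading of the orphan problem is that a named definition such as `<ref name="x">...</ref>` was removed while its reuses (`<ref name="x" />`) stayed behind. The sketch below illustrates that reading only; it is an assumption about the failure mode, not the operator's fix, and `NAMED_REF` and `strip_named_refs` are hypothetical names. It removes named definitions that cite the domain, records their names, and then removes the matching self-closing reuses.

```python
import re

DOMAIN = "pakrail.com"  # the hijacked domain named in this request

# A named definition: <ref name="x">body</ref> (quotes optional).
NAMED_REF = re.compile(
    r"<ref\s+name\s*=\s*(?P<q>[\"']?)(?P<name>[^\"'>/]+?)(?P=q)\s*>"
    r"(?P<body>(?:(?!</?ref\b).)*?)</ref\s*>",
    re.IGNORECASE | re.DOTALL,
)


def strip_named_refs(wikitext):
    """Remove named refs citing DOMAIN plus their self-closing reuses."""
    dead = set()

    def drop(match):
        if DOMAIN in match.group("body").lower():
            dead.add(match.group("name").strip())
            return ""
        return match.group(0)  # unrelated definition: keep unchanged

    text = NAMED_REF.sub(drop, wikitext)
    for name in dead:
        # Reuse of a removed definition, e.g. <ref name="x" />
        reuse = re.compile(
            r"<ref\s+name\s*=\s*[\"']?" + re.escape(name) + r"[\"']?\s*/>",
            re.IGNORECASE,
        )
        text = reuse.sub("", text)
    return text, dead


sample = (
    'A.<ref name="pr">[http://pakrail.com Site]</ref> B.<ref name="pr" />'
    ' C.<ref name="ok">Book</ref> D.<ref name="ok" />'
)
new_text, removed_names = strip_named_refs(sample)
```

Removing the reuses together with the definition means no later cleanup pass is needed to rescue or delete the orphan. Refs that carry extra attributes (for example `group=`) are not handled in this sketch.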
{{Bot trial complete}} Well, sort of. It was done using my alt Usernamekiran (AWB) (talk · contribs). I did around 1100 edits semi-automatically, and all these edits were okay. The only unexpected one was the one pointed out above by Anomie (I somehow missed it when I was doing the edits), but it has now been taken care of. —usernamekiran (talk) 15:46, 10 January 2023 (UTC)
{{BAG assistance needed}} I have already finished this particular task. But would it be possible to get clearance for non-controversial, non-cosmetic, non-judgement-call (non-CONTEXTBOT) one-off find-and-replace tasks? I don't come across such tasks much, but in case I do, it would be convenient to have the "auto save" option on AWB. I will test my regex thoroughly in my sandbox before every task. —usernamekiran (talk) 05:16, 25 January 2023 (UTC)