Operator: DreamRimmer ( talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 14:01, Monday, May 27, 2024 ( UTC)
Automatic, Supervised, or Manual: automatic
Programming language(s): Python
Source code available:
Function overview: Fix the URLs for the ECI election database.
Links to relevant discussions (where appropriate):
Edit period(s): Every six months
Estimated number of pages affected: 5050
Exclusion compliant (Yes/No): No
Already has a bot flag (Yes/No): No
Function details: The
Election Commission of India has moved all of its data (except for very recent elections) to a subdomain. As a result, URLs in more than 5000 pages are now invalid and are giving a 404 error. This bot will replace URLs like
https://eci.gov.in/files/file/11699-maharashtra-legislative-assembly-election-2019
with the new URL
https://old.eci.gov.in/files/file/11699-maharashtra-legislative-assembly-election-2019
. Simply replace
https://eci.gov.in/
with
https://old.eci.gov.in/
.
Why every six months? Primefac ( talk) 18:28, 27 May 2024 (UTC)
https://eci.gov.in/
since it's a "recent election". At what point will that URL get archived to the
https://old.eci.gov.in/
prefix? If it is archived after the subsequent election, why not just update the URL with the new election information along with the data it represents?
Primefac (
talk)
15:00, 6 June 2024 (UTC)
(?<!/)(?<!\\?url=)https?://eci[.]gov[.]in/[^\\s\\]|}{<]*[^\\s\\]|}{<]*
|url-status=
, {{
webarchive}}
and {{
dead link}}
. Also links that are square and bare. It might too difficult to get all these exactly right, if you can change the main |url=
and square URLs and verify the new URL works, that will go a long way! --
Green
C
15:51, 8 June 2024 (UTC)
Note: these links are georestricted to India IPs and can't be archived, or archived very well. I found an article in The Hindu that talks about it. The article quotes one our most technically knowledgeable editors, User:Nemo_bis, who said: "Nemo has studied 'geofencing' of Indian government websites in the past, and in 2020 created a proxy service for users located abroad to access Indian government websites". This might be our solution. I hope Nemo has a working proxy for the Election Commission website? -- Green C 17:58, 5 July 2024 (UTC)
Operator: DreamRimmer ( talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 14:01, Monday, May 27, 2024 ( UTC)
Automatic, Supervised, or Manual: automatic
Programming language(s): Python
Source code available:
Function overview: Fix the URLs for the ECI election database.
Links to relevant discussions (where appropriate):
Edit period(s): Every six months
Estimated number of pages affected: 5050
Exclusion compliant (Yes/No): No
Already has a bot flag (Yes/No): No
Function details: The
Election Commission of India has moved all of its data (except for very recent elections) to a subdomain. As a result, URLs in more than 5000 pages are now invalid and are giving a 404 error. This bot will replace URLs like
https://eci.gov.in/files/file/11699-maharashtra-legislative-assembly-election-2019
with the new URL
https://old.eci.gov.in/files/file/11699-maharashtra-legislative-assembly-election-2019
. Simply replace
https://eci.gov.in/
with
https://old.eci.gov.in/
.
Why every six months? Primefac ( talk) 18:28, 27 May 2024 (UTC)
https://eci.gov.in/
since it's a "recent election". At what point will that URL get archived to the
https://old.eci.gov.in/
prefix? If it is archived after the subsequent election, why not just update the URL with the new election information along with the data it represents?
Primefac (
talk)
15:00, 6 June 2024 (UTC)
(?<!/)(?<!\\?url=)https?://eci[.]gov[.]in/[^\\s\\]|}{<]*[^\\s\\]|}{<]*
|url-status=
, {{
webarchive}}
and {{
dead link}}
. Also links that are square and bare. It might too difficult to get all these exactly right, if you can change the main |url=
and square URLs and verify the new URL works, that will go a long way! --
Green
C
15:51, 8 June 2024 (UTC)
Note: these links are georestricted to India IPs and can't be archived, or archived very well. I found an article in The Hindu that talks about it. The article quotes one our most technically knowledgeable editors, User:Nemo_bis, who said: "Nemo has studied 'geofencing' of Indian government websites in the past, and in 2020 created a proxy service for users located abroad to access Indian government websites". This might be our solution. I hope Nemo has a working proxy for the Election Commission website? -- Green C 17:58, 5 July 2024 (UTC)