I often import old edits that are not in the English Wikipedia database from older versions of Wikipedia to restore missing edits. In almost all cases, this only affects the page history, and the page content is not affected, but see principle #10 below. This page describes how I handle certain situations when importing this old history.
There are two methods of importing, either via another wiki (transwiki import) or via an XML file on a user's computer (upload import). The former is available to all administrators on the English Wikipedia per bug 20280, while the latter is generally restricted to stewards because it can very easily be used to falsify page histories, among other things. I have been granted the right to use upload imports per this discussion; this right allows me to import edits from the 2002 and 2003 database dumps, among other things (see below).
The Nostalgia Wikipedia is a copy of the Wikipedia database from 20 December 2001, when Wikipedia used UseModWiki rather than MediaWiki. In the following text, "nost.wp" means the Nostalgia Wikipedia and "en.wp" means the English Wikipedia.
Overlapping edits can occur in the following three circumstances:
Note: In the second and third instances, if the duplicate edits are merged, it is impossible for an admin to separate them because they have the same timestamp.
To deal with these overlapping edits, when I only had the transwiki import right, I would normally follow this procedure, where "pagename" was the name of the page (importing by upload allows me to change the name of the page manually):
I used a similar procedure when the page titles on en.wp and nost.wp differed. If there were no overlapping edits in the two page histories, it was often easier to move the English Wikipedia page to the nost.wp title before importing the edits. When importing talk pages, I imported them to the main namespace, since very few encyclopedic articles in the English Wikipedia end with the title "/Talk".
A major source of edits to import is the automatically generated list of pages with the most revisions on the Nostalgia Wikipedia; as of 16 December 2011, I have analysed all of the 5,000 pages on that list for edits worth importing; the list consists of 38,765 edits out of the 93,105 old edits in the Nostalgia Wikipedia database (including those by Conversion script), only 41.6% of the total number of edits. Another way to find edits that should be imported is to check the contributions of editors who have edited between 20 December 2001, the last old edit in the Nostalgia Wikipedia, and 25 January 2002, when UseModWiki was replaced with the Phase II software. The latter method can yield results because under the KeptPages system, when an edit was made when Wikipedia used UseModWiki, older edits would be deleted to make room for the newer edit.
I have downloaded the English Wikipedia dumps from 2002 and 2003 that are available from the above-linked site and installed them on a local copy of MediaWiki 1.21.1, the latest release version of the software available when I started working on this project on 9 June 2013. I used two XAMPP instances under Windows, one containing the latest version for the updated MediaWiki installation, and the other containing XAMPP 1.4.16 (Apache 2.0.54/MySQL 4.1.14/PHP 5.0.5/4.4.0) to run MediaWiki 1.5 and earlier. I used MediaWiki 1.2.6for the March 2002 database dump and MediaWiki 1.3 for the others. I used the dumps to retrieve old edits that had gone missing from Wikipedia. I wrote some tips about upgrading from very old database schemas at the relevant section of the MediaWiki manual about upgrading.
I often import old edits that are not in the English Wikipedia database from older versions of Wikipedia to restore missing edits. In almost all cases, this only affects the page history, and the page content is not affected, but see principle #10 below. This page describes how I handle certain situations when importing this old history.
There are two methods of importing, either via another wiki (transwiki import) or via an XML file on a user's computer (upload import). The former is available to all administrators on the English Wikipedia per bug 20280, while the latter is generally restricted to stewards because it can very easily be used to falsify page histories, among other things. I have been granted the right to use upload imports per this discussion; this right allows me to import edits from the 2002 and 2003 database dumps, among other things (see below).
The Nostalgia Wikipedia is a copy of the Wikipedia database from 20 December 2001, when Wikipedia used UseModWiki rather than MediaWiki. In the following text, "nost.wp" means the Nostalgia Wikipedia and "en.wp" means the English Wikipedia.
Overlapping edits can occur in the following three circumstances:
Note: In the second and third instances, if the duplicate edits are merged, it is impossible for an admin to separate them because they have the same timestamp.
To deal with these overlapping edits, when I only had the transwiki import right, I would normally follow this procedure, where "pagename" was the name of the page (importing by upload allows me to change the name of the page manually):
I used a similar procedure when the page titles on en.wp and nost.wp differed. If there were no overlapping edits in the two page histories, it was often easier to move the English Wikipedia page to the nost.wp title before importing the edits. When importing talk pages, I imported them to the main namespace, since very few encyclopedic articles in the English Wikipedia end with the title "/Talk".
A major source of edits to import is the automatically generated list of pages with the most revisions on the Nostalgia Wikipedia; as of 16 December 2011, I have analysed all of the 5,000 pages on that list for edits worth importing; the list consists of 38,765 edits out of the 93,105 old edits in the Nostalgia Wikipedia database (including those by Conversion script), only 41.6% of the total number of edits. Another way to find edits that should be imported is to check the contributions of editors who have edited between 20 December 2001, the last old edit in the Nostalgia Wikipedia, and 25 January 2002, when UseModWiki was replaced with the Phase II software. The latter method can yield results because under the KeptPages system, when an edit was made when Wikipedia used UseModWiki, older edits would be deleted to make room for the newer edit.
I have downloaded the English Wikipedia dumps from 2002 and 2003 that are available from the above-linked site and installed them on a local copy of MediaWiki 1.21.1, the latest release version of the software available when I started working on this project on 9 June 2013. I used two XAMPP instances under Windows, one containing the latest version for the updated MediaWiki installation, and the other containing XAMPP 1.4.16 (Apache 2.0.54/MySQL 4.1.14/PHP 5.0.5/4.4.0) to run MediaWiki 1.5 and earlier. I used MediaWiki 1.2.6for the March 2002 database dump and MediaWiki 1.3 for the others. I used the dumps to retrieve old edits that had gone missing from Wikipedia. I wrote some tips about upgrading from very old database schemas at the relevant section of the MediaWiki manual about upgrading.