Monday, 24 May 2010
#4n6 event, and CLIR report on digital forensics as applied to cultural materials
We've been adapting digital forensics tools and techniques within BEAM (Bodleian Electronic Archives and Manuscripts) for a few years now, and this meeting was a useful event to talk about how we do this, and some of the issues (process, technical and ethical) it raises.
It was a good meeting, and I very much enjoyed hearing from other digital archivists and *real* forensics practitioners (they have rather different objectives to ours, but their tools are still useful!). Another highlight for me was Stephen Ennis' framing thoughts, presented in the first session. Ennis grounded the discussion, with three key - and very practical - points that should be important to any archivist:
1) What is the hard-cash value of born-digital archives?
Ennis contends that monetary value has been a preservation agent for literary manuscripts. If disks and digital data are of no value, their survival rate is likely to be poor. He cited the example of John Updike's archive (at Harvard), which contained software disks but no related data disks. It's worrying that dealers don't/won't appraise born-digital material, but this will surely change. Another issue is that we need dealers to be able to appraise digital archives without altering what they are appraising. Will they have to adopt digital forensic techniques too?
2) Are the steps that seem justified for celebrity authors justified for others?
This question is very important and equally applicable to 'papers', of course. In the digital domain, the obvious 'celebrity' example is the work Emory's MARBL have done to make one of Salman Rushdie's hard disks accessible to scholars through an emulator and a searchable database. We certainly won't be processing every digital archive submission at this level, and I suspect MARBL won't either. Where it's justified, I think it's a very good thing.
3) What is the researcher's object of study? Are we promoting new and different forms of enquiry?
This question, perhaps, gets closer to exploring our simultaneous excitement and concern when we consider the potential of combining scholarly enquiry and digital forensic tools in relation to born-digital archives. There's a good deal we need to learn about scholars' requirements and I'm looking forward to the day that we have more case studies so we can move this discussion beyond conjecture!
If you're interested in finding out more in advance of the report, you'll probably find that some of the slides will be published in due course at the event's website. You can also take a look at some photos and tweets.
I may extend this post with some of the more interesting tidbits if I find a moment.
Thursday, 29 April 2010
Passwords you never created and never knew
Last weekend the main problem was an unknown password for an email account. In a scenario which can't be that uncommon, an email account had been established by a friend and the password for it remembered by the email client but no human being. Luckily we were able to salvage the password using one of these tools and restore access to the email via a new client on the new computer.
It seems all to possible that we will encounter this scenario with a depositor at some stage, so it's handy to have an easy fix for it. On the other hand, it's a little worrying how easy a fix it is...
Wednesday, 28 April 2010
So long floppy, hello retro cool!
To balance this bad news, I also wanted to flag up the Vintage Computer Festival up the road at Bletchly Park. Lets hope they raise a glass to deprecated storage devices and their tales!
Friday, 23 April 2010
Do you know the way to Dundee?
I'll be covering the workflow we're adopting here at futureArch and hopefully demo part of it, as well as discussing our digital asset management system, the foundation for our archive and how those ideas may scale to smaller systems.
Hope to see you there and if not I'm sure we'll be reporting back right here so stay tuned!
(Also a bit (um, I mean big) thank you to Jennifer Johnstone for helping me find my way to Dundee! :-))
Wednesday, 14 April 2010
Using a D-Link DGE-530T Gigabit Network adapter in ESX 4.
1. Acquire the skge.o driver which supports the Marvell Yukon 88E0001 chipset
The discussion Using a Marvell LAN card with ESXi 4 contains a link to a tarball sky2-and-skge-for-esxi4-0.02.tar.gz containing both the sky2 and skge driver
2. login to ESX 4.0 as root and copy the skge.o driver to /usr/lib/vmware/vmknod
2.1 download sky2-and-skge-for-esxi4-0.02.tar.gz
2.2 tar xvzf ../sky2-and-skge-for-esxi4-0.02.tar.gz
2.3 cp vmtest/usr/lib/vmware/vmkmod/skge.o /usr/lib/vmware/vmkmod
3. run 'lspci' and identify the NICs location (the xx:xx.x number in front of the description)
03:00.0 Ethernet controller: D-Link System Inc Unknown device 4b01 (rev 11)
4. run 'lspci -n' and determine the vendor and device IDs (for D-Link it should be 1186:xxxx)
lspci -n
00:00.0 0600: 8086:29b0 (rev 02)
(snipped)
03:00.0 0200: 1186:4b01 (rev 11)
03:02.0 0200: 8086:1026 (rev 04)
5. create the vmware pciid file '/etc/vmware/pciid/skge.xml' here's a listing of the mine
cat /etc/vmware/pciid/skge.xml
<?xml version='1.0' encoding='iso-8859-1'?>
<pcitable>
<vendor id="1186">
<short>D-Link System Inc</short>
<name>D-Link System Inc</name>
<device id="4b01">
<vmware label="nic">
<driver>skge</driver>
</vmware>
<name>DGE-530T Ethernet NIC</name>
<table file="pcitable" module="ignore" />
<table file="pcitable.Linux" module="skge">
<desc>D-Link System|DGE-530T Ethernet NIC</desc>
</table>
</device>
</vendor>
</pcitable>
6. create file /etc/vmware/init/manifests/vmware-skge.mf which contains a single line as shown
cat /etc/vmware/init/manifests/vmware-skge.mf
copy /usr/lib/vmware/vmkmod/skge.o
7. reboot the server and checking the /var/log/vmware/esxcfg-boot.log should confirm:
That the esxcfg boot process has loaded the skge.xml metafile , constructed the new vmware-devices.map file and included the skge.o driver in the initramfs image.
8. running 'lspci' after adding a second DGE-530T card now shows
03:00.0 Ethernet controller: D-Link System Inc DGE-530T Ethernet NIC (rev 11)
03:02.0 Ethernet controller: D-Link System Inc DGE-530T Ethernet NIC (rev 11)
Of course the normal caveats and disclaimers apply as in not supported by VMware etc.
Monday, 12 April 2010
Want to be our new graduate trainee?
Wednesday, 31 March 2010
Microsoft Works library
Tuesday, 30 March 2010
Disk imaging for older floppies
Monday, 22 March 2010
Media Recognition Guide - Flash Media
Flash memory is the alternative to byte-programmable memory, which is used by hard, floppy and Zip disks. It is much less expensive, meaning large capacity devices are economically viable and has faster access times and much better shock resistance and durability. Altogether this makes it particularly suitable for use as a portable storage device. Flash memory does have a finite number of write-erase cycles, but manufacturers can guarantee at least 100,000 cycles, which is a much larger number than with byte-programmable memory.
| Type: | Flash memory data storage device with USB interface |
| Introduced: | 2000, though the company that invented the device is a legal issue. |
| Active: | Yes |
| Cessation: | - |
| Capacity: | First drive had a capacity of 8 MB but the latest versions can have capacities as large as 256 GB |
| Compatibility: | Widely supported by modern operating systems including Windows, Mac OS, Linux and Unix systems. |
| Users: | Broad. Has replaced 3.5” floppy disks as the preferred device for individuals and small organisations for personal data storage, transfer and backup. |
| File Systems: | FAT, NTFS, HFS+, ext2, ext3 |
| Common manufacturers: | |
Recognition
High Level Formatting
| Type: | Flash memory data storage device with firewire interface |
| Introduced: | 2004 |
| Active: | Yes |
| Cessation: | - |
| Capacity: | Either 4, 8 or 16 GB |
| Compatibility: | Compatible with any computer with a firewire connector |
| Users: | Limited. Never achieved the same popularity as USB flash drives. They come in smaller sizes and have slower memory |
| File Systems: | FAT, NTFS, HFS+, ext2, ext3 |
| Common manufacturers: | Kanguru |
Recognition
Media Recognition Guide - Iomega Zip Disks
| Type: | Removable disk storage |
| Introduced: | 1994 |
| Active: | Yes, but used by minority |
| Cessation: | - |
| Capacity: | Either 100, 250 or 750MB |
| Compatibility: | Zip drive needs to be of a matching or higher capacity than the Zip disk. Supports Windows OS, IBM OS/2, Mac OS 7.6 to 9.2, MAC OS X and some Linux OS. |
| Users: | Small businesses and personal users to backup files |
| File Systems: | NTFS, FAT, ext2, HFS/+, ADFS |
| Common manufacturers: | Iomega |
Recognition
| Drive | Interface | |||
| PATA | SCSI | USB | FireWire | |
| Zip 100 | Yes | Yes | Yes | No |
| Zip 250 | Yes | Yes | Yes | Yes |
| Zip 750 | Yes | No | Yes | Yes |
High Level Formatting
This is done in the same way as formatting floppy disks: with the disk inserted in the drive open ‘My Computer’, right click over the Zip disk drive icon and select ‘Format’. There are two options; either ‘Short Format’ or ‘Long Format’ and you can change the file system used with either option by selecting either Mac or PC. Click ‘Start’ and the disk will be formatted.
Insert the disk into the zip drive. Open the IomegaWare folder then open the ‘Tools’ folder and double click on the Tools icon, which will open up the window. From here click on the appropriate icon for the disk you wish to format. There are two options, either ‘Short Erase’ or ‘Long Erase’. Long Erase should be used for disks containing errors. Select ‘Erase’ to begin formatting the disk. All content will be erased and the disk will be formatted to a file system appropriate for Mac OS (HFS/+).
Formatting with Linux
Monday, 15 March 2010
Media Recognition - Hard Disk Drive part 3
| Type: | Magnetic storage media |
| Introduced: | 2004 |
| Active: | Yes [2010] |
| Cessation: | - |
| Capacity: | Varies, but majority do not exceed 300GB |
| Compatibility: | Compatible with all operating systems, though drives with a capacity of 137GB or more are only compatible with Windows 98 onwards and Mac OS 10.2 onwards. Not found on 8” or 5.25” drives. |
| Users: | Servers and high-end computers |
| File Systems: | FAT, NTFS, HFS/+, ext |
| Common manufacturers: | Western Digital, Seagate, Toshiba, |
Recognition
SAS was born out of SCSI developments and entered the market in 2004. One feature making it preferable to SCSI is its higher transfer rate. Its fast speeds and high level performance make it suitable for high-end personal computer hard drives and servers. The first version was slower than the latest version of SCSI having a data transfer rate of 300 MB/s. However, in 2009 this rate increased to 600 MB/s and it is expected to reach 1200 MB/s by 2012. SAS uses point-to-point topology to connect the interface and can support multiple devices (up to 200), making it popular with servers. For the same reasons SAS hard disk drives are relatively expensive therefore they are not as common on standard personal computers as the more general purpose SATA interface.
External Hard Disk Drives
Early Apple Macintosh computers used external SCSI hard disk drives, despite internal hard disk drives being the standard for other PCs. More recently external hard drives are primarily used as additional storage devices.
FireWire (IEEE 1394): First released in 1995 this was originally developed as a replacement for the SCSI connector and many computers since 2003 have a built-in FireWire port, particularly Apple machines. FireWire has a higher transfer rate than USB and the latest version, FireWire 3200 has a rate of 393 MB/s, which also exceeds that of eSATA, although this rate varies with Windows OS. However it is more expensive than USB, hence it has never superseded USB’s popularity. It is compatible with Windows OS from Windows XP onwards, though issues with
There have been several versions of FireWire each using different connectors. Here is a brief table setting this out:
| Version | Cable Used | Date Introduced |
| FireWire 400 (IEEE 1394) | 6-circuit | 1995 |
| FireWire 400 (IEEE 1394a) | 4-circuit | 2000 |
| FireWire 800 (IEEE 1394b) | 9-circuit | 2002 |
| FireWire S3200 | 9-circuit | 2007 |
It is most common to find 6-cicuit connectors on desktop computers and 4-cicuit connectors on laptops. However, in 2000 amendments were made and the 4-cicuit connector was standardised resulting in more of these connectors being found on desktop computers.
USB (Universal Serial Bus): USB was introduced in 1996 and has since become the dominant means to connect computer peripherals to the host controller. The original USB 1.0 has a transfer rate of 12 Mbits/s, which was increased to 60 MB/s (480 Mbits/s) by USB 2.0. This was released in 2000 and standardised in 2001. Like FireWire, USB connectors carry power as well as data, therefore do not require additional power cables.
eSATA: This is SATA’s own external connector introduced in 2004 with a transfer rate of 131 MB/s. Despite having a much larger data transfer rate, few computers have eSATA ports, favouring instead USB and FireWire.