shahine.com/omar/

homepage | Send mail to the author(s) contact

yet another Microsoft blogger

# Saturday, December 30, 2006

Amazon S3 programs

What are people using to backup to Amazon S3? I've tried a number of tools and they all fall short for one reason or another.

The basic problem with Amazon S3 is this:

  1. They do not support rename/moves (you must do a delete/copy)
  2. They write the current upload time as the modified time. This breaks programs that rely on date/time to detect changes.

Jungle Disk

Great app, it allows you to map a drive letter to your Amazon S3 account. It has built in backup, however, it does not remove things in the destination that no longer exist in the source. I personally want my backup to be a mirror of my source because I tend to organize/move files around a lot.

Jungle Disk + SyncBackSE

This combo proved to be horrible because SyncBackSE relies on file date/time modifications to detect changes. The end result is that lots of unchanged files get sync' ed again. To make matters worse, when SyncBackSE copies a file to a remote drive, it will upload a temp file and then rename it. Well since Amazon S3 does not support rename, this results in two file uploads by Jungle Disk (and wastes time and bandwidth).

SyncBackSE has an option to use file hashes to detect changes, but this is far slower and requires that each file be downloaded to get the hash. No good for remote backup where you are paying for the bandwidth.

Jungle Disk + SyncToy

If you configure SyncToy to "echo" files from your local machine to Amazon using Jungle Drive you get what appears to be perfection. However, I have noticed that renames and deletes on the source are not mirrored on the destination. I suspect this has to do with either a bug in Jungle Disk or some issue with amazon's lack of support for rename.

Furthermore, SyncToy does not delete directories on the destination using echo.

S3 Backup

I'm currently trying this out, but it lacks any backup scheduling support. I also do not know if it will delete files on the destination that are no longer on the source. I Will find out soon enough I guess.

update: looks like this scenerio is possible, but the option to delete files on the destination that are no longer on the source is greyed out in the current beta. hmm.

Anyone else have any tips here?

 

Saturday, December 30, 2006 2:50:45 PM (Pacific Standard Time, UTC-08:00)
I'm trying to decide - either go with an S3 based backup solution, or just back it up to my service provider - dreamhost.com .. Dreamhost offers me 200gb of storage, and I can just rsync from an Infrant NAS directly to the server at night.. How are you liking your S3 storage so far? Recommend it?
Saturday, December 30, 2006 4:50:57 PM (Pacific Standard Time, UTC-08:00)
I'm using Mozy right now. I don't think it uses S3 but it's priced right. Unlimited storage for $50. It takes a while to back up but I'm doing it incrementally right now. Backing up the important stuff and then adding less important stuff. It's a pretty lightweight app.
Tony
Saturday, December 30, 2006 5:21:23 PM (Pacific Standard Time, UTC-08:00)
I've been using S3, and tried syncback and synctoy. Neither worked well. I am thinking about using something else, such as a simple rsync script.
Leo
Saturday, December 30, 2006 7:13:41 PM (Pacific Standard Time, UTC-08:00)
Look at this:

http://www.storageelements.com/

My company is working on becoming a partner for their Digitiliti service. Very cool stuff. Remote backups to their redundant remote secure sites, software is agentless, can go on multiple PCs; supports versioning/snapshots, file deltas, and can backup every second, minute, hour, day, week, etc... You pick. You can filter out files you don't want to backup; if you have multiple copies of a particular file from multiple machines, it only copies it once. If the file you have already backed up has changed, it only sends the bits of the file that has changed, not the whole file. Supports bare-metal restore to any machine regardless of the hardware in the target PC.

I don't know how they handle the sync stuff you are looking for, but they are just about the coolest and most detail-driven backup solution I have seen out there. I think costs are around $4-$8 per GB stored each month. While this definitely not the cheapest solution out there, this cost is AFTER compression, and they can get a LOT backed up very efficiently. Much more efficiently than S3 or other services...
Glenn Berkshier
Wednesday, January 03, 2007 7:41:07 AM (Pacific Standard Time, UTC-08:00)
i briefly tried Elephant Drive. i don't know how it handles all the scenarios, but it seemed ok. the backup definitely needed more options.

of course the downside of all of these is the upload speed of the networks. what we really need is the upload speed restrictions lifted
jbranc
Sunday, January 07, 2007 9:09:44 PM (Pacific Standard Time, UTC-08:00)
For SyncBackSE switch off the "Make safe copies" option, and enable Fast Backup, then it doesn't matter about the dates & times and it won't use temp filenames. The first run will copy everything, but second and subsequent runs won't.
Mick
Monday, January 08, 2007 3:24:37 PM (Pacific Standard Time, UTC-08:00)
another thing you may want to consider is using something like ViceVersaPro to do the syncing to your jungle disk. it has a lot of cool features in terms of syncing (mirroring) and archiving (including keeping N number of copies on delete and overwrite in a separate archive location for X days). also has bandwidth limiting tools and a pretty powerful scheduler. i can't think of any syncing features it can't do. it can be found at www.tgrmn.com
jbranc
Tuesday, January 09, 2007 10:00:36 AM (Pacific Standard Time, UTC-08:00)
You may want to give MirrorFolder a try in combination with JungleDisk. I have been using it to mirror multiple computers in the home with my Buffalo Terastation over the network. MirrorFolder gives you the option to do real-time sync (done by intercepting I/O requests) or scheduled sync of a source folder to a target that is a physical or UNC path. Both options give you the choice of mirroring deletes/renames/moves etc.

Nik
Comments are closed.