If you’re a regular ProfHacker reader, you already know that we are obsessed with backup. You probably even know that we begin every post about backup by telling you that we are obsessed with backup.
It’s true: we are obsessed with backup. And it’s true: we keep telling you that we’re obsessed. You might think we’re being a nuisance by relentlessly revisiting the topic of backing up your data, but we like to think of ourselves as a contagion, spreading the Got-To-Backup-Now Virus (G2BNV). In the end, we hope every one of our readers is infected.
Most recently, Brian took a comprehensive look at backing up your social network, primarily Twitter and Facebook. ProfHacker will have a future post looking at Facebook’s new built-in export/backup feature, but today, I want to look more closely at backing up your Twitter account. More specifically, I want to highlight several alternatives to the backup services Brian mentions. The solutions here differ significantly from other backup services because they archive your Twitter activity on your own server.
Why Archive Twitter?
But why would you want to archive your Twitter stream? Maybe you don’t. It probably depends on how often you tweet and what you usually tweet about. I certainly didn’t think it was important for my first several hundred tweets or so. But then I began using Twitter to try out ideas that could possibly work their way into my teaching, my research, or even into ProfHacker. It turns out that Twitter is often where the first draft of my serious thinking occurs. And I didn’t want to those fragments of my thinking to disappear—which is effectively what happens once you have more than 3,200 tweets.
Twitter’s public search record only goes back 3,200 tweets for each account. There is no service that can recover tweets beyond your more recent 3,200 tweets. All of your tweets are permanently stored, bound as they are for the Library of Congress, but unless you know the exact ID number of a tweet earlier than 3,200 ago, you cannot readily access it.
So, if you’re approaching 3,000 tweets (and it doesn’t take nearly as long as you might think), you’ll want to begin archiving your Twitter activity, so the earliest ones (which are often quite quaint) don’t disappear.
Why Archive on Your Own Server?
So if you want to archive, why not stick with third-party services like BackupMyTweets or TwapperKeeper? These services are free and reliable, and there’s no reason why you shouldn’t be satisfied with them. But what I like about backing up material onto my own server is that it’s mine and I have complete control over it. After nearly 10,000 tweets, I’ve invested some serious intellectual labor into these bullet-bursts of writing, and those ideas are mine, and I want to keep them safe, on my own domain.
Of course, you’ll need your own server for any of the following three Twitter archiving solutions. Luckily, ProfHacker can get you started with your own domain.
Solution 1: TwapperKeeper on Your Own Server
TwapperKeeper is one of the better known Twitter archiving services, allowing you to archive your own Twitter stream, hashtags, and even random keywords. So when TwapperKeeper announced an open version of TwapperKeeper that you could run on your own server—called Your TwapperKeeper—it was exciting news. However, I was disappointed to find that I could not get Your TwapperKeeper running on my server. I’m reasonably adept at installing software on a LAMP (Linux-Apache-MySql-PHP) server, but after a great deal of frustration I finally gave up on TwapperKeeper. If you’ve had success running your own version on TwapperKeeper on a server, let us know in the comments!
Solution 2: Archiving Twitter through WordPress
If you’ve already got WordPress running on a server, you can turn a WordPress blog into a platform for archiving Twitter. Douglas Bowman (who just happens to be Twitter’s Creative Director) has written up easy-to-follow directions that make use of the Twitter Tools WordPress plugin. In no time at all I got this solution working on my own server, where my most recent 4,000 or so tweets are easily accessible and even better, searchable. The tweets appear in a blog all of their own, so they don’t clutter up my regular blog, something readers only interested in my longer writing might appreciate.
Solution 3: Archiving Twitter through Tweet Nest
I like the WordPress solution, but it has one major flaw: the archive does not link back to the original tweet on Twitter. The tweets are duplicated on my server, but there’s no easy way to get back to the source.
Enter Tweet Nest, a stand-alone open source PHP application coded primarily by Andy Graulund. Tweet Nest requires a bit more backend tinkering on your server and in MySQL, but Graulund’s instructions are clear, and again, in no time at all, I had a workable archive on my server.
Unlike the above WordPress solution, Tweet Nest does indeed link back to the original tweet on Twitter, as seen in this post of mine, archived by Tweet Nest:

Clicking the time-date stamp leads to the original post on Twitter. Not only that, but by clicking the “in reply” link, you end up at the tweet that began the conversation. Tweet Nest also provides a few data visualizations, such as a monthly calendar, showing the total number of daily tweets (and public replies).

What about You?
What about you? Are you likely to archive your Twitter account? Are you likely to try doing so on your own server? What other solutions have you found? (Hint: I have one more myself, which I’ll be sharing on ProfHacker very soon!)
[I'm Just Looking for Something image courtesy of Flickr user tpholland / Creative Commons Licensed]




6 Responses to Archiving Twitter on Your Own Server
kosboot - October 26, 2010 at 5:05 pm
Perhaps one of the best features of TwapperKeeper is that it can do a rudimentary analysis of your tweets via Summarizr . Take a look at Summarizr’s analysis of the Internet Librarian 2010 conference (currently going on) in Monterey, California:
http://summarizr.labs.eduserv.org.uk/?hashtag=intlib10
It gives you a basic idea of who’s talking, what’s being talked about, and (my favorite device): a URL extractor.
With the mass of web content increases, it’s becoming more necessarily to analyze and select content. I suspect that services like Summarizr will becoming increasingly popular. So unless it can work on your own PC/server, I prefer to keep it online with TwapperKeeper.
mark_sample - October 26, 2010 at 6:20 pm
@kosboot – Thanks for the Summarizr tip. It’s a neat tool, and it makes good use of TwapperKeeper data, though of course, on a very superficial “summary” level.
There’s no reason why one can’t archive Twitter streams on both a personal server and on TwapperKeeper, so that services can like Summarizr can be used. That is, in fact, what I do.
eszter - October 27, 2010 at 9:18 am
Do the above solutions allow you to download the tweets from any account or just your own? That is, do they have a password requirement? (I’m asking for purposes of analyzing tweets from certain accounts.)
I have found the instructions explained here helpful:
http://www.quickonlinetips.com/archives/2009/08/download-all-twitter-tweets/
(note also the comments that address some of its limitations)
kosboot - October 27, 2010 at 9:47 am
eszter: TwapperKeeper archives (and enables downloads) from a keyword or a hashtag, meaning that many people’s accounts will be included in the result. You should just go to the site and see the many kinds of archives that are stored there.
cogdog - October 27, 2010 at 11:08 am
I’m a fan of Backupify, which does several services backed up in the cloud; It does incremental twitter but also saves them all as a PDF http://www.backupify.com
Martin Hawksey has a WordPress plugin that can back up tweets by user name or tag and append them to a WP page http://www.rsc-ne-scotland.org.uk/mashe/twitterpad-plugin/
Maybe the sourceforge tool for archving to PDF
http://sourceforge.net/projects/twitter-to-pdf/
More here http://www.readwriteweb.com/archives/10_ways_to_archive_your_tweets.php (SweetCron is dead)
mhawksey - February 3, 2011 at 8:34 am
Whilst it isn’t your own server but a place where you arguably have the same level of control is Google Spreadsheets. In this post, I describe how you can use a Spreadsheet to automatically harvest tweets.
@cogdog Thanks for the plug ;)
Martin