Been reading up on Ext4. I have so many duplicate files that when I de-dupe them I bump into hard link limits (EMLINK) in the file system at around 30,000 files.
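For anyone curious what handling that limit might look like in a de-duplicator, here is a minimal sketch, assuming a POSIX system. The names `LinkResult` and `replace_with_hard_link` and the `.pcdedupe-tmp` suffix are made up for illustration and are not taken from pcdedupe or rdfind. The only fact it relies on is that `link()` fails with `errno` set to `EMLINK` once a file has reached the file system's maximum link count.

```cpp
#include <cassert>
#include <cerrno>
#include <cstdio>
#include <string>
#include <unistd.h>

// Hypothetical result type, for illustration only.
enum class LinkResult { Linked, LinkLimitReached, Failed };

// Replace `duplicate` with a hard link to `original`.
// The link is created under a temporary name first, so the duplicate is
// only replaced once the new link is known to exist.
LinkResult replace_with_hard_link(const std::string& original,
                                  const std::string& duplicate) {
    assert(original != duplicate);  // linking a file to itself is a caller bug

    const std::string temp = duplicate + ".pcdedupe-tmp";  // assumed suffix
    if (::link(original.c_str(), temp.c_str()) != 0) {
        // EMLINK means `original` already has the maximum number of links.
        // A caller could then pick another copy as the new link target.
        return errno == EMLINK ? LinkResult::LinkLimitReached
                               : LinkResult::Failed;
    }
    if (std::rename(temp.c_str(), duplicate.c_str()) != 0) {
        ::unlink(temp.c_str());
        return LinkResult::Failed;
    }
    // If `duplicate` was already a link to the same file, rename() succeeds
    // without doing anything, so remove the temporary name. When the rename
    // did happen, this unlink fails harmlessly because temp no longer exists.
    ::unlink(temp.c_str());
    return LinkResult::Linked;
}
```

When the function reports `LinkLimitReached`, one option is to treat the current duplicate as a fresh "original" and link the remaining copies to it instead.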
Author Archives: Jay Jay
Pcdedupe
I’m working on a new ProgClub project called pcdedupe. It’s a file system de-duplicator and it’s a C++ system based on rdfind. I haven’t created the project page on the wiki yet, but the source code is available.
Basically I’m going to take a new angle on the rdfind software and tailor it to suit my particular environment (I have ten million files with massive duplication and rdfind isn’t optimised for that kind of scale).
I’ve set up a new file server
I’ve been having some fun over the last day or two looking over all my old files. I’ve got files that go back as far as 1999 in my archives. I’ve found my old blog database and associated files, so I hope to get that back up again soon, and I found some old code that I’ve been looking for (I don’t want to have to write it again!).
So my new file server has 6TB of storage as 3 x 2TB partitions. I can fit all my data in 1.3TB of space, so I’m planning to have one file share, and then a backup of that onto another partition. I have 10,174,633 files in my archive folder, and many more in my media, download and home folders. I might publish some more stats once du -s has finished processing. :)
I’m running Ubuntu 10.04 LTS Server as my file server. I tried to set up the Desktop version but it wouldn’t play nice with my nVidia graphics card.
Windows SDK 7.1
I followed these instructions from Mozilla to download the latest version of the Windows SDK, which is installing as we speak. I didn’t install the .NET Framework Version 4; I hope that doesn’t create a problem for me.
Boost C++ library
I’ve downloaded the Boost C++ library and my little adventure with it will begin as soon as the files have copied.
dirent.h
I’m doing some Linux C++ programming, and I’m doing the dev work on Windows, so I figured I’d have a go at compiling in Visual Studio (I’m running VS2008). One problem I had was that there was no dirent.h header file, but I found one.
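For context, this is roughly the kind of code that needs the header. It is a minimal sketch rather than anything from my project: `list_directory` is a made-up name, and the code sticks to `opendir`, `readdir` and `closedir`, on the assumption that a Windows replacement for dirent.h provides those three. It also avoids C++11 features so that an older compiler like VS2008 has a chance with it.

```cpp
#include <cassert>
#include <cstddef>
#include <dirent.h>
#include <string>
#include <vector>

// Return the names of the entries in `path`, skipping "." and "..".
// Returns an empty list if the directory cannot be opened.
std::vector<std::string> list_directory(const std::string& path) {
    assert(!path.empty());  // an empty path is a caller bug

    std::vector<std::string> names;
    DIR* dir = opendir(path.c_str());
    if (dir == NULL) {
        return names;
    }
    // readdir() returns NULL at the end of the directory (or on error).
    while (struct dirent* entry = readdir(dir)) {
        const std::string name = entry->d_name;
        if (name != "." && name != "..") {
            names.push_back(name);
        }
    }
    closedir(dir);
    return names;
}
```

Returning an empty list on failure keeps the sketch short; real code would probably want to tell "empty directory" apart from "could not open".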
Making Subversion/SVN recognize CVS Id and Revision tags
Today I found the article “Making Subversion/SVN recognize CVS Id and Revision tags”, which describes how to add support for Id and Revision tags in Subversion.
Google webmaster tools
I decided to have a look at the Google Webmaster Tools. I’ve set up accounts for jsphp.co and www.progclub.org. So far I’m not very impressed. Maybe it’s because the accounts are new, but there is basically no data for any of my sites, so it’s not very useful yet. I guess I’ll just have to wait and see whether any data gets loaded.
PHP krsort
Today I used the PHP function krsort for the first time. It sorts an array by key in reverse order, which is exactly what I needed to get my version listbox options in order from latest to earliest.
Doctrine Bidirectional Many-To-Many Associations
I found out how to do a Many-To-Many, Bidirectional mapping with Doctrine in PHP.