Linux

Automating Sequence Handling (Staden's pregap4 and gap4)

Some time ago I wrote a script that tries to automate most of the work involved in getting sequences from our ABI sequencer into a gap4 database. The script is far from perfect and looks like this:

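#!/bin/sh
# Remove output left over from a previous run (-f: no error if nothing matches)
rm -f *.seq *.log "Log file.txt"
# Truncate each trace file name to its first 27 characters and re-add the .ab1 extension
for i in *.ab1; do
    echo "Renaming $i"
    mv "$i" "$(echo "$i" | sed 's/\(.\{27\}\).*/\1.ab1/')"
done
# Write the list of input files, run pregap4 on that list, then open the database in gap4
ls *.ab1 *.txt > tracefile.list
pregap4 -nowin -config /home/jeltsch/bin/pregap4_gap4.conf -fofn tracefile.list
gap4 test.0.aux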
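The renaming step is the least obvious part of the script, so here is a small sketch that isolates the sed expression. The function name `shorten_name` and the sample file names are made up for illustration; only the sed expression itself comes from the script.

```shell
#!/bin/sh
# shorten_name NAME: keep the first 27 characters of NAME and append ".ab1".
# (Hypothetical helper; it only wraps the sed expression used in the script.)
shorten_name() {
    printf '%s\n' "$1" | sed 's/\(.\{27\}\).*/\1.ab1/'
}

# A long name is cut down to its first 27 characters plus the extension.
shorten_name "abcdefghijklmnopqrstuvwxyz0123456789.ab1"
# prints: abcdefghijklmnopqrstuvwxyz0.ab1

# A name shorter than 27 characters does not match the pattern and stays as it is.
shorten_name "short.ab1"
# prints: short.ab1
```

One thing to keep in mind: a name that is exactly 27 characters long (extension included) still matches the pattern, so it ends up with a second ".ab1" appended.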

Installation of Duplicate File Finder

I was tidying up the 50,000+ files in my home directory and realized that I had multiple copies of the same file in different directories. I therefore looked for a convenient way to find and delete those duplicates.
Several applications do something like this, but none is perfect. For example, two MP3 files of the same song with different ID3 tags are treated as different files by most programs, although they are essentially identical apart from the metadata in the tag.
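To illustrate what "finding duplicates" usually means in such tools, here is a minimal sketch that compares files by checksum. It is not how any particular duplicate finder works; the function name `find_dupes` is made up, and the sketch assumes GNU coreutils (`md5sum` and the `-w` and `--all-repeated` options of `uniq`).

```shell
#!/bin/sh
# find_dupes DIR: print groups of files below DIR that have identical contents.
# Assumption: GNU coreutils are installed (md5sum, uniq -w/--all-repeated).
find_dupes() {
    # 1. Compute an MD5 checksum for every regular file.
    # 2. Sort so that identical checksums end up on adjacent lines.
    # 3. Compare only the first 32 characters (the checksum) and print
    #    every line that has a duplicate, with a blank line between groups.
    find "$1" -type f -exec md5sum {} + |
        sort |
        uniq -w32 --all-repeated=separate
}
```

Running `find_dupes "$HOME"` would only list candidates; nothing is deleted. Because the comparison is byte for byte, this approach has exactly the limitation described above: two MP3 files that differ only in their ID3 tags get different checksums and are not reported.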
