17 November 2007

Jason's blah blah blah

  • Configuration used for installing PHP5 on Mac OS X:
./configure --prefix=/usr/local/php5 --with-xml --with-zlib --with-gd --with-png-dir=/usr/local --with-jpeg-dir=/usr/local --with-apxs=/usr/sbin/apxs --with-freetype-dir=/usr/local/

This doco on how to install gd library on OS X is particularly helpful:
http://www.libgd.org/DOC_INSTALL_OSX

  • To validate a regular expression in the nutch conf files:
cat FILE-WITH-URLS | nutch net/nutch/net/RegexURLFilteror by calling "nutch net/nutch/net/RegexURLFilter" and entering the URL by hand.Everyline line beginning with a "+" ist accepted - a line with a "-" is accepted. For example: $ echo "http://www.nutch.org" | nutch net/nutch/net/RegexURLFilter run with heapsize 256 -Xmx256m 050202 173520 loadingfile:/home/nutch/nutch-0.7/conf/nutch-default.xml 050202 173520 loading file:/home/nutch/nutch-0.7/conf/nutch-site.xml 050202 173520 found resource regex-urlfilter

No comments: