Blogging Roller

Dave Johnson on social software, open source and Java

Above: a random selection of photos from my Flickr photo-stream.

Saturday Jan 11, 2003

Java HTML parsers.

The LinkbackExtractor that I posted yesterday uses the Swing HTML parser, which is built into Java, but there are other Java-based HTML parsers available. Erik Hatcher suggested the JTidy HTML parser and there is also the HTMLParser project on SourceForge. Know of any others?

Speaking of spam.

These guys are referer spammers: phplabs.com, openproxies.com, and lahostnet.com.