dillo is a very cool little browser written using gtk+. it makes me wonder how difficult it really is to write a standards-compliant html rendering engine. (unfortunately, dillo doesn't support css. but it is amazingly fast and the html rendering seems pretty solid.)
this article on unicode 3.0 line breaking would probably be useful if you were going to write an html rendering engine.