I'm not sure my implementation is normal, but I get around 200k for a 100 page crawl, and around 5-10 seconds to search, most of which I believe is in reading the file back into Java data structures. But I manually saved and restored as opposed to using Java's built-in serialization process, which I believe stores a TON of redundant data, at least in this case. |