It literally takes longer to write out the output file at this point than it takes to do all of the processing *including* reading the input file, and yet for some reason I'm not satisfied and want it to be even faster. Maybe I should just write it in C/C++ but that would change some other plans