Day 2 of Dream Lab @pricelab.bsky.social complete! Today, we covered OCR and HTR and how to process unstructured data (texts) into structured data through Tf-Idf. After that, we learned 3 methods to understand similarities between texts using Tf-Idf, LDA (topic modeling), and XML tagging!