Let’s collaborate on democratizing insights from tabular data in Amsterdam! ✨
PhD directions: 1) fundamental techniques for tabular foundation models, 2) reliable mechanisms for AI-powered tabular data analysis.
Sharing w/ friends appreciated! ⬇️
Today's funny tokenization realization: The GPT4 tokenizer has 1 single token for the entire lower- and uppercase alphabet.
Logitech forgot to renew its server certificate, so now my mouse scrolls in the opposite direction, and I'm unable to move between windows. What a time to be alive.
The winner is '.translatesAutoresizingMaskIntoConstraints', with 42 characters in a single token. OpenAI really values iOS development I guess 😅
IRLab Amsterdam made it to Bluesky! Go give @irlab-amsterdam.bsky.social a follow for all things RecSys, IR, RAG, Conv AI, etc!
A read that resonates. I keep coming back to the thought of how curation by algorithms fits into this idea of the web. If someone has some good reads on this, let me know! The group at serendipityengine.be is one great example imo.