Assistant professor of social psychology at Purdue. Obsessed with my cat Kartoffel. Recruiting a grad student for Fall 2026!
Thekla Morgenroth
8/n Of course SCORES also has it's limitations. E.g. it doesn't work great for long responses. We include this guide to help researchers decide whether or not they want to use SCORES:
9/9 I love working with open responses because they can provide insights that established scales often don't. But not everyone has an army of research assistants or programming experience. We hope that SCORES will make the use of open-response data in research more accessible.
Our lab has a new postdoc position that is ideal for someone interested in a career in data science or statistical consulting and an interest in gender diversity / trans health. apply.interfolio.com/182278
3/n SCORES clusters responses via word embeddings (which reflect similarity in meaning), similar to the process of reading through the responses, creating coding categories, and having human coders assign each response to a category.
7/n It also shows different quality indices. It weighs them by default to select the ideal cluster number, but you can also use this view to make your own decision.
1/n I'm really excited to share this (open access) paper in which we introduce SCORES (Semantic Clustering of Open Responses via Embedding Similarity) - a user-friendly tool to analyze (short) open-response data. journals.sagepub.com/doi/full/10.... With the magical @bpaassen.bsky.social.
4/n SCORES has an intuitive, user-friendly interface (no programming knowledge required) and a tutorial mode that can guide researchers through the decision processes. You simply upload your data and select which columns you want to include in your analysis:
6/n SCORES automatically names the clusters based on the most central response and shows you all responses that fall into the cluster. You can of course also re-name the cluster if you don't like the default name.
5/n Researchers then have a lot of flexibility, e.g. you can exclude certain words, set min and max number of clusters, use outlier detection (and be more or less exclusionary with outliers), and merge similar clusters (and decide how similar they have to be to be merged).
2/n Have you ever read a cool paper that uses open response data & analyzes it using word embeddings (e.g. psycnet.apa.org/buy/2022-585... by @gandalfnicolas.bsky.social, a paper I LOVE) but were too intimidated by the methodology to do something similar? If so, SCORES might be for you!