//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
Profile
Loading...
๐Ÿ“š Researcher โ€ข ๐Ÿ’ป Developer โ€ข ๐Ÿ‡ช๐Ÿ‡บ European PhD student for health-related information retrieval at @uni-jena.de ร— @webis.de
Jan Heinrich Merker









Loading...
Mar 3, 2025
๐Ÿšจย New Pre-Print! ๐Ÿšจย Reviewer 2 has once again asked for DLโ€™19, what can you say in rebuttal? ย To help, we have re-annotated DLโ€™19. Work done with @maik_froebe.bsky.social, @hscells.bsky.social, @fschlatt1.bsky.social, Guglielmo Faggioli, Saber Zerhoudi, @macavaney.bsky.social, Eugene Yang ๐Ÿงต
I've already collected my first two stickers for the @ecir2026.eu Collab-a-thon. If you're in Delft today, don't muss out the first collab-a-thon session at 4pm in LAB.115 ๐Ÿค #collab-a-thon #collaboration #research #ecir
We just released "German Commons", the largest openly-licensed German text dataset for LLM training: 154B tokens with clear usage rights for research and commercial use. huggingface.co/datasets/coral-nlp/german-commons
Decades from now, the Covid-19 pandemic will be visible in the historical data of nearly anything measurable today. Hereโ€™s an incomplete collection of charts that capture that break โ€” across the economy, health care, education, work, family life and more.
2mo
Honored to win the ICTIR Best Paper Honorable Mention Award for "Axioms for Retrieval-Augmented Generation"! Our new axioms are integrated with ir_axioms: github.com/webis-de/ir_... Nice to see axiomatic IR gaining momentum.
Happy to share that our paper "The Viability of Crowdsourcing for RAG Evaluation" received the Best Paper Honourable Mention at #SIGIR2025! Very grateful to the community for recognizing our work on improving RAG evaluation. ย ๐Ÿ“„ webis.de/publications...
7mo
Mar 10, 2025
11mo
11mo
We presented two papers at ICTIR 2025 today: - Axioms for Retrieval-Augmented Generation webis.de/publications... - Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins webis.de/publications...
Weโ€™re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
coral-nlp/german-commons ยท Datasets at Hugging Face
Jan Heinrich Merker
30 Charts That Show How Everything Changed in March 2020
It can be easy to forget, or look away from, the pain and disruption of the pandemic. The numbers will be there to remind us.
www.nytimes.com
11mo
The New York Times
Lets replace search with "AI" then! Totally logical if you ask me. Even more worth it when you know they're exponentially overtaking the airline industry in their carbon footprint. Study: www.cjr.org/tow_center/w...
Webis Group
Webis Group
What a team of keynote speakers. I must confess seeing that Steve Robertson will be there is a thrill. One of the legends of information retrieval reflecting on the field. #sigir2025 sigir2025.dei.unipd.it/keynote-spea...
Webis Group
Mar 12, 2025
Webis Group
Dec 24, 2024
The SIGIR 2025 keynotes are held by esteemed speakers: Robertson S., Gurevych I. and Frieder O., who will cover topics that range from AI in medical search and ecommendation to BM25 and probabilistic ...
SIGIR 2025, Padua, 13-18 July | Keynotes
sigir2025.dei.unipd.it
New preprint of WSDM demo by @maik_froebe @matthias and Ferdinand Schlatt Lightning IR: Straightforward Fine-tuning and Inference of Transformer-based Language Models for Information Retrieval https://arxiv.org/abs/2411.04677 https://webis.de/lightning-ir/
Dec 19, 2024
Timnit Gebru
Lightning IR: Straightforward Fine-tuning and Inference of Transformer-based Language Models for Information Retrieval
A wide range of transformer-based language models have been proposed for information retrieval tasks. However, including transformer-based models in retrieval pipelines is often complex and requires substantial engineering effort. In this paper, we introduce Lightning IR, an easy-to-use PyTorch Lightning-based framework for applying transformer-based language models in retrieval scenarios. Lightning IR provides a modular and extensible architecture that supports all stages of a retrieval pipeline: from fine-tuning and indexing to searching and re-ranking. Designed to be scalable and reproducible, Lightning IR is available as open-source: https://github.com/webis-de/lightning-ir.
arxiv.org
Arjen P. de Vries Timmers ๐Ÿ•Š๏ธ