//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
Profile
Loading...









Loading...
šŸ›ļøMajor AI companies are increasingly embedding sponsored content into chatbot conversations. Across two preregistered experiments (N=2,012), we test how effectively AI can steer consumers toward sponsored products in a realistic shopping scenario. šŸ“https://arxiv.org/abs/2604.04263
2mo
Good work from @hayoungjung.bsky.social and @manoelhortaribeiro.bsky.social Scientific AI agents are actively being deployed to synthesize clinical conclusions, but their factual accuracy remains remarkably low. #MedSky šŸ”— Direct link: arxiv.org/pdf/2606.11337
šŸ“Excited to share our new preprint, ā€œAI Assistance for Discretionary Work: Increasing Feedback Provision in Higher Educationā€: arxiv.org/abs/2606.03095 A thread 🧵 1/8
2d
10d
Deepfake pornography isn’t going away just because we are passing laws and taking down a couple of big websites. Our new pre-print, led by @aedcv.bsky.social suggests that the sharing of this material continued to prosper even after platform and policy shocks. arxiv.org/abs/2602.02754
Francesco Salvi
arxiv.org
Scott McGrath
4mo
First paper of my PhD with my amazing advisors! There’s been a ton of hype and media coverage on OpenEvidence as an ā€œAI co-pilot for cliniciansā€ā€¦ and our long-horizon benchmark puts them to the test!! Our results suggest they are far from reliable for downstream use.
Romina Mahinpei
One thing we also didn’t expect while building this benchmark: AI agents kept ā€œcheatingā€ Even when told not to, they searched the web for ground-truth answers. So we built a clean-room harness to filter answer-leaking results. We’re now exploring this more deeply in follow-up workšŸ‘€
Broadly interested in computational social science, AI safety & evaluation, NLP for social good & applications (in public health, science...)! Happy to chat or grab coffee at the conference! Feel free to DM me :)
First paper of my PhD with my amazing advisors! There’s been a ton of hype and media coverage on OpenEvidence as an ā€œAI co-pilot for cliniciansā€ā€¦ and our long-horizon benchmark puts them to the test!! Our results suggest they are far from reliable for downstream use.
I am at #EMNLP2025šŸ‡ØšŸ‡³ to present our main paper *MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform*! Come by to discuss details! šŸ¦ Location: Hall C ā²ļøTime: 11AM-12:30PM šŸ”— Paper: aclanthology.org/2025.emnlp-m... šŸ“ Repo: github.com/hayoungjungg...
2d
2d
7mo
New preprint! We introduce a new benchmark, SciConBench, with 9.11k scientific questions derived from Cochrane Systematic Reviews. We find evidence that frontier AI agents **cannot** synthesize scientific conclusions well. A thread 🧵 w/ @hayoungjung.bsky.social & others!
2d
Whoa, excellent study just dropped in Science! "Reranking partisan animosity in algorithmic social media feeds alters affective polarization" www.science.org/doi/10.1126/... Led by @tiziano.bsky.social and @msaveski.bsky.social
7mo
2d
Manoel Horta Ribeiro
6mo