🚨🚨🚨Take part in the AI Coach: Fitness challenge and the Low Power Computer Vision Challenge @ CVPR 2026
🎯Both challenges use the Qualcomm Exercise Video Dataset (QEVD) dataset.
👉Quick start guides and sample solutions: apratimbh.github.io/whatandwhen/
@cvprconference.bsky.social
🚨🚨🚨 We are now accepting submissions!
🚨Submission are now open!
📣📣📣Our team at Qualcomm AI Research is hiring Research Interns for Summer 2026 in Toronto to work on multi-modal LLMs and embodied AI.
👉Apply here:
1) Embodied AI:
qualcomm.wd12.myworkdayjobs.com/External/job...
2) Multi-modal LLMs:
🚨Submit by 1st May @cvprconference.bsky.social: extended abstracts on streaming vision-language models, real-time activity understanding, grounding, ego-centric video understanding, language and robot learning. Contributions are encouraged to include a demo!
👉Details: varworkshop.github.io/calls/
📅 Join us next week for the 2nd VAR Workshop at
CVPR 2026: June 3rd, 2026 from 8:30am to 1pm.
🎯 We are hosting an exciting line-line of speakers: Katerina Fragkiadaki, Wenhu Chen, Michael S. Ryoo, Ziwei Liu, Yao Qin, @vicenteor.bsky.social
👉Schedule/Papers: varworkshop.github.io/schedule/
Call for Participation: We're excited to announce a challenge focused on developing AI assistants that can guide users through workout sessions with intelligent feedback!
🚨The winning teams will receive a prize along with a contributed talk. 🚨
Website: varworkshop.github.io/challenges/
Call for Participation @cvprconference.bsky.social: Multi-Modal LLMs - prepare to engage in a dynamic, face-to-face conversation with a real human user!
Details: varworkshop.github.io/challenges/
🚨🚨🚨The winning teams will receive a prize and a contributed talk.
P.S. GPT-4o does not do too well.
Call for Papers and Demos @cvprconference.bsky.social: on topics such as streaming vision-language models, real-time activity understanding, grounding, ego-centric video understanding, language and robot learning. Contributions are encouraged to include a demo!
Link: varworkshop.github.io/calls/
Open-source fueled the LLM revolution, but Physical AI hasn't fully benefited from this flywheel yet. Today, we're launching kesai.eu, our mission to democratize robotics research! First milestone: training a frontier-level self-driving policy using significantly less data than typically required.
Apratim Bhattacharyya
Apratim Bhattacharyya
Apratim Bhattacharyya
Apratim Bhattacharyya
Apratim Bhattacharyya
Apratim Bhattacharyya
Vision-based Assistants in the Real-World
Vision-based Assistants in the Real-World
Vision-based Assistants in the Real-World
KE:SAI is a Franco-German non-profit open science lab for scalable autonomous intelligence.
Company: Qualcomm Canada ULC Job Area: Interns Group, Interns Group > Interim Engineering Intern - SW Qualcomm Overview: Qualcomm is a company of inventors that unlocked 5G ushering in an age of ra...
Call for Papers and Demos @cvprconference.bsky.social: on topics such as streaming vision-language models, real-time activity understanding, grounding, ego-centric video understanding, language and robot learning. Contributions are encouraged to include a demo!
Link: varworkshop.github.io/calls/
Call for Participation @cvprconference.bsky.social: Multi-Modal LLMs - prepare to engage in a dynamic, face-to-face conversation with a real human user!
Details: varworkshop.github.io/challenges/
🚨🚨🚨The winning teams will receive a prize and a contributed talk.
P.S. GPT-4o does not do too well.