default search action
Aniruddha Kembhavi
Person information
- affiliation: AI2, Allen Institute for Artificial Intelligence, Seattle, US
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j6]Adyasha Maharana, Amita Kamath, Christopher Clark, Mohit Bansal, Aniruddha Kembhavi:
Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models. Trans. Mach. Learn. Res. 2024 (2024) - [c61]Kalyani Marathe, Mahtab Bigverdi, Nishat Khan, Tuhin Kundu, Patrick Howe, Sharan Ranjit S, Anand Bhattad, Aniruddha Kembhavi, Linda G. Shapiro, Ranjay Krishna:
MIMIC: Masked Image Modeling with Image Correspondences. CVPR Workshops 2024: 718-727 - [c60]Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna:
Iterated Learning Improves Compositionality in Large Vision-Language Models. CVPR 2024: 13785-13795 - [c59]Minyoung Hwang, Luca Weihs, Chanwoo Park, Kimin Lee, Aniruddha Kembhavi, Kiana Ehsani:
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences. CVPR 2024: 16216-16226 - [c58]Kiana Ehsani, Tanmay Gupta, Rose Hendrix, Jordi Salvador, Luca Weihs, Kuo-Hao Zeng, Kunal Pratap Singh, Yejin Kim, Winson Han, Alvaro Herrasti, Ranjay Krishna, Dustin Schwenk, Eli VanderBilt, Aniruddha Kembhavi:
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World. CVPR 2024: 16238-16250 - [c57]Ram Ramrakhya, Aniruddha Kembhavi, Dhruv Batra, Zsolt Kira, Kuo-Hao Zeng, Luca Weihs:
Seeing the Unseen: Visual Common Sense for Semantic Placement. CVPR 2024: 16273-16283 - [c56]Yue Yang, Fan-Yun Sun, Luca Weihs, Eli VanderBilt, Alvaro Herrasti, Winson Han, Jiajun Wu, Nick Haber, Ranjay Krishna, Lingjie Liu, Chris Callison-Burch, Mark Yatskar, Aniruddha Kembhavi, Christopher Clark:
Holodeck: Language Guided Generation of 3D Embodied AI Environments. CVPR 2024: 16277-16287 - [c55]Jiasen Lu, Christopher Clark, Sangho Lee, Zichen Zhang, Savya Khosla, Ryan Marten, Derek Hoiem, Aniruddha Kembhavi:
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action. CVPR 2024: 26429-26445 - [c54]Ainaz Eftekhar, Kuo-Hao Zeng, Jiafei Duan, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna:
Selective Visual Representations Improve Convergence and Generalization for Embodied AI. ICLR 2024 - [c53]Abby O'Neill, Abdul Rehman, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alexander Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, Anthony Brohan, Antonin Raffin, Archit Sharma, Arefeh Yavary, Arhan Jain, Ashwin Balakrishna, Ayzaan Wahid, Ben Burgess-Limerick, Beomjoon Kim, Bernhard Schölkopf, Blake Wulfe, Brian Ichter, Cewu Lu, Charles Xu, Charlotte Le, Chelsea Finn, Chen Wang, Chenfeng Xu, Cheng Chi, Chenguang Huang, Christine Chan, Christopher Agia, Chuer Pan, Chuyuan Fu, Coline Devin, Danfei Xu, Daniel Morton, Danny Driess, Daphne Chen, Deepak Pathak, Dhruv Shah, Dieter Büchler, Dinesh Jayaraman, Dmitry Kalashnikov, Dorsa Sadigh, Edward Johns, Ethan Paul Foster, Fangchen Liu, Federico Ceola, Fei Xia, Feiyu Zhao, Freek Stulp, Gaoyue Zhou, Gaurav S. Sukhatme, Gautam Salhotra, Ge Yan, Gilbert Feng, Giulio Schiavi, Glen Berseth, Gregory Kahn, Guanzhi Wang, Hao Su, Haoshu Fang, Haochen Shi, Henghui Bao, Heni Ben Amor, Henrik I. Christensen, Hiroki Furuta, Homer Walke, Hongjie Fang, Huy Ha, Igor Mordatch, Ilija Radosavovic, Isabel Leal, Jacky Liang, Jad Abou-Chakra, Jaehyung Kim, Jaimyn Drake, Jan Peters, Jan Schneider, Jasmine Hsu, Jeannette Bohg, Jeffrey Bingham, Jeffrey Wu, Jensen Gao, Jiaheng Hu, Jiajun Wu, Jialin Wu, Jiankai Sun, Jianlan Luo, Jiayuan Gu, Jie Tan, Jihoon Oh, Jimmy Wu, Jingpei Lu, Jingyun Yang, Jitendra Malik, João Silvério, Joey Hejna, Jonathan Booher, Jonathan Tompson, Jonathan Yang, Jordi Salvador, Joseph J. Lim, Junhyek Han, Kaiyuan Wang, Kanishka Rao, Karl Pertsch, Karol Hausman, Keegan Go, Keerthana Gopalakrishnan, Ken Goldberg, Kendra Byrne, Kenneth Oslund, Kento Kawaharazuka, Kevin Black, Kevin Lin, Kevin Zhang, Kiana Ehsani, Kiran Lekkala, Kirsty Ellis, Krishan Rana, Krishnan Srinivasan, Kuan Fang, Kunal Pratap Singh, Kuo-Hao Zeng, Kyle Hatch, Kyle Hsu, Laurent Itti, Lawrence Yunliang Chen, Lerrel Pinto, Li Fei-Fei, Liam Tan, Linxi Jim Fan, Lionel Ott, Lisa Lee, Luca Weihs, Magnum Chen, Marion Lepert, Marius Memmel, Masayoshi Tomizuka, Masha Itkina, Mateo Guaman Castro, Max Spero, Maximilian Du, Michael Ahn, Michael C. Yip, Mingtong Zhang, Mingyu Ding, Minho Heo, Mohan Kumar Srirama, Mohit Sharma, Moo Jin Kim, Naoaki Kanazawa, Nicklas Hansen, Nicolas Heess, Nikhil J. Joshi, Niko Sünderhauf, Ning Liu, Norman Di Palo, Nur Muhammad (Mahi) Shafiullah, Oier Mees, Oliver Kroemer, Osbert Bastani, Pannag R. Sanketi, Patrick Tree Miller, Patrick Yin, Paul Wohlhart, Peng Xu, Peter David Fagan, Peter Mitrano, Pierre Sermanet, Pieter Abbeel, Priya Sundaresan, Qiuyu Chen, Quan Vuong, Rafael Rafailov, Ran Tian, Ria Doshi, Roberto Martín-Martín, Rohan Baijal, Rosario Scalise, Rose Hendrix, Roy Lin, Runjia Qian, Ruohan Zhang, Russell Mendonca, Rutav Shah, Ryan Hoque, Ryan Julian, Samuel Bustamante, Sean Kirmani, Sergey Levine, Shan Lin, Sherry Moore, Shikhar Bahl, Shivin Dass, Shubham D. Sonawani, Shuran Song, Sichun Xu, Siddhant Haldar, Siddharth Karamcheti, Simeon Adebola, Simon Guist, Soroush Nasiriany, Stefan Schaal, Stefan Welker, Stephen Tian, Subramanian Ramamoorthy, Sudeep Dasari, Suneel Belkhale, Sungjae Park, Suraj Nair, Suvir Mirchandani, Takayuki Osa, Tanmay Gupta, Tatsuya Harada, Tatsuya Matsushima, Ted Xiao, Thomas Kollar, Tianhe Yu, Tianli Ding, Todor Davchev, Tony Z. Zhao, Travis Armstrong, Trevor Darrell, Trinity Chung, Vidhi Jain, Vincent Vanhoucke, Wei Zhan, Wenxuan Zhou, Wolfram Burgard, Xi Chen, Xiaolong Wang, Xinghao Zhu, Xinyang Geng, Xiyuan Liu, Liangwei Xu, Xuanlin Li, Yao Lu, Yecheng Jason Ma, Yejin Kim, Yevgen Chebotar, Yifan Zhou, Yifeng Zhu, Yilin Wu, Ying Xu, Yixuan Wang, Yonatan Bisk, Yoonyoung Cho, Youngwoon Lee, Yuchen Cui, Yue Cao, Yueh-Hua Wu, Yujin Tang, Yuke Zhu, Yunchu Zhang, Yunfan Jiang, Yunshuang Li, Yunzhu Li, Yusuke Iwasawa, Yutaka Matsuo, Zehan Ma, Zhuo Xu, Zichen Jeff Cui, Zichen Zhang, Zipeng Lin:
Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration. ICRA 2024: 6892-6903 - [i69]Ram Ramrakhya, Aniruddha Kembhavi, Dhruv Batra, Zsolt Kira, Kuo-Hao Zeng, Luca Weihs:
Seeing the Unseen: Visual Common Sense for Semantic Placement. CoRR abs/2401.07770 (2024) - [i68]Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna:
Iterated Learning Improves Compositionality in Large Vision-Language Models. CoRR abs/2404.02145 (2024) - [i67]Duong H. Le, Tuan Pham, Aniruddha Kembhavi, Stephan Mandt, Wei-Chiu Ma, Jiasen Lu:
Preserving Identity with Variational Score for General-purpose 3D Editing. CoRR abs/2406.08953 (2024) - [i66]Jieyu Zhang, Weikai Huang, Zixian Ma, Oscar Michel, Dong He, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna:
Task Me Anything. CoRR abs/2406.11775 (2024) - [i65]Tanmay Gupta, Luca Weihs, Aniruddha Kembhavi:
CodeNav: Beyond tool-use to using real-world codebases with LLM agents. CoRR abs/2406.12276 (2024) - [i64]Kuo-Hao Zeng, Zichen Zhang, Kiana Ehsani, Rose Hendrix, Jordi Salvador, Alvaro Herrasti, Ross B. Girshick, Aniruddha Kembhavi, Luca Weihs:
PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators. CoRR abs/2406.20083 (2024) - [i63]Jiaheng Hu, Rose Hendrix, Ali Farhadi, Aniruddha Kembhavi, Roberto Martin Martin, Peter Stone, Kuo-Hao Zeng, Kiana Ehsani:
FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning. CoRR abs/2409.16578 (2024) - [i62]Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, Yen-Sung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Jen Dumas, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross B. Girshick, Ali Farhadi, Aniruddha Kembhavi:
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models. CoRR abs/2409.17146 (2024) - 2023
- [j5]Matthew Wallingford, Aditya Kusupati, Keivan Alizadeh-Vahid, Aaron Walsman, Aniruddha Kembhavi, Ali Farhadi:
FLUID: A Unified Evaluation Framework for Flexible Sequential Data. Trans. Mach. Learn. Res. 2023 (2023) - [c52]Matt Deitke, Rose Hendrix, Ali Farhadi, Kiana Ehsani, Aniruddha Kembhavi:
Phone2Proc: Bringing Robust Robots into Our Chaotic World. CVPR 2023: 9665-9675 - [c51]Matt Deitke, Dustin Schwenk, Jordi Salvador, Luca Weihs, Oscar Michel, Eli VanderBilt, Ludwig Schmidt, Kiana Ehsani, Aniruddha Kembhavi, Ali Farhadi:
Objaverse: A Universe of Annotated 3D Objects. CVPR 2023: 13142-13153 - [c50]Hao Zhu, Raghav Kapoor, So Yeon Min, Winson Han, Jiatai Li, Kaiwen Geng, Graham Neubig, Yonatan Bisk, Aniruddha Kembhavi, Luca Weihs:
EXCALIBUR: Encouraging and Evaluating Embodied Exploration. CVPR 2023: 14931-14942 - [c49]Tanmay Gupta, Aniruddha Kembhavi:
Visual Programming: Compositional visual reasoning without training. CVPR 2023: 14953-14962 - [c48]Sophia Gu, Christopher Clark, Aniruddha Kembhavi:
I can't believe there's no images! : Learning Visual Tasks Using Only Language Supervision. ICCV 2023: 2672-2683 - [c47]Kunal Pratap Singh, Jordi Salvador, Luca Weihs, Aniruddha Kembhavi:
Scene Graph Contrastive Learning for Embodied Navigation. ICCV 2023: 10850-10860 - [c46]Favyen Bastani, Piper Wolters, Ritwik Gupta, Joe Ferdinando, Aniruddha Kembhavi:
SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding. ICCV 2023: 16726-16736 - [c45]Jiasen Lu, Christopher Clark, Rowan Zellers, Roozbeh Mottaghi, Aniruddha Kembhavi:
UNIFIED-IO: A Unified Model for Vision, Language, and Multi-modal Tasks. ICLR 2023 - [c44]Matthew Wallingford, Aditya Kusupati, Alex Fang, Vivek Ramanujan, Aniruddha Kembhavi, Roozbeh Mottaghi, Ali Farhadi:
Neural Radiance Field Codebooks. ICLR 2023 - [c43]Matt Deitke, Ruoshi Liu, Matthew Wallingford, Huong Ngo, Oscar Michel, Aditya Kusupati, Alan Fan, Christian Laforte, Vikram Voleti, Samir Yitzhak Gadre, Eli VanderBilt, Aniruddha Kembhavi, Carl Vondrick, Georgia Gkioxari, Kiana Ehsani, Ludwig Schmidt, Ali Farhadi:
Objaverse-XL: A Universe of 10M+ 3D Objects. NeurIPS 2023 - [c42]Cheng-Yu Hsieh, Jieyu Zhang, Zixian Ma, Aniruddha Kembhavi, Ranjay Krishna:
SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality. NeurIPS 2023 - [c41]Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Aniruddha Kembhavi, Tanmay Gupta:
OBJECT 3DIT: Language-guided 3D-aware Image Editing. NeurIPS 2023 - [c40]Matthew Wallingford, Vivek Ramanujan, Alex Fang, Aditya Kusupati, Roozbeh Mottaghi, Aniruddha Kembhavi, Ludwig Schmidt, Ali Farhadi:
Neural Priming for Sample-Efficient Adaptation. NeurIPS 2023 - [i61]Matthew Wallingford, Aditya Kusupati, Alex Fang, Vivek Ramanujan, Aniruddha Kembhavi, Roozbeh Mottaghi, Ali Farhadi:
Neural Radiance Field Codebooks. CoRR abs/2301.04101 (2023) - [i60]Adyasha Maharana, Amita Kamath, Christopher Clark, Mohit Bansal, Aniruddha Kembhavi:
Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models. CoRR abs/2303.16133 (2023) - [i59]Matthew Wallingford, Vivek Ramanujan, Alex Fang, Aditya Kusupati, Roozbeh Mottaghi, Aniruddha Kembhavi, Ludwig Schmidt, Ali Farhadi:
Neural Priming for Sample-Efficient Adaptation. CoRR abs/2306.10191 (2023) - [i58]Cheng-Yu Hsieh, Jieyu Zhang, Zixian Ma, Aniruddha Kembhavi, Ranjay Krishna:
SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality. CoRR abs/2306.14610 (2023) - [i57]Kalyani Marathe, Mahtab Bigverdi, Nishat Khan, Tuhin Kundu, Aniruddha Kembhavi, Linda G. Shapiro, Ranjay Krishna:
MIMIC: Masked Image Modeling with Image Correspondences. CoRR abs/2306.15128 (2023) - [i56]Matt Deitke, Ruoshi Liu, Matthew Wallingford, Huong Ngo, Oscar Michel, Aditya Kusupati, Alan Fan, Christian Laforte, Vikram Voleti, Samir Yitzhak Gadre, Eli VanderBilt, Aniruddha Kembhavi, Carl Vondrick, Georgia Gkioxari, Kiana Ehsani, Ludwig Schmidt, Ali Farhadi:
Objaverse-XL: A Universe of 10M+ 3D Objects. CoRR abs/2307.05663 (2023) - [i55]Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Aniruddha Kembhavi, Tanmay Gupta:
OBJECT 3DIT: Language-guided 3D-aware Image Editing. CoRR abs/2307.11073 (2023) - [i54]Ainaz Eftekhar, Kuo-Hao Zeng, Jiafei Duan, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna:
Selective Visual Representations Improve Convergence and Generalization for Embodied AI. CoRR abs/2311.04193 (2023) - [i53]Piper Wolters, Favyen Bastani, Aniruddha Kembhavi:
Zooming Out on Zooming In: Advancing Super-Resolution for Remote Sensing. CoRR abs/2311.18082 (2023) - [i52]Kiana Ehsani, Tanmay Gupta, Rose Hendrix, Jordi Salvador, Luca Weihs, Kuo-Hao Zeng, Kunal Pratap Singh, Yejin Kim, Winson Han, Alvaro Herrasti, Ranjay Krishna, Dustin Schwenk, Eli VanderBilt, Aniruddha Kembhavi:
Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World. CoRR abs/2312.02976 (2023) - [i51]Ruihan Yang, Yejin Kim, Aniruddha Kembhavi, Xiaolong Wang, Kiana Ehsani:
Harmonic Mobile Manipulation. CoRR abs/2312.06639 (2023) - [i50]Yue Yang, Fan-Yun Sun, Luca Weihs, Eli VanderBilt, Alvaro Herrasti, Winson Han, Jiajun Wu, Nick Haber, Ranjay Krishna, Lingjie Liu, Chris Callison-Burch, Mark Yatskar, Aniruddha Kembhavi, Christopher Clark:
Holodeck: Language Guided Generation of 3D Embodied AI Environments. CoRR abs/2312.09067 (2023) - [i49]Minyoung Hwang, Luca Weihs, Chanwoo Park, Kimin Lee, Aniruddha Kembhavi, Kiana Ehsani:
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences. CoRR abs/2312.09337 (2023) - [i48]Jiasen Lu, Christopher Clark, Sangho Lee, Zichen Zhang, Savya Khosla, Ryan Marten, Derek Hoiem, Aniruddha Kembhavi:
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action. CoRR abs/2312.17172 (2023) - 2022
- [j4]Luca Weihs, Amanda Rose Yuile, Renée Baillargeon, Cynthia Fisher, Gary Marcus, Roozbeh Mottaghi, Aniruddha Kembhavi:
Benchmarking Progress to Infant-Level Physical Reasoning in AI. Trans. Mach. Learn. Res. 2022 (2022) - [c39]Kshitij Dwivedi, Gemma Roig, Aniruddha Kembhavi, Roozbeh Mottaghi:
What do navigation agents learn about their environment? CVPR 2022: 10266-10275 - [c38]Apoorv Khandelwal, Luca Weihs, Roozbeh Mottaghi, Aniruddha Kembhavi:
Simple but Effective: CLIP Embeddings for Embodied AI. CVPR 2022: 14809-14818 - [c37]Tanmay Gupta, Amita Kamath, Aniruddha Kembhavi, Derek Hoiem:
Towards General Purpose Vision Systems: An End-to-End Task-Agnostic Vision-Language Architecture. CVPR 2022: 16378-16388 - [c36]Kiana Ehsani, Ali Farhadi, Aniruddha Kembhavi, Roozbeh Mottaghi:
Object Manipulation via Visual Target Localization. ECCV (39) 2022: 321-337 - [c35]Amita Kamath, Christopher Clark, Tanmay Gupta, Eric Kolve, Derek Hoiem, Aniruddha Kembhavi:
Webly Supervised Concept Expansion for General Purpose Vision Models. ECCV (36) 2022: 662-681 - [c34]Matt Deitke, Eli VanderBilt, Alvaro Herrasti, Luca Weihs, Kiana Ehsani, Jordi Salvador, Winson Han, Eric Kolve, Aniruddha Kembhavi, Roozbeh Mottaghi:
🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation. NeurIPS 2022 - [c33]Kunal Pratap Singh, Luca Weihs, Alvaro Herrasti, Jonghyun Choi, Aniruddha Kembhavi, Roozbeh Mottaghi:
Ask4Help: Learning to Leverage an Expert for Embodied Tasks. NeurIPS 2022 - [i47]Amita Kamath, Christopher Clark, Tanmay Gupta, Eric Kolve, Derek Hoiem, Aniruddha Kembhavi:
Webly Supervised Concept Expansion for General Purpose Vision Models. CoRR abs/2202.02317 (2022) - [i46]Jiasen Lu, Jordi Salvador, Roozbeh Mottaghi, Aniruddha Kembhavi:
ASC me to Do Anything: Multi-task Training for Embodied AI. CoRR abs/2202.06987 (2022) - [i45]Kiana Ehsani, Ali Farhadi, Aniruddha Kembhavi, Roozbeh Mottaghi:
Object Manipulation via Visual Target Localization. CoRR abs/2203.08141 (2022) - [i44]Tanmay Gupta, Ryan Marten, Aniruddha Kembhavi, Derek Hoiem:
GRIT: General Robust Image Task Benchmark. CoRR abs/2204.13653 (2022) - [i43]Matt Deitke, Eli VanderBilt, Alvaro Herrasti, Luca Weihs, Jordi Salvador, Kiana Ehsani, Winson Han, Eric Kolve, Ali Farhadi, Aniruddha Kembhavi, Roozbeh Mottaghi:
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation. CoRR abs/2206.06994 (2022) - [i42]Kshitij Dwivedi, Gemma Roig, Aniruddha Kembhavi, Roozbeh Mottaghi:
What do navigation agents learn about their environment? CoRR abs/2206.08500 (2022) - [i41]Jiasen Lu, Christopher Clark, Rowan Zellers, Roozbeh Mottaghi, Aniruddha Kembhavi:
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks. CoRR abs/2206.08916 (2022) - [i40]Matt Deitke, Dhruv Batra, Yonatan Bisk, Tommaso Campari, Angel X. Chang, Devendra Singh Chaplot, Changan Chen, Claudia Pérez-D'Arpino, Kiana Ehsani, Ali Farhadi, Li Fei-Fei, Anthony G. Francis, Chuang Gan, Kristen Grauman, David Hall, Winson Han, Unnat Jain, Aniruddha Kembhavi, Jacob Krantz, Stefan Lee, Chengshu Li, Sagnik Majumder, Oleksandr Maksymets, Roberto Martín-Martín, Roozbeh Mottaghi, Sonia Raychaudhuri, Mike Roberts, Silvio Savarese, Manolis Savva, Mohit Shridhar, Niko Sünderhauf, Andrew Szot, Ben Talbot, Joshua B. Tenenbaum, Jesse Thomason, Alexander Toshev, Joanne Truong, Luca Weihs, Jiajun Wu:
Retrospectives on the Embodied AI Workshop. CoRR abs/2210.06849 (2022) - [i39]Sophia Gu, Christopher Clark, Aniruddha Kembhavi:
I Can't Believe There's No Images! Learning Visual Tasks Using only Language Data. CoRR abs/2211.09778 (2022) - [i38]Kunal Pratap Singh, Luca Weihs, Alvaro Herrasti, Jonghyun Choi, Aniruddha Kembhavi, Roozbeh Mottaghi:
Ask4Help: Learning to Leverage an Expert for Embodied Tasks. CoRR abs/2211.09960 (2022) - [i37]Tanmay Gupta, Aniruddha Kembhavi:
Visual Programming: Compositional visual reasoning without training. CoRR abs/2211.11559 (2022) - [i36]Favyen Bastani, Piper Wolters, Ritwik Gupta, Joe Ferdinando, Aniruddha Kembhavi:
Satlas: A Large-Scale, Multi-Task Dataset for Remote Sensing Image Understanding. CoRR abs/2211.15660 (2022) - [i35]Kunal Pratap Singh, Jordi Salvador, Luca Weihs, Aniruddha Kembhavi:
A General Purpose Supervisory Signal for Embodied Agents. CoRR abs/2212.01186 (2022) - [i34]Matt Deitke, Rose Hendrix, Luca Weihs, Ali Farhadi, Kiana Ehsani, Aniruddha Kembhavi:
Phone2Proc: Bringing Robust Robots Into Our Chaotic World. CoRR abs/2212.04819 (2022) - [i33]Matt Deitke, Dustin Schwenk, Jordi Salvador, Luca Weihs, Oscar Michel, Eli VanderBilt, Ludwig Schmidt, Kiana Ehsani, Aniruddha Kembhavi, Ali Farhadi:
Objaverse: A Universe of Annotated 3D Objects. CoRR abs/2212.08051 (2022) - 2021
- [c32]Rowan Zellers, Ari Holtzman, Matthew E. Peters, Roozbeh Mottaghi, Aniruddha Kembhavi, Ali Farhadi, Yejin Choi:
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World. ACL/IJCNLP (1) 2021: 2040-2050 - [c31]Kiana Ehsani, Winson Han, Alvaro Herrasti, Eli VanderBilt, Luca Weihs, Eric Kolve, Aniruddha Kembhavi, Roozbeh Mottaghi:
ManipulaTHOR: A Framework for Visual Object Manipulation. CVPR 2021: 4497-4506 - [c30]Arka Sadhu, Tanmay Gupta, Mark Yatskar, Ram Nevatia, Aniruddha Kembhavi:
Visual Semantic Role Labeling for Video Understanding. CVPR 2021: 5589-5600 - [c29]Luca Weihs, Matt Deitke, Aniruddha Kembhavi, Roozbeh Mottaghi:
Visual Room Rearrangement. CVPR 2021: 5922-5931 - [c28]Christopher Clark, Jordi Salvador, Dustin Schwenk, Derrick Bonafilia, Mark Yatskar, Eric Kolve, Alvaro Herrasti, Jonghyun Choi, Sachin Mehta, Sam Skjonsberg, Carissa Schoenick, Aaron Sarnat, Hannaneh Hajishirzi, Aniruddha Kembhavi, Oren Etzioni, Ali Farhadi:
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text. EMNLP (1) 2021: 1864-1886 - [c27]Unnat Jain, Iou-Jen Liu, Svetlana Lazebnik, Aniruddha Kembhavi, Luca Weihs, Alexander G. Schwing:
GridToPix: Training Embodied Agents with Minimal Supervision. ICCV 2021: 15121-15131 - [c26]Prithvijit Chattopadhyay, Judy Hoffman, Roozbeh Mottaghi, Aniruddha Kembhavi:
RobustNav: Towards Benchmarking Robustness in Embodied Navigation. ICCV 2021: 15671-15680 - [c25]Luca Weihs, Aniruddha Kembhavi, Kiana Ehsani, Sarah M. Pratt, Winson Han, Alvaro Herrasti, Eric Kolve, Dustin Schwenk, Roozbeh Mottaghi, Ali Farhadi:
Learning Generalizable Visual Representations via Interactive Gameplay. ICLR 2021 - [c24]Luca Weihs, Unnat Jain, Iou-Jen Liu, Jordi Salvador, Svetlana Lazebnik, Aniruddha Kembhavi, Alexander G. Schwing:
Bridging the Imitation Gap by Adaptive Insubordination. NeurIPS 2021: 19134-19146 - [c23]Peng Gao, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi:
Container: Context Aggregation Networks. NeurIPS 2021: 19160-19171 - [i32]Luca Weihs, Matt Deitke, Aniruddha Kembhavi, Roozbeh Mottaghi:
Visual Room Rearrangement. CoRR abs/2103.16544 (2021) - [i31]Tanmay Gupta, Amita Kamath, Aniruddha Kembhavi, Derek Hoiem:
Towards General Purpose Vision Systems. CoRR abs/2104.00743 (2021) - [i30]Arka Sadhu, Tanmay Gupta, Mark Yatskar, Ram Nevatia, Aniruddha Kembhavi:
Visual Semantic Role Labeling for Video Understanding. CoRR abs/2104.00990 (2021) - [i29]Kiana Ehsani, Winson Han, Alvaro Herrasti, Eli VanderBilt, Luca Weihs, Eric Kolve, Aniruddha Kembhavi, Roozbeh Mottaghi:
ManipulaTHOR: A Framework for Visual Object Manipulation. CoRR abs/2104.11213 (2021) - [i28]Unnat Jain, Iou-Jen Liu, Svetlana Lazebnik, Aniruddha Kembhavi, Luca Weihs, Alexander G. Schwing:
GridToPix: Training Embodied Agents with Minimal Supervision. CoRR abs/2105.00931 (2021) - [i27]Rowan Zellers, Ari Holtzman, Matthew E. Peters, Roozbeh Mottaghi, Aniruddha Kembhavi, Ali Farhadi, Yejin Choi:
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World. CoRR abs/2106.00188 (2021) - [i26]Peng Gao, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi:
Container: Context Aggregation Network. CoRR abs/2106.01401 (2021) - [i25]Prithvijit Chattopadhyay, Judy Hoffman, Roozbeh Mottaghi, Aniruddha Kembhavi:
RobustNav: Towards Benchmarking Robustness in Embodied Navigation. CoRR abs/2106.04531 (2021) - [i24]Apoorv Khandelwal, Luca Weihs, Roozbeh Mottaghi, Aniruddha Kembhavi:
Simple but Effective: CLIP Embeddings for Embodied AI. CoRR abs/2111.09888 (2021) - [i23]Christopher Clark, Jordi Salvador, Dustin Schwenk, Derrick Bonafilia, Mark Yatskar, Eric Kolve, Alvaro Herrasti, Jonghyun Choi, Sachin Mehta, Sam Skjonsberg, Carissa Schoenick, Aaron Sarnat, Hannaneh Hajishirzi, Aniruddha Kembhavi, Oren Etzioni, Ali Farhadi:
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text. CoRR abs/2112.00800 (2021) - 2020
- [c22]Matt Deitke, Winson Han, Alvaro Herrasti, Aniruddha Kembhavi, Eric Kolve, Roozbeh Mottaghi, Jordi Salvador, Dustin Schwenk, Eli VanderBilt, Matthew Wallingford, Luca Weihs, Mark Yatskar, Ali Farhadi:
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform. CVPR 2020: 3161-3171 - [c21]Vivek Ramanujan, Mitchell Wortsman, Aniruddha Kembhavi, Ali Farhadi, Mohammad Rastegari:
What's Hidden in a Randomly Weighted Neural Network? CVPR 2020: 11890-11899 - [c20]Sarah M. Pratt, Mark Yatskar, Luca Weihs, Ali Farhadi, Aniruddha Kembhavi:
Grounded Situation Recognition. ECCV (4) 2020: 314-332 - [c19]Unnat Jain, Luca Weihs, Eric Kolve, Ali Farhadi, Svetlana Lazebnik, Aniruddha Kembhavi, Alexander G. Schwing:
A Cordial Sync: Going Beyond Marginal Policies for Multi-agent Embodied Tasks. ECCV (5) 2020: 471-490 - [c18]Jaemin Cho, Jiasen Lu, Dustin Schwenk, Hannaneh Hajishirzi, Aniruddha Kembhavi:
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers. EMNLP (1) 2020: 8785-8805 - [c17]Purva Tendulkar, Abhishek Das, Aniruddha Kembhavi, Devi Parikh:
Feel The Music: Automatically Generating A Dance For An Input Song. ICCC 2020: 292-295 - [c16]Martin Lohmann, Jordi Salvador, Aniruddha Kembhavi, Roozbeh Mottaghi:
Learning About Objects by Learning to Interact with Them. NeurIPS 2020 - [c15]Mitchell Wortsman, Vivek Ramanujan, Rosanne Liu, Aniruddha Kembhavi, Mohammad Rastegari, Jason Yosinski, Ali Farhadi:
Supermasks in Superposition. NeurIPS 2020 - [i22]Sarah M. Pratt, Mark Yatskar, Luca Weihs, Ali Farhadi, Aniruddha Kembhavi:
Grounded Situation Recognition. CoRR abs/2003.12058 (2020) - [i21]Matt Deitke, Winson Han, Alvaro Herrasti, Aniruddha Kembhavi, Eric Kolve, Roozbeh Mottaghi, Jordi Salvador, Dustin Schwenk, Eli VanderBilt, Matthew Wallingford, Luca Weihs, Mark Yatskar, Ali Farhadi:
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform. CoRR abs/2004.06799 (2020) - [i20]Martin Lohmann, Jordi Salvador, Aniruddha Kembhavi, Roozbeh Mottaghi:
Learning About Objects by Learning to Interact with Them. CoRR abs/2006.09306 (2020) - [i19]Purva Tendulkar, Abhishek Das, Aniruddha Kembhavi, Devi Parikh:
Feel The Music: Automatically Generating A Dance For An Input Song. CoRR abs/2006.11905 (2020) - [i18]Dhruv Batra, Aaron Gokaslan, Aniruddha Kembhavi, Oleksandr Maksymets, Roozbeh Mottaghi, Manolis Savva, Alexander Toshev, Erik Wijmans:
ObjectNav Revisited: On Evaluation of Embodied Agents Navigating to Objects. CoRR abs/2006.13171 (2020) - [i17]Mitchell Wortsman, Vivek Ramanujan, Rosanne Liu, Aniruddha Kembhavi, Mohammad Rastegari, Jason Yosinski, Ali Farhadi:
Supermasks in Superposition. CoRR abs/2006.14769 (2020) - [i16]Matthew Wallingford, Aditya Kusupati, Keivan Alizadeh-Vahid, Aaron Walsman, Aniruddha Kembhavi, Ali Farhadi:
In the Wild: From ML Models to Pragmatic ML Systems. CoRR abs/2007.02519 (2020) - [i15]Unnat Jain, Luca Weihs, Eric Kolve, Ali Farhadi, Svetlana Lazebnik, Aniruddha Kembhavi, Alexander G. Schwing:
A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks. CoRR abs/2007.04979 (2020) - [i14]Luca Weihs, Unnat Jain, Jordi Salvador, Svetlana Lazebnik, Aniruddha Kembhavi, Alexander G. Schwing:
Bridging the Imitation Gap by Adaptive Insubordination. CoRR abs/2007.12173 (2020) - [i13]Luca Weihs, Jordi Salvador, Klemen Kotar, Unnat Jain, Kuo-Hao Zeng, Roozbeh Mottaghi, Aniruddha Kembhavi:
AllenAct: A Framework for Embodied AI Research. CoRR abs/2008.12760 (2020) - [i12]Jaemin Cho, Jiasen Lu, Dustin Schwenk, Hannaneh Hajishirzi, Aniruddha Kembhavi:
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers. CoRR abs/2009.11278 (2020)
2010 – 2019
- 2019
- [c14]Huiyu Wang, Aniruddha Kembhavi, Ali Farhadi, Alan L. Yuille, Mohammad Rastegari:
ELASTIC: Improving CNNs With Dynamic Scaling Policies. CVPR 2019: 2258-2267 - [c13]Unnat Jain, Luca Weihs, Eric Kolve, Mohammad Rastegari, Svetlana Lazebnik, Ali Farhadi, Alexander G. Schwing, Aniruddha Kembhavi:
Two Body Problem: Collaborative Visual Task Completion. CVPR 2019: 6689-6699 - [i11]Unnat Jain, Luca Weihs, Eric Kolve, Mohammad Rastegari, Svetlana Lazebnik, Ali Farhadi, Alexander G. Schwing, Aniruddha Kembhavi:
Two Body Problem: Collaborative Visual Task Completion. CoRR abs/1904.05879 (2019) - [i10]Vivek Ramanujan, Mitchell Wortsman, Aniruddha Kembhavi, Ali Farhadi, Mohammad Rastegari:
What's Hidden in a Randomly Weighted Neural Network? CoRR abs/1911.13299 (2019) - [i9]Luca Weihs, Aniruddha Kembhavi, Winson Han, Alvaro Herrasti, Eric Kolve, Dustin Schwenk, Roozbeh Mottaghi, Ali Farhadi:
Artificial Agents Learn Flexible Visual Representations by Playing a Hiding Game. CoRR abs/1912.08195 (2019) - 2018
- [c12]Jonghyun Choi, Jayant Krishnamurthy, Aniruddha Kembhavi, Ali Farhadi:
Structured Set Matching Networks for One-Shot Part Labeling. CVPR 2018: 3627-3636 - [c11]Daniel Gordon, Aniruddha Kembhavi, Mohammad Rastegari, Joseph Redmon, Dieter Fox, Ali Farhadi:
IQA: Visual Question Answering in Interactive Environments. CVPR 2018: 4089-4098 - [c10]Aishwarya Agrawal, Dhruv Batra, Devi Parikh, Aniruddha Kembhavi:
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering. CVPR 2018: 4971-4980 - [c9]Tanmay Gupta, Dustin Schwenk, Ali Farhadi, Derek Hoiem, Aniruddha Kembhavi:
Imagine This! Scripts to Compositions to Videos. ECCV (8) 2018: 610-626 - [i8]Tanmay Gupta, Dustin Schwenk, Ali Farhadi, Derek Hoiem, Aniruddha Kembhavi:
Imagine This! Scripts to Compositions to Videos. CoRR abs/1804.03608 (2018) - [i7]Huiyu Wang, Aniruddha Kembhavi, Ali Farhadi, Alan L. Yuille, Mohammad Rastegari:
ELASTIC: Improving CNNs with Instance Specific Scaling Policies. CoRR abs/1812.05262 (2018) - 2017
- [c8]Aniruddha Kembhavi, Min Joon Seo, Dustin Schwenk, Jonghyun Choi, Ali Farhadi, Hannaneh Hajishirzi:
Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension. CVPR 2017: 5376-5384 - [c7]Min Joon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi:
Bidirectional Attention Flow for Machine Comprehension. ICLR (Poster) 2017 - [i6]Aishwarya Agrawal, Aniruddha Kembhavi, Dhruv Batra, Devi Parikh:
C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset. CoRR abs/1704.08243 (2017) - [i5]Aishwarya Agrawal, Dhruv Batra, Devi Parikh, Aniruddha Kembhavi:
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering. CoRR abs/1712.00377 (2017) - [i4]Jonghyun Choi, Jayant Krishnamurthy, Aniruddha Kembhavi, Ali Farhadi:
Structured Set Matching Networks for One-Shot Part Labeling. CoRR abs/1712.01867 (2017) - [i3]Daniel Gordon, Aniruddha Kembhavi, Mohammad Rastegari, Joseph Redmon, Dieter Fox, Ali Farhadi:
IQA: Visual Question Answering in Interactive Environments. CoRR abs/1712.03316 (2017) - 2016
- [c6]Aniruddha Kembhavi, Mike Salvato, Eric Kolve, Min Joon Seo, Hannaneh Hajishirzi, Ali Farhadi:
A Diagram is Worth a Dozen Images. ECCV (4) 2016: 235-251 - [c5]Jayant Krishnamurthy, Oyvind Tafjord, Aniruddha Kembhavi:
Semantic Parsing to Probabilistic Programs for Situated Question Answering. EMNLP 2016: 160-170 - [i2]Aniruddha Kembhavi, Mike Salvato, Eric Kolve, Min Joon Seo, Hannaneh Hajishirzi, Ali Farhadi:
A Diagram Is Worth A Dozen Images. CoRR abs/1603.07396 (2016) - [i1]Min Joon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi:
Bidirectional Attention Flow for Machine Comprehension. CoRR abs/1611.01603 (2016) - 2011
- [j3]Aniruddha Kembhavi, David Harwood, Larry S. Davis:
Vehicle Detection Using Partial Least Squares. IEEE Trans. Pattern Anal. Mach. Intell. 33(6): 1250-1265 (2011) - 2010
- [b1]Aniruddha Kembhavi:
Recognizing Objects And Reasoning About Their Interactions. University of Maryland, College Park, MD, USA, 2010 - [c4]Aniruddha Kembhavi, Tom Yeh, Larry S. Davis:
Why Did the Person Cross the Road (There)? Scene Understanding Using Probabilistic Logic Models and Common Sense Reasoning. ECCV (2) 2010: 693-706
2000 – 2009
- 2009
- [j2]Yunqian Ma, Petr Císar, Aniruddha Kembhavi:
Motion segmentation and activity representation in crowds. Int. J. Imaging Syst. Technol. 19(2): 80-90 (2009) - [j1]Abhinav Gupta, Aniruddha Kembhavi, Larry S. Davis:
Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(10): 1775-1789 (2009) - [c3]William Robson Schwartz, Aniruddha Kembhavi, David Harwood, Larry S. Davis:
Human detection using partial least squares analysis. ICCV 2009: 24-31 - [c2]Aniruddha Kembhavi, Behjat Siddiquie, Roland Miezianko, Scott McCloskey, Larry S. Davis:
Incremental Multiple Kernel Learning for object recognition. ICCV 2009: 638-645 - 2008
- [c1]Aniruddha Kembhavi, Ryan Farrell, Yuancheng Luo, David W. Jacobs, Ramani Duraiswami, Larry S. Davis:
Tracking Down Under: Following the Satin Bowerbird. WACV 2008: 1-7
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-20 20:58 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint