Amrit S. Bedi
2020 – today
- 2024
- [j23]Amrit Singh Bedi, Anjaly Parayil, Junyu Zhang, Mengdi Wang, Alec Koppel:
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control. J. Mach. Learn. Res. 25: 39:1-39:58 (2024) - [c59]Souradip Chakraborty, Amrit S. Bedi, Alec Koppel, Huazheng Wang, Dinesh Manocha, Mengdi Wang, Furong Huang:
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback. ICLR 2024 - [c58]Souradip Chakraborty, Amrit S. Bedi, Sicheng Zhu, Bang An, Dinesh Manocha, Furong Huang:
Position: On the Possibilities of AI-Generated Text Detection. ICML 2024 - [c57]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Dinesh Manocha, Furong Huang, Amrit S. Bedi, Mengdi Wang:
MaxMin-RLHF: Alignment with Diverse Human Preferences. ICML 2024 - [c56]Mudit Gaur, Amrit S. Bedi, Di Wang, Vaneet Aggarwal:
Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization. ICML 2024 - [c55]Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Dinesh Manocha, Amrit S. Bedi:
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles. ICML 2024 - [c54]Utsav Singh, Wesley A. Suttle, Brian M. Sadler, Vinay P. Namboodiri, Amrit S. Bedi:
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling. ICML 2024 - [c53]Senthil Hariharan Arul, Amrit Singh Bedi, Dinesh Manocha:
When, What, and with Whom to Communicate: Enhancing RL-based Multi-Robot Navigation through Selective Communication. IROS 2024: 7695 - [c52]Xingpeng Sun, Yiran Zhang, Xindi Tang, Amrit Singh Bedi, Aniket Bera:
TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation. IROS 2024: 8794-8801 - [c51]Chak Lam Shek, Xiyang Wu, Wesley A. Suttle, Carl E. Busart, Erin G. Zaroukian, Dinesh Manocha, Pratap Tokekar, Amrit Singh Bedi:
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments. IROS 2024: 9612-9619 - [i64]Xingpeng Sun, Haoming Meng, Souradip Chakraborty, Amrit Singh Bedi, Aniket Bera:
Beyond Text: Improving LLM's Decision Making for Robot Navigation via Vocal Cues. CoRR abs/2402.03494 (2024) - [i63]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Singh Bedi, Mengdi Wang:
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences. CoRR abs/2402.08925 (2024) - [i62]Xiyang Wu, Ruiqi Xian, Tianrui Guan, Jing Liang, Souradip Chakraborty, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi:
On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities. CoRR abs/2402.10340 (2024) - [i61]Peihong Yu, Manav Mishra, Alec Koppel, Carl E. Busart, Priya Narayan, Dinesh Manocha, Amrit S. Bedi, Pratap Tokekar:
Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning. CoRR abs/2403.08936 (2024) - [i60]Vishnu Sashank Dorbala, Bhrij Patel, Amrit Singh Bedi, Dinesh Manocha:
Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals. CoRR abs/2403.09905 (2024) - [i59]Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha:
Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic. CoRR abs/2403.11925 (2024) - [i58]Utsav Singh, Wesley A. Suttle, Brian M. Sadler, Vinay P. Namboodiri, Amrit Singh Bedi:
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling. CoRR abs/2404.13423 (2024) - [i57]Mudit Gaur, Amrit Singh Bedi, Di Wang, Vaneet Aggarwal:
Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization. CoRR abs/2405.01843 (2024) - [i56]Marco Bornstein, Amrit Singh Bedi, Abdirisak Mohamed, Furong Huang:
FACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding? CoRR abs/2405.13879 (2024) - [i55]Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang:
Transfer Q Star: Principled Decoding for LLM Alignment. CoRR abs/2405.20495 (2024) - [i54]Bhrij Patel, Vishnu Sashank Dorbala, Dinesh Manocha, Amrit Singh Bedi:
Embodied Question Answering via Multi-LLM Systems. CoRR abs/2406.10918 (2024) - [i53]Mucong Ding, Souradip Chakraborty, Vibhu Agrawal, Zora Che, Alec Koppel, Mengdi Wang, Amrit S. Bedi, Furong Huang:
SAIL: Self-Improving Efficient Online Alignment of Large Language Models. CoRR abs/2406.15567 (2024) - [i52]Xingpeng Sun, Yiran Zhang, Xindi Tang, Amrit Singh Bedi, Aniket Bera:
TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation. CoRR abs/2408.01867 (2024) - [i51]Mohamad Fares El Hajj Chehade, Amrit Singh Bedi, Amy Zhang, Hao Zhu:
CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk. CoRR abs/2408.08812 (2024) - [i50]Marco Bornstein, Zora Che, Suhas Julapalli, Abdirisak Mohamed, Amrit Singh Bedi, Furong Huang:
Auction-Based Regulation for Artificial Intelligence. CoRR abs/2410.01871 (2024) - [i49]Bhrij Patel, Souradip Chakraborty, Wesley A. Suttle, Mengdi Wang, Amrit Singh Bedi, Dinesh Manocha:
AIME: AI System Optimization via Multiple LLM Evaluators. CoRR abs/2410.03131 (2024) - [i48]Anas Barakat, Souradip Chakraborty, Peihong Yu, Pratap Tokekar, Amrit Singh Bedi:
On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning. CoRR abs/2410.04108 (2024) - [i47]Mudit Gaur, Amrit Singh Bedi, Raghu Pasupathy, Vaneet Aggarwal:
On The Global Convergence Of Online RLHF With Neural Parametrization. CoRR abs/2410.15610 (2024) - [i46]Kai Cheng, Zhengyuan Li, Xingpeng Sun, Byung-Cheol Min, Amrit Singh Bedi, Aniket Bera:
EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering. CoRR abs/2410.20263 (2024) - [i45]Utsav Singh, Souradip Chakraborty, Wesley A. Suttle, Brian M. Sadler, Anit Kumar Sahu, Mubarak Shah, Vinay P. Namboodiri, Amrit Singh Bedi:
Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction. CoRR abs/2411.00361 (2024) - [i44]Soumya Suvra Ghosal, Souradip Chakraborty, Vaibhav Singh, Tianrui Guan, Mengdi Wang, Ahmad Beirami, Furong Huang, Alvaro Velasquez, Dinesh Manocha, Amrit Singh Bedi:
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment. CoRR abs/2411.18688 (2024) - [i43]James Beetham, Souradip Chakraborty, Mengdi Wang, Furong Huang, Amrit Singh Bedi, Mubarak Shah:
LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds. CoRR abs/2412.05232 (2024)
- 2023
- [j22]Soumya Suvra Ghosal, Souradip Chakraborty, Jonas Geiping, Furong Huang, Dinesh Manocha, Amrit S. Bedi:
A Survey on the Possibilities & Impossibilities of AI-generated Text Detection. Trans. Mach. Learn. Res. 2023 (2023) - [c50]Qinbo Bai, Amrit Singh Bedi, Vaneet Aggarwal:
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm. AAAI 2023: 6737-6744 - [c49]Souradip Chakraborty, Amrit Singh Bedi, Pratap Tokekar, Alec Koppel, Brian M. Sadler, Furong Huang, Dinesh Manocha:
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning. AAAI 2023: 6980-6988 - [c48]Hans He, Alec Koppel, Amrit Singh Bedi, Mazen Farhood, Daniel J. Stilwell:
Bi-Level Nonstationary Kernels for Online Gaussian Process Regression. CASE 2023: 1-7 - [c47]Xiyang Wu, Rohan Chandra, Tianrui Guan, Amrit S. Bedi, Dinesh Manocha:
Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning. CoRL 2023: 446-477 - [c46]Marco Bornstein, Tahseen Rabbani, Evan Wang, Amrit S. Bedi, Furong Huang:
SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication. ICLR 2023 - [c45]Souradip Chakraborty, Amrit S. Bedi, Alec Koppel, Mengdi Wang, Furong Huang, Dinesh Manocha:
STEERING : Stein Information Directed Exploration for Model-Based Reinforcement Learning. ICML 2023: 3949-3978 - [c44]Wesley A. Suttle, Amrit S. Bedi, Bhrij Patel, Brian M. Sadler, Alec Koppel, Dinesh Manocha:
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic. ICML 2023: 33240-33267 - [c43]Souradip Chakraborty, Amrit Singh Bedi, Kasun Weerakoon, Prithvi Poddar, Alec Koppel, Pratap Tokekar, Dinesh Manocha:
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policy Optimization. ICRA 2023: 989-995 - [c42]Aakriti Agrawal, Amrit Singh Bedi, Dinesh Manocha:
RTAW: An Attention Inspired Reinforcement Learning Method for Multi-Robot Task Allocation in Warehouse Environments. ICRA 2023: 1393-1399 - [c41]Hans He, Alec Koppel, Amrit Singh Bedi, Daniel J. Stilwell, Mazen Farhood, Benjamin Biggs:
Decentralized Multi-agent Exploration with Limited Inter-agent Communications. ICRA 2023: 5530-5536 - [i42]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Mengdi Wang, Furong Huang, Dinesh Manocha:
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning. CoRR abs/2301.12038 (2023) - [i41]Wesley A. Suttle, Amrit Singh Bedi, Bhrij Patel, Brian M. Sadler, Alec Koppel, Dinesh Manocha:
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic. CoRR abs/2301.12083 (2023) - [i40]Souradip Chakraborty, Kasun Weerakoon, Prithvi Poddar, Pratap Tokekar, Amrit Singh Bedi, Dinesh Manocha:
RE-MOVE: An Adaptive Policy Design Approach for Dynamic Environments via Language-Based Feedback. CoRR abs/2303.07622 (2023) - [i39]Souradip Chakraborty, Amrit Singh Bedi, Sicheng Zhu, Bang An, Dinesh Manocha, Furong Huang:
On the Possibilities of AI-Generated Text Detection. CoRR abs/2304.04736 (2023) - [i38]Bhrij Patel, Kasun Weerakoon, Wesley A. Suttle, Alec Koppel, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha:
Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation. CoRR abs/2306.06192 (2023) - [i37]Xiyang Wu, Rohan Chandra, Tianrui Guan, Amrit Singh Bedi, Dinesh Manocha:
iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning. CoRR abs/2306.06236 (2023) - [i36]Mudit Gaur, Amrit Singh Bedi, Di Wang, Vaneet Aggarwal:
On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization. CoRR abs/2306.10486 (2023) - [i35]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Dinesh Manocha, Huazheng Wang, Furong Huang, Mengdi Wang:
Aligning Agent Policy with Externalities: Reward Design via Bilevel RL. CoRR abs/2308.02585 (2023) - [i34]Chak Lam Shek, Xiyang Wu, Dinesh Manocha, Pratap Tokekar, Amrit Singh Bedi:
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments. CoRR abs/2310.00481 (2023) - [i33]Marco Bornstein, Amrit Singh Bedi, Anit Kumar Sahu, Furqan Khan, Furong Huang:
RealFM: A Realistic Mechanism to Incentivize Data Contribution and Device Participation. CoRR abs/2310.13681 (2023) - [i32]Soumya Suvra Ghosal, Souradip Chakraborty, Jonas Geiping, Furong Huang, Dinesh Manocha, Amrit Singh Bedi:
Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey. CoRR abs/2310.15264 (2023) - [i31]Souradip Chakraborty, Amisha Bhaskar, Anukriti Singh, Pratap Tokekar, Dinesh Manocha, Amrit Singh Bedi:
REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback. CoRR abs/2312.14436 (2023)
- 2022
- [j21]Amrit Singh Bedi, Ketan Rajawat, Vaneet Aggarwal, Alec Koppel:
Escaping Saddle Points for Successive Convex Approximation. IEEE Trans. Signal Process. 70: 307-321 (2022) - [j20]Zeeshan Akhtar, Amrit Singh Bedi, Srujan Teja Thomdapu, Ketan Rajawat:
Projection-Free Stochastic Bi-Level Optimization. IEEE Trans. Signal Process. 70: 6332-6347 (2022) - [c40]Qinbo Bai, Amrit Singh Bedi, Mridul Agarwal, Alec Koppel, Vaneet Aggarwal:
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach. AAAI 2022: 3682-3689 - [c39]Junyu Zhang, Amrit Singh Bedi, Mengdi Wang, Alec Koppel:
Multi-Agent Reinforcement Learning with General Utilities via Decentralized Shadow Reward Actor-Critic. AAAI 2022: 9031-9039 - [c38]Kushal Chakrabarti, Amrit S. Bedi, Fikadu T. Dagefu, Jeffrey N. Twigg, Nikhil Chopra:
Fast Distributed Beamforming without Receiver Feedback. IEEECONF 2022: 1408-1412 - [c37]Alec Koppel, Amrit Singh Bedi, Bhargav Ganguly, Vaneet Aggarwal:
Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming. CDC 2022: 4545-4552 - [c36]Kasun Weerakoon, Souradip Chakraborty, Nare Karapetyan, Adarsh Jagan Sathyamoorthy, Amrit S. Bedi, Dinesh Manocha:
HTRON: Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm. CoRL 2022: 1629-1639 - [c35]Amrit Singh Bedi, Souradip Chakraborty, Anjaly Parayil, Brian M. Sadler, Pratap Tokekar, Alec Koppel:
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces. ICML 2022: 1716-1731 - [c34]Anis Elgabli, Chaouki Ben Issaid, Amrit Singh Bedi, Ketan Rajawat, Mehdi Bennis, Vaneet Aggarwal:
FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning. ICML 2022: 5861-5877 - [c33]Yulun Tian, Amrit Singh Bedi, Alec Koppel, Miguel Calvo-Fullana, David M. Rosen, Jonathan P. How:
Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation. IROS 2022: 4391-4398 - [c32]Aakriti Agrawal, Senthil Hariharan Arul, Amrit Singh Bedi, Dinesh Manocha:
DC-MRTA: Decentralized Multi-Robot Task Allocation and Navigation in Complex Environments. IROS 2022: 11711-11718 - [i30]Amrit Singh Bedi, Souradip Chakraborty, Anjaly Parayil, Brian M. Sadler, Pratap Tokekar, Alec Koppel:
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces. CoRR abs/2201.12332 (2022) - [i29]Yulun Tian, Amrit Singh Bedi, Alec Koppel, Miguel Calvo-Fullana, David M. Rosen, Jonathan P. How:
Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation. CoRR abs/2203.00851 (2022) - [i28]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Brian M. Sadler, Furong Huang, Pratap Tokekar, Dinesh Manocha:
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning. CoRR abs/2206.01162 (2022) - [i27]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Pratap Tokekar, Dinesh Manocha:
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies. CoRR abs/2206.05652 (2022) - [i26]Qinbo Bai, Amrit Singh Bedi, Vaneet Aggarwal:
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm. CoRR abs/2206.05850 (2022) - [i25]Anis Elgabli, Chaouki Ben Issaid, Amrit S. Bedi, Ketan Rajawat, Mehdi Bennis, Vaneet Aggarwal:
FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning. CoRR abs/2206.08829 (2022) - [i24]Amrit Singh Bedi, Chen Fan, Alec Koppel, Anit Kumar Sahu, Brian M. Sadler, Furong Huang, Dinesh Manocha:
FedBC: Calibrating Global and Local Models via Federated Learning Beyond Consensus. CoRR abs/2206.10815 (2022) - [i23]Kasun Weerakoon, Souradip Chakraborty, Nare Karapetyan, Adarsh Jagan Sathyamoorthy, Amrit Singh Bedi, Dinesh Manocha:
HTRON: Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm. CoRR abs/2207.03694 (2022) - [i22]Aakriti Agrawal, Senthil Hariharan Arul, Amrit Singh Bedi, Dinesh Manocha:
DC-MRTA: Decentralized Multi-Robot Task Allocation and Navigation in Complex Environments. CoRR abs/2209.02865 (2022) - [i21]Aakriti Agrawal, Amrit Singh Bedi, Dinesh Manocha:
RTAW: An Attention Inspired Reinforcement Learning Method for Multi-Robot Task Allocation in Warehouse Environments. CoRR abs/2209.05738 (2022) - [i20]Senthil Hariharan Arul, Amrit Singh Bedi, Dinesh Manocha:
DMCA: Dense Multi-agent Navigation using Attention and Communication. CoRR abs/2209.06415 (2022) - [i19]Marco Bornstein, Tahseen Rabbani, Evan Wang, Amrit Singh Bedi, Furong Huang:
SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication. CoRR abs/2210.14026 (2022)
- 2021
- [j19]Junyu Zhang, Amrit Singh Bedi, Mengdi Wang, Alec Koppel:
Cautious Reinforcement Learning via Distributional Risk in the Dual Domain. IEEE J. Sel. Areas Inf. Theory 2(2): 611-626 (2021) - [j18]Rishabh Dixit, Amrit Singh Bedi, Ketan Rajawat:
Online Learning Over Dynamic Graphs via Distributed Proximal Gradient Algorithm. IEEE Trans. Autom. Control. 66(11): 5065-5079 (2021) - [j17]Anis Elgabli, Jihong Park, Amrit Singh Bedi, Chaouki Ben Issaid, Mehdi Bennis, Vaneet Aggarwal:
Q-GADMM: Quantized Group ADMM for Communication Efficient Decentralized Machine Learning. IEEE Trans. Commun. 69(1): 164-181 (2021) - [j16]Hrusikesha Pradhan, Amrit Singh Bedi, Alec Koppel, Ketan Rajawat:
Adaptive Kernel Learning in Heterogeneous Networks. IEEE Trans. Signal Inf. Process. over Networks 7: 423-437 (2021) - [j15]Amrit Singh Bedi, Alec Koppel, Ketan Rajawat, Panchajanya Sanyal:
Nonparametric Compositional Stochastic Optimization for Risk-Sensitive Kernel Learning. IEEE Trans. Signal Process. 69: 428-442 (2021) - [j14]Deepak S. Kalhan, Amrit Singh Bedi, Alec Koppel, Ketan Rajawat, Hamed Hassani, Abhishek K. Gupta, Adrish Banerjee:
Dynamic Online Learning via Frank-Wolfe Algorithm. IEEE Trans. Signal Process. 69: 932-947 (2021) - [j13]Zeeshan Akhtar, Amrit Singh Bedi, Ketan Rajawat:
Conservative Stochastic Optimization With Expectation Constraints. IEEE Trans. Signal Process. 69: 3190-3205 (2021) - [j12]Alec Koppel, Amrit Singh Bedi, Brian M. Sadler, Víctor Elvira:
Nearly Consistent Finite Particle Estimates in Streaming Importance Sampling. IEEE Trans. Signal Process. 69: 6401-6415 (2021) - [c31]Alec Koppel, Amrit Singh Bedi, Bhargav Ganguly, Vaneet Aggarwal:
Randomized Linear Programming for Tabular Average-Cost Multi-agent Reinforcement Learning. ACSCC 2021: 1023-1026 - [c30]Junyu Zhang, Amrit Singh Bedi, Mengdi Wang, Alec Koppel:
Beyond Cumulative Returns via Reinforcement Learning over State-Action Occupancy Measures. ACC 2021: 894-901 - [c29]Zeeshan Akhtar, Amrit Singh Bedi, Ketan Rajawat:
Conservative Stochastic Optimization: $\mathcal{O}(T^{-1/2})$ Optimality Gap with Zero Constraint Violation. ACC 2021: 2224-2229 - [c28]Anjaly Parayil, Amrit Singh Bedi, Alec Koppel:
Joint Position and Beamforming Control via Alternating Nonlinear Least-Squares with a Hierarchical Gamma Prior. ACC 2021: 3513-3518 - [c27]Amrit Singh Bedi, Alec Koppel, Mengdi Wang, Junyu Zhang:
Intermittent Communications in Decentralized Shadow Reward Actor-Critic. CDC 2021: 2613-2620 - [c26]Anis Elgabli, Chaouki Ben Issaid, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal:
Energy-Efficient and Federated Meta-Learning via Projected Stochastic Gradient Ascent. GLOBECOM 2021: 1-6 - [c25]Alec Koppel, Amrit S. Bedi, Vikram Krishnamurthy:
A Dynamical Systems Perspective on Online Bayesian Nonparametric Estimators with Adaptive Hyperparameters. ICASSP 2021: 2975-2979 - [c24]Michael E. Kepler, Alec Koppel, Amrit Singh Bedi, Daniel J. Stilwell:
Wasserstein-Splitting Gaussian Process Regression for Heterogeneous Online Bayesian Inference. IROS 2021: 9833-9840 - [i18]Anis Elgabli, Chaouki Ben Issaid, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal:
Energy-Efficient and Federated Meta-Learning via Projected Stochastic Gradient Ascent. CoRR abs/2105.14772 (2021) - [i17]Junyu Zhang, Amrit Singh Bedi, Mengdi Wang, Alec Koppel:
MARL with General Utilities via Decentralized Shadow Reward Actor-Critic. CoRR abs/2106.00543 (2021) - [i16]Amrit Singh Bedi, Anjaly Parayil, Junyu Zhang, Mengdi Wang, Alec Koppel:
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control. CoRR abs/2106.08414 (2021) - [i15]Michael E. Kepler, Alec Koppel, Amrit Singh Bedi, Daniel J. Stilwell:
Wasserstein-Splitting Gaussian Process Regression for Heterogeneous Online Bayesian Inference. CoRR abs/2107.12797 (2021) - [i14]Qinbo Bai, Amrit Singh Bedi, Mridul Agarwal, Alec Koppel, Vaneet Aggarwal:
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach. CoRR abs/2109.06332 (2021) - [i13]Zeeshan Akhtar, Amrit Singh Bedi, Srujan Teja Thomdapu, Ketan Rajawat:
Projection-Free Algorithm for Stochastic Bi-level Optimization. CoRR abs/2110.11721 (2021)
- 2020
- [j11]Anis Elgabli, Jihong Park, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal:
GADMM: Fast and Communication Efficient Framework for Distributed Machine Learning. J. Mach. Learn. Res. 21: 76:1-76:39 (2020) - [j10]Yulun Tian, Alec Koppel, Amrit Singh Bedi, Jonathan P. How:
Asynchronous and Parallel Distributed Pose Graph Optimization. IEEE Robotics Autom. Lett. 5(4): 5819-5826 (2020) - [j9]Alec Koppel, Amrit Singh Bedi, Ketan Rajawat, Brian M. Sadler:
Optimally Compressed Nonparametric Online Learning: Tradeoffs between memory and consistency. IEEE Signal Process. Mag. 37(3): 61-70 (2020) - [j8]Mohan Krishna Nutalapati, Amrit Singh Bedi, Ketan Rajawat, Marceau Coupechoux:
Online Trajectory Optimization Using Inexact Gradient Feedback for Time-Varying Environments. IEEE Trans. Signal Process. 68: 4824-4838 (2020) - [c23]Hrusikesha Pradhan, Amrit Singh Bedi, Alec Koppel, Ketan Rajawat:
Conservative Multi-agent Online Kernel Learning in Heterogeneous Networks. ACSSC 2020: 53-57 - [c22]Amrit Singh Bedi, Alec Koppel, Ketan Rajawat, Brian M. Sadler:
Trading Dynamic Regret for Model Complexity in Nonstationary Nonparametric Optimization. ACC 2020: 321-326 - [c21]Anis Elgabli, Jihong Park, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal:
Communication Efficient Framework for Decentralized Machine Learning. CISS 2020: 1-5 - [c20]Deepak S. Kalhan, Amrit S. Bedi, Alec Koppel, Ketan Rajawat, Abhishek K. Gupta, Adrish Banerjee:
Projection Free Dynamic Online Learning. ICASSP 2020: 3957-3961 - [c19]Anis Elgabli, Jihong Park, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal:
Q-GADMM: Quantized Group ADMM for Communication Efficient Decentralized Machine Learning. ICASSP 2020: 8876-8880 - [c18]Amrit Singh Bedi, Dheeraj Peddireddy, Vaneet Aggarwal, Alec Koppel:
Efficient Large-Scale Gaussian Process Bandits by Believing only Informative Actions. L4DC 2020: 924-934 - [c17]Junyu Zhang, Alec Koppel, Amrit Singh Bedi, Csaba Szepesvári, Mengdi Wang:
Variational Policy Gradient Method for Reinforcement Learning with General Utilities. NeurIPS 2020 - [i12]Junyu Zhang, Amrit Singh Bedi, Mengdi Wang, Alec Koppel:
Cautious Reinforcement Learning via Distributional Risk in the Dual Domain. CoRR abs/2002.12475 (2020) - [i11]Yulun Tian, Alec Koppel, Amrit Singh Bedi, Jonathan P. How:
Asynchronous and Parallel Distributed Pose Graph Optimization. CoRR abs/2003.03281 (2020) - [i10]Amrit Singh Bedi, Dheeraj Peddireddy, Vaneet Aggarwal, Alec Koppel:
Efficient Gaussian Process Bandits by Believing only Informative Actions. CoRR abs/2003.10550 (2020) - [i9]Junyu Zhang, Alec Koppel, Amrit Singh Bedi, Csaba Szepesvári, Mengdi Wang:
Variational Policy Gradient Method for Reinforcement Learning with General Utilities. CoRR abs/2007.02151 (2020) - [i8]Zeeshan Akhtar, Amrit Singh Bedi, Ketan Rajawat:
Conservative Stochastic Optimization with Expectation Constraints. CoRR abs/2008.05758 (2020)
2010 – 2019
- 2019
- [j7]Amrit Singh Bedi, Alec Koppel, Ketan Rajawat:
Asynchronous Online Learning in Multi-Agent Systems With Proximity Constraints. IEEE Trans. Signal Inf. Process. over Networks 5(3): 479-494 (2019) - [j6]Rishabh Dixit, Amrit Singh Bedi, Ruchi Tripathi, Ketan Rajawat:
Online Learning With Inexact Proximal Online Gradient Descent Algorithms. IEEE Trans. Signal Process. 67(5): 1338-1352 (2019) - [j5]Amrit Singh Bedi, Alec Koppel, Ketan Rajawat:
Asynchronous Saddle Point Algorithm for Stochastic Optimization in Heterogeneous Networks. IEEE Trans. Signal Process. 67(7): 1742-1757 (2019) - [c16]Amrit Singh Bedi, Alec Koppel, Brian M. Sadler, Víctor Elvira:
Compressed Streaming Importance Sampling for Efficient Representations of Localization Distributions. ACSSC 2019: 477-481 - [c15]Alec Koppel, Amrit S. Bedi, Ketan Rajawat:
Controlling the Bias-Variance Tradeoff via Coherent Risk for Robust Learning with Kernels. ACC 2019: 3519-3525 - [c14]Rishabh Dixit, Amrit Singh Bedi, Ketan Rajawat, Alec Koppel:
Distributed Online Learning over Time-varying Graphs via Proximal Gradient Descent. CDC 2019: 2745-2751 - [i7]Rishabh Dixit, Amrit Singh Bedi, Ketan Rajawat:
Online Learning over Dynamic Graphs via Distributed Proximal Gradient Algorithm. CoRR abs/1905.07018 (2019) - [i6]Anis Elgabli, Jihong Park, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal:
GADMM: Fast and Communication Efficient Framework for Distributed Machine Learning. CoRR abs/1909.00047 (2019) - [i5]Amrit Singh Bedi, Alec Koppel, Ketan Rajawat, Brian M. Sadler:
Nonstationary Nonparametric Online Learning: Balancing Dynamic Regret and Model Parsimony. CoRR abs/1909.05442 (2019) - [i4]Alec Koppel, Amrit Singh Bedi, Victor Elvira, Brian M. Sadler:
Approximate Shannon Sampling in Importance Sampling: Nearly Consistent Finite Particle Estimates. CoRR abs/1909.10279 (2019) - [i3]Alec Koppel, Amrit Singh Bedi, Ketan Rajawat, Brian M. Sadler:
Optimally Compressed Nonparametric Online Learning. CoRR abs/1909.11555 (2019) - [i2]Anis Elgabli, Jihong Park, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal:
Q-GADMM: Quantized Group ADMM for Communication Efficient Decentralized Machine Learning. CoRR abs/1910.10453 (2019)
- 2018
- [j4]Amrit S. Bedi, Paban Sarma, Ketan Rajawat:
Tracking Moving Agents via Inexact Online Gradient Descent Algorithm. IEEE J. Sel. Top. Signal Process. 12(1): 202-217 (2018) - [j3]Amrit Singh Bedi, Ketan Rajawat:
Network Resource Allocation via Stochastic Subgradient Descent: Convergence Rate. IEEE Trans. Commun. 66(5): 2107-2121 (2018) - [j2]Amrit S. Bedi, Ketan Rajawat:
Asynchronous Incremental Stochastic Dual Descent Algorithm for Network Resource Allocation. IEEE Trans. Signal Process. 66(9): 2229-2244 (2018) - [c13]Anant Chopra, Deepak S. Kalhan, Amrit S. Bedi, Abhishek K. Gupta, Ketan Rajawat:
On Socially Optimal Traffic Flow in the Presence of Random Users. ANTS 2018: 1-6 - [c12]Rishabh Dixit, Amrit Singh Bedi, Ruchi Tripathi, Ketan Rajawat:
Time Varying optimization via Inexact Proximal Online Gradient Descent. ACSSC 2018: 759-763 - [c11]Amrit Singh Bedi, Alec Koppel, Ketan Rajawat:
Asynchronous Saddle Point Method: Interference Management Through Pricing. CDC 2018: 3229-3235 - [c10]Hrusikesha Pradhan, Amrit Singh Bedi, Alec Koppel, Ketan Rajawat:
Exact Nonparametric Decentralized Online Optimization. GlobalSIP 2018: 643-647 - [c9]Amrit Singh Bedi, Paban Sarma, Ketan Rajawat:
Adversarial Multi-Agent Target Tracking with Inexact Online Gradient Descent. ICASSP 2018: 2881-2885 - [c8]Amrit S. Bedi, Ketan Rajawat, Marceau Coupechoux:
An Online Approach to D2D Trajectory Utility Maximization Problem. INFOCOM 2018: 1610-1618 - [c7]Amrit Singh Bedi, Hrusikesha Pradhan, Ketan Rajawat:
Decentralized Asynchronous Stochastic Gradient Descent: Convergence Rate Analysis. SPCOM 2018: 402-406 - [c6]Amrit Singh Bedi, Ketan Rajawat:
Wireless network optimization via stochastic sub-gradient descent: Rate analysis. WCNC 2018: 1-6 - [i1]Anant Chopra, Deepak S. Kalhan, Amrit S. Bedi, Abhishek K. Gupta, Ketan Rajawat:
On Socially Optimal Traffic Flow in the Presence of Random Users. CoRR abs/1810.07934 (2018)
- 2017
- [c5]Amrit Singh Bedi, Alec Koppel, Ketan Rajawat:
Beyond consensus and synchrony in decentralized online optimization using saddle point method. ACSSC 2017: 293-297 - [c4]Amrit S. Bedi, Ketan Rajawat:
Asynchronous resource allocation in distributed heterogeneous networks. ICC 2017: 1-7 - [c3]Amrit S. Bedi, Md. Waseem Ahmad, Ketan Rajawat, Sandeep Anand:
Optimal utilization of storage systems under real-time pricing. ICC Workshops 2017: 1141-1146
- 2016
- [j1]Amrit S. Bedi, Javed Akhtar, Ketan Rajawat, Aditya K. Jagannatham:
BER-Optimized Precoders for OFDM Systems With Insufficient Cyclic Prefix. IEEE Commun. Lett. 20(2): 280-283 (2016) - [c2]Javed Akhtar, Amrit S. Bedi, Ketan Rajawat, Aditya K. Jagannatham:
BER-Optimized Robust Precoder Design for MIMO-OFDM Systems with Insufficient CP. GLOBECOM 2016: 1-6 - [c1]Amrit S. Bedi, Ketan Rajawat:
Online load scheduling under price and demand uncertainty in smart grid. SPCOM 2016: 1-5
last updated on 2025-01-22 21:27 CET by the dblp team
all metadata released as open data under CC0 1.0 license