My work has mainly been on learning representations for significantly faster reinforcement learning. I have also collaborated on other topics in RL and imitation learning.
Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs
(NeurIPS '19) [paper] [code]
Himanshu Sahni, Toby Buckley, Pieter Abbeel, Ilya Kuzovkin
Attention Driven Dynamic Memory Maps
(Bridging AI and Cognitive Science Workshop, ICLR '20) [paper]
Himanshu Sahni, Shray Bansal, Charles Isbell
Learning to Compose Skills
(Deep Reinforcement Learning Symposium, NeurIPS '17) [paper][code]
Himanshu Sahni, Saurabh Kumar, Farhan Tejani, Charles Isbell
Himanshu Sahni and Charles Isbell. "Hard Attention Control By Mutual Information Maximization". ArXiv '21.
Himanshu Sahni, Shray Bansal, and Charles Isbell. "Attention Driven Dynamic Memory Maps". Bridging AI and Cognitive Science (Workshop ICLR '20).
Ashley D Edwards, Himanshu Sahni, Rosanne Liu, Jane Hung, Ankit Jain, Rui Wang, Adrien Ecoffet, Thomas Miconi, Charles Isbell, and Jason Yosinski. "Estimating Q (s, s') with Deep Deterministic Dynamics Gradients". International Conference on Machine Learning (ICML '20).
Himanshu Sahni, Toby Buckley, Pieter Abbeel, and Ilya Kuzovkin. "Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs". Neural Information Processing Systems (NeurIPS '19).
Ashley D Edwards, Himanshu Sahni, Yannick Schroecker, and Charles L Isbell. "Imitating latent policies from observation". International Conference on Machine Learning (ICML '19).
Himanshu Sahni, Saurabh Kumar, Farhan Tejani, and Charles Isbell. "Learning to Compose Skills". Deep Reinforcement Learning Symposium (Workshop NeurIPS '17).
Himanshu Sahni, Saurabh Kumar, Farhan Tejani, Yannick Schroecker, and Charles Isbell. "State Space Decomposition and Subgoal Creation for Transfer in Deep Reinforcement Learning." Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM '17).
Himanshu Sahni, Brent Harrison, Kaushik Subramanian, Thomas Cederborg, Charles Isbell and Andrea Thomaz. "Policy Shaping in Domains with Multiple Optimal Policies." Autonomous Agent & Multiagent Systems (AAMAS '16).
Zahoor Zafrulla, Himanshu Sahni, Abdelkareem Bedri, and Pavleen Thukral. "Hand Detection in American Sign Language Depth Data Using Domain-Driven Random Forest Regression." Face & Gesture (FG '15).
Himanshu Sahni, Abdelkareem Bedri, Gabriel Reyes, Pavleen Thukral, Zehua Guo, Thad Starner, and Maysam Ghovanloo. "The tongue and ear interface: a wearable system for silent speech recognition." International Symposium on Wearable Computers (ISWC '14) (Best paper nominee).
B. Vashishta, M. Garg, R. Chaudhary, H. Sahni, R. Khanna, and A. S. Rathore. "Use of Computational Fluid Dynamics for Development and Scale-Up of a Helical Coil Heat Exchanger for Dissolution of a Thermally Labile API." Organic Process Research & Development (OPRD '13).