research-article

Bayes–Nash: Bayesian inference for Nash equilibrium selection in human-robot parallel play

Published:01 January 2022Publication History
Skip Abstract Section

Abstract

Abstract

We consider shared workspace scenarios with humans and robots acting to achieve independent goals, termed as parallel play. We model these as general-sum games and construct a framework that utilizes the Nash equilibrium solution concept to consider the interactive effect of both agents while planning. We find multiple Pareto-optimal equilibria in these tasks. We hypothesize that people act by choosing an equilibrium based on social norms and their personalities. To enable coordination, we infer the equilibrium online using a probabilistic model that includes these two factors and use it to select the robot’s action. We apply our approach to a close-proximity pick-and-place task involving a robot and a simulated human with three potential behaviors—defensive, selfish, and norm-following. We showed that using a Bayesian approach to infer the equilibrium enables the robot to complete the task with less than half the number of collisions while also reducing the task execution time as compared to the best baseline. We also performed a study with human participants interacting either with other humans or with different robot agents and observed that our proposed approach performs similar to human-human parallel play interactions.

References

  1. Bansal, S., Cosgun, A., Nakhaei, A., & Fujimura, K. (2018). Collaborative planning for mixed-autonomy lane merging. In 2018 IEEE/RSJ international conference on intelligent robots and systems (IROS).Google ScholarGoogle Scholar
  2. Barto AGSutton RSAnderson CWNeuronlike adaptive elements that can solve difficult learning control problemsIEEE Transactions on Systems, Man, and Cybernetics1983583484610.1109/TSMC.1983.6313077Google ScholarGoogle ScholarCross RefCross Ref
  3. Brockman, G., Cheung, V., Pettersson, L., Schneider, J., Schulman, J., Tang, J., & Zaremba, W. (2016). Openai gym. arXiv preprint arXiv:1606.01540.Google ScholarGoogle Scholar
  4. Carroll, M., Shah, R., Ho, MK., Griffiths, T., Seshia, S., Abbeel, P., & Dragan, A. (2019). On the utility of learning about humans for human-ai coordination. In Advances in Neural Information Processing Systems (pp 5175–5186).Google ScholarGoogle Scholar
  5. Chen, M., Nikolaidis, S., Soh, H., Hsu, D., & Srinivasa, S. (2018). Planning with trust for human-robot collaboration. In Proceedings of the 2018 ACM/IEEE international conference on human-robot interaction (pp. 307–315).Google ScholarGoogle Scholar
  6. Diankov, R. (2010). Automated construction of robotic manipulation programs. PhD thesis, Carnegie Mellon University, Robotics Institute.Google ScholarGoogle Scholar
  7. Engel, D., Woolley, A. W., Jing, L. X., Chabris, C. F., & Malone, T. W. (2014). Reading the mind in the eyes or reading between the lines? Theory of mind predicts collective intelligence equally well online and face-to-face. PloS One,9(12)Google ScholarGoogle Scholar
  8. Fisac, J. F., Bronstein, E., Stefansson, E., Sadigh, D., Sastry, S. S., & Dragan, A. D. (2019). Hierarchical game-theoretic planning for autonomous vehicles. In 2019 International conference on robotics and automation (ICRA) (pp 9590–9596). IEEE.Google ScholarGoogle Scholar
  9. Gabler, V., Stahl, T., Huber, G., Oguz, O., & Wollherr, D. (2017). A game-theoretic approach for adaptive action selection in close proximity human-robot-collaboration. In 2017 IEEE international conference on robotics and automation (ICRA).Google ScholarGoogle Scholar
  10. Gombolay MCGutierrez RAClarke SGSturla GFShah JADecision-making authority, team efficiency and human worker satisfaction in mixed human-robot teamsAutonomous Robots201539329331210.1007/s10514-015-9457-9Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Hawkins, K. P., Bansal, S., Vo, N. N., & Bobick, A. F. (2014). Anticipating human actions for collaboration in the presence of task and sensor uncertainty. In 2014 ieee international conference on Robotics and automation (ICRA).Google ScholarGoogle Scholar
  12. Ho, M. K., MacGlashan, J., Greenwald, A., Littman, M. L., Hilliard, E., Trimbach, C., Brawner, S., Tenenbaum, J., Kleiman-Weiner, M., & Austerweil, J. L. (2016). Feature-based joint planning and norm learning in collaborative games. In CogSciGoogle ScholarGoogle Scholar
  13. Hoffman GEvaluating fluency in human-robot collaborationIEEE Transactions on Human-Machine Systems201949320921810.1109/THMS.2019.2904558Google ScholarGoogle ScholarCross RefCross Ref
  14. Koppula HSSaxena AAnticipating human activities using object affordances for reactive robotic responseIEEE Transactions on Pattern Analysis and Machine Intelligence2015381142910.1109/TPAMI.2015.2430335Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Lavalle, S. M. (1998). Rapidly-exploring random trees: a new tool for path planning. Tech. rep.Google ScholarGoogle Scholar
  16. Leyton-Brown KShoham YEssentials of game theory: A concise multidisciplinary introductionSynthesis Lectures on Artificial Intelligence and Machine Learning20082118810.2200/S00108ED1V01Y200802AIM003Google ScholarGoogle ScholarCross RefCross Ref
  17. Li, S., Shah, J. A. (2019). Safe and efficient high dimensional motion planning in space-time with time parameterized prediction. In 2019 international conference on robotics and automation (ICRA).Google ScholarGoogle Scholar
  18. Mailath GJDo people play nash equilibrium? Lessons from evolutionary game theoryJournal of Economic Literature199836313471374Google ScholarGoogle Scholar
  19. Mainprice JHayne RBerenson DGoal set inverse optimal control and iterative replanning for predicting human reaching motions in shared workspacesIEEE Transactions on Robotics201632489790810.1109/TRO.2016.2581216Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Nikolaidis, S., Kuznetsov, A., Hsu, D., & Srinivasa, S. (2016). Formalizing human-robot mutual adaptation: A bounded memory model. In 2016 11th ACM/IEEE international conference on human-robot interaction (HRI) (pp. 75–82). IEEE.Google ScholarGoogle Scholar
  21. Nikolaidis, S., Nath, S., Procaccia, A. D., & Srinivasa, S. (2017). Game-theoretic modeling of human adaptation in human-robot collaboration. In Proceedings of the 2017 ACM/IEEE international conference on human-robot interaction (pp. 323–331).Google ScholarGoogle Scholar
  22. Nikolaidis, S., Ramakrishnan, R., Gu, K., & Shah, J. (2015). Efficient model learning from joint-action demonstrations for human-robot collaborative tasks. In ACM/IEEE international conference on human-robot interaction.Google ScholarGoogle Scholar
  23. Park, H. W., & Howard, A. M. (2010). Understanding a child’s play for robot interaction by sequencing play primitives using hidden markov models. In 2010 IEEE international conference on robotics and automation (pp. 170–177).Google ScholarGoogle Scholar
  24. Parten MBSocial participation among pre-school childrenThe Journal of Abnormal and Social Psychology193227324310.1037/h0074524Google ScholarGoogle ScholarCross RefCross Ref
  25. Peters, L., Fridovich-Keil, D., Tomlin, C., & Sunberg, Z. (2020). Inference-based strategy alignment for general-sum differential games. In AAMAS ’20, international foundation for autonomous agents and multiagent systems. https://github.com/lassepe/AAMAS2020-GameInference-Paper/blob/master/submission/ibsa-camera-ready-aamas2020.pdf.Google ScholarGoogle Scholar
  26. Premack DWoodruff GDoes the chimpanzee have a theory of mind?Behavioral and Brain Sciences19781451552610.1017/S0140525X00076512Google ScholarGoogle ScholarCross RefCross Ref
  27. Sadigh, D., Sastry, S., Seshia, S. A., & Dragan, A. D. (2016a). Planning for autonomous cars that leverage effects on human actions. In Robotics: Science and systems.Google ScholarGoogle Scholar
  28. Sadigh, D., Sastry, S. S., Seshia, S. A., & Dragan, A. (2016b). Information gathering actions over human internal state. In 2016 IEEE/RSJ international conference on intelligent robots and systems (IROS) (pp. 66–73). IEEE.Google ScholarGoogle Scholar
  29. Schwarting WPierson AAlonso-Mora JKaraman SRus DSocial behavior for autonomous vehiclesProceedings of the National Academy of Sciences2019116502497224978404734910.1073/pnas.1820676116Google ScholarGoogle Scholar
  30. Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S., Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I., Lillicrap, T., Leach, M., Kavukcuoglu, K., Graepel T., & Hassabis, D. (2016). Mastering the game of go with deep neural networks and tree search. Nature, 529.(7587):484Google ScholarGoogle Scholar
  31. Spica RCristofalo EWang ZMontijano ESchwager MA real-time game theoretic planner for autonomous two-player drone racingIEEE Transactions on Robotics20203651389140310.1109/TRO.2020.2994881Google ScholarGoogle Scholar
  32. Sucan, I. A., Moll, M., & Kavraki, L. E. (2012). The open motion planning library. IEEE Robotics & Automation Magazine. DOI: https://doi.org/10.1109/MRA.2012.2205651.Google ScholarGoogle Scholar
  33. Tesauro GTemporal difference learning and td-gammonCommunications of the ACM1995383586810.1145/203330.203343Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Trautman, P., & Krause, A. (2010). Unfreezing the robot: Navigation in dense, interacting crowds. In 2010 IEEE/RSJ international conference on intelligent robots and systems (pp. 797–803). IEEE.Google ScholarGoogle Scholar
  35. Turnwald AWollherr DHuman-like motion planning based on game theoretic decision makingInternational Journal of Social Robotics201911115117010.1007/s12369-018-0487-2Google ScholarGoogle ScholarCross RefCross Ref
  36. Unhelkar, V. V., Siu, H. C., Shah, J. A. (2014). Comparative performance of human and mobile robotic assistants in collaborative fetch-and-deliver tasks. In ACM/IEEE international conference on human-robot interaction (HRI).Google ScholarGoogle Scholar
  37. Ziebart, B. D., Ratliff, N., Gallagher, G., Mertz, C., Peterson, K., Bagnell, J. A., Hebert, M., Dey, A. K., & Srinivasa, S. (2009). Planning-based prediction for pedestrians. In 2009 IEEE/RSJ international conference on intelligent robots and systems (pp. 3931–3936). IEEE.Google ScholarGoogle Scholar

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

About Cookies On This Site

We use cookies to ensure that we give you the best experience on our website.

Learn more

Got it!