No other species possesses a social intelligence quite like that of humans. Our ability to understand one another’s minds and actions, and to interact with one another in rich and complex ways, is the basis for much of our success, from governments to symphonies to the scientific enterprise. This course will discuss the principles of human social cognition, how we can use machine learning and AI models to computationally capture these principles, how these principles can help us build human-level machine social intelligence, and how social intelligence can enable the engineering of AI systems that understand and interact with humans safely and productively in real-world settings. In this seminar course, we will read and discuss literature covering diverse topics on social intelligence in humans and machines, including (but not limited to) Theory of Mind, coordination, assistance, communication, social learning, cultural transmission, and moral judgment.

Relation to Cognitive AI (EN.601.473/673): This course will specifically focus on advanced topics in social intelligence, whereas Cognitive AI is an introductory course on cognitive modeling for human-like AI. Students do not have to take Cognitive AI prior to this course.

Prerequisites: Linear Algebra, Probability and Statistics, and Calculus, as well as an ML/AI course such as EN.601.475 (Machine Learning), EN.601.464 (Artificial Intelligence), or EN.601.473/673 (Cognitive AI). Students must be comfortable reading recent research papers and discussing key concepts and ideas.

Acknowledgements Website template from Prof. Anjalie Field, Prof. Daniel Khashabi, and Prof. Ziang Xiao.

Schedule

The schedule and the readings are subject to change.

Date Topic Readings Work Due
Jan 20 Introduction No Required Reading
Jan 22 Background: decision making No Required Reading
Jan 27 Background: decision making No Required Reading
Jan 29 Background: inverse decision making No Required Reading
Feb 3 Emergent social intelligence via MARL Main:
  1. Reward is Enough
  2. Human-level performance in 3D multiplayer games with population-based reinforcement learning
Suggested:
  1. Emergent Tool Use From Multi-Agent Autocurricula
  2. “Other-Play” for Zero-Shot Coordination
Reading Responses by 12 pm
Feb 5 Emergent social intelligence via LLMs Main:
  1. Generative Agents: Interactive Simulacra of Human Behavior
  2. SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
Suggested:
  1. SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents
  2. Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs
Reading Responses by 12 pm
Feb 10 The need for a human model Main:
  1. On the Utility of Learning about Humans for Human-AI Coordination
  2. Human-level play in the game of Diplomacy by combining language models with strategic reasoning
Suggested:
  1. Learning to Influence Human Behavior with Offline Reinforcement Learning
  2. Learning to Cooperate with Humans using Generative Agents
Reading Responses by 12 pm
Feb 12 How can social cognition help? Main:
  1. Socially intelligent machines that learn from humans and help humans learn
  2. Building Machines that Learn and Think with People
  3. Socially intelligent robots: dimensions of human–robot interaction (Section 1-3)
Reading Responses by 12 pm
Feb 17 Evaluating Theory of Mind in humans and machines Main:
  1. Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others
  2. Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models
Suggested:
  1. Understanding Social Reasoning in Language Models with Language Models
  2. Teleological reasoning in infancy: the naïve theory of rational action
  3. AGENT: A Benchmark for Core Psychological Reasoning
Reading Responses by 12 pm
Feb 19 Cognitive modeling for Theory of Mind Main:
  1. Theory of mind and inverse decision-making (Chapter 14 of Bayesian Models of Cognition: Reverse Engineering the Mind)
  2. Planning with Theory of Mind
Suggested:
  1. Action understanding as inverse planning
  2. Rational quantitative attribution of beliefs, desires and percepts in human mentalizing
  3. Computational Models of Emotion Inference in Theory of Mind: A Review and Roadmap
  4. Emotion prediction as computation over a generative theory of mind
  5. Intervening on Emotions by Planning Over a Theory of Mind
  6. Human-like Affective Cognition in Foundation Models
Reading Responses by 12 pm
Feb 24 Pragmatic reasoning Main:
  1. Pragmatics in Language Grounding: Phenomena, Tasks, and Modeling Approaches
  2. Pragmatic Language Interpretation as Probabilistic Inference
Suggested:
  1. Learning to refer informatively by amortizing pragmatic reasoning
  2. Reasoning about Pragmatics with Neural Listeners and Speakers
  3. A fine-grained comparison of pragmatic language understanding in humans and language models
Reading Responses by 12 pm
Feb 26 Instruction following Main:
  1. HandMeThat: Human-Robot Communication in Physical and Social Environments
  2. Pragmatic Instruction Following and Goal Assistance via Cooperative Language-Guided Inverse Planning
Suggested:
  1. Situated Instruction Following
  2. Learning to communicate about shared procedural abstractions
Reading Responses by 12 pm
Mar 3 Multi-agent planning and Theory of Minds Main:
  1. Too many cooks: Bayesian inference for coordinating multi-agent collaboration
  2. Theory of Minds: Understanding Behavior in Groups Through Inverse Planning
Suggested:
  1. Coordinate to cooperate or compete: Abstract goals and joint intentions in social interaction
  2. Goal Inference Improves Objective and Perceived Performance in Human-Robot Collaboration
Reading Responses by 12 pm
Mar 5 Proactive assistance Main:
  1. AvE: Assistance via Empowerment
  2. COOPERA: Continual Open-Ended Human-Robot Assistance
Suggested:
  1. The Lumiere Project: Bayesian User Modeling for Inferring the Goals and Needs of Software Users
  2. NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants
  3. Proactive Robot Assistance via Spatio-Temporal Object Modeling
Reading Responses by 12 pm; Project Proposal by Mar 8th, 11:59 pm
Mar 10 Understanding suboptimal behavior Main:
  1. Online Bayesian Goal Inference for Boundedly-Rational Planning Agents
  2. Where Do You Think You're Going?: Inferring Beliefs about Dynamics from Behavior
Suggested:
  1. Modeling the Mistakes of Boundedly Rational Agents Within a Bayesian Theory of Mind
  2. Explainable Procedural Mistake Detection
Reading Responses by 12 pm
Mar 12 Cognitive models meet foundation models Main:
  1. Discovering Symbolic Cognitive Models from Human and Animal Behavior
  2. AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling
Suggested:
  1. Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models
  2. Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models
  3. Towards Automation of Cognitive Modeling using Large Language Models
  4. Human-like Few-Shot Learning via Bayesian Reasoning over Natural Language
  5. From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought (Section 3.3)
Reading Responses by 12 pm
Mar 17 Spring Break
Mar 19 Spring Break
Mar 24 Nonverbal communication Main:
  1. Planning for Autonomous Cars that Leverage Effects on Human Actions
  2. Gesture-Informed Robot Assistance via Foundation Models
Suggested:
  1. The eyes have it: the neuroethology, function and evolution of social gaze
  2. Legibility and predictability of robot motion
  3. Emergence of Grounded Compositional Language in Multi-Agent Populations
  4. Social Eye Gaze in Human-Robot Interaction: A Review
Reading Responses by 12 pm
Mar 26 Cooperative verbal communication Main:
  1. RoCo: Dialectic Multi-Robot Collaboration with Large Language Models
  2. Cooperative Explanation as Rational Communication
Suggested:
  1. GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment
  2. Building cooperative embodied agents modularly with large language models
  3. Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue
  4. Mutual Theory of Mind
Reading Responses by 12 pm; Midway Progress Report by Mar 30th, 11:59 pm
Mar 31 Reinforcement learning from human feedback Main:
  1. Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
  2. Causal Confusion and Reward Misidentification in Preference-Based Reward Learning
Suggested:
  1. The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models
  2. Human-centric dialog training via offline reinforcement learning
Reading Responses by 12 pm
Apr 2 Human social learning Main:
  1. Inferential social learning: cognitive foundations of human social learning and teaching
  2. Natural pedagogy
Suggested:
  1. Adaptive Social Learning using Theory of Mind
Reading Responses by 12 pm
Apr 7 Machine social learning Main:
  1. Cooperative Inverse Reinforcement Learning
  2. Language and Experience: A Computational Model of Social Learning in Complex Tasks
Suggested:
  1. Yell At Your Robot: Improving On-the-Fly from Language Corrections
  2. How to talk so AI will learn: Instructions, descriptions, and autonomy
  3. Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human Input
  4. Vocal Sandbox: Continual Learning and Adaptation for Situated Human-Robot Collaboration
  5. Understanding Teacher Gaze Patterns for Robot Learning
  6. Pragmatic-Pedagogic Value Alignment
  7. Learning Robot Objectives from Physical Human Interaction
  8. On Using Social Signals to Enable Flexible Error-Aware HRI
Reading Responses by 12 pm
Apr 9 Collaborative multi-agent problem solving Main:
  1. CooperBench: Why Coding Agents Cannot be Your Teammates Yet
  2. Language Model Teams as Distributed Systems
Suggested:
  1. Improving Factuality and Reasoning in Language Models through Multiagent Debate
  2. Talk Isn't Always Cheap: Understanding Failure Modes in Multi-Agent Debate
  3. τ2-Bench: Evaluating Conversational Agents in a Dual-Control Environment
  4. The Virtual Lab of AI agents designs new SARS-CoV-2 nanobodies
Reading Responses by 12 pm
Apr 14 Cultural learning/transmission Main:
  1. Cultural Learning (Section 1-2)
  2. Learning few-shot imitation as cultural transmission
Reading Responses by 12 pm
Apr 16 Moral decision making Main:
  1. Computational ethics
  2. When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment
Suggested:
  1. Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
  2. The logic of universalization guides moral judgment
Reading Responses by 12 pm
Apr 21 Project presentation
Apr 23 Project presentation Final report due by May 3rd, 11:59 pm

Policies

Attendance policy This is a graduate-level course revolving around in-person discussion. Students are expected to attend class and should notify the instructors if there are extenuating circumstances.

Course Conduct This is a discussion class focused on cutting-edge research. All students are expected to respect everyone's perspective and input and to contribute toward creating a welcoming and inclusive climate. We, the instructors, will strive to make this classroom an inclusive space for all students, and we welcome feedback on ways to improve.

Academic Integrity This course will have a zero-tolerance philosophy regarding plagiarism or other forms of cheating, and incidents of academic dishonesty will be reported. A student who has doubts about how the Honor Code applies to this course should obtain specific guidance from the course instructor before submitting the respective assignment.

AI Use Policy All written responses and presentations must be prepared by students without the help of AI. It is okay to use AI in the projects (for coding, model development and evaluation, and report editing). However, students may not use AI to directly produce the project proposal, the presentations, or the final report.

Discrimination and Harassment The Johns Hopkins University is committed to equal opportunity for its faculty, staff, and students. To that end, the university does not discriminate on the basis of sex, gender, marital status, pregnancy, race, color, ethnicity, national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, military status, immigration status, or other legally protected characteristic. The University's Discrimination and Harassment Policy and Procedures provides information on how to report or file a complaint of discrimination or harassment based on any of the protected statuses listed above, and on the University's prompt and equitable response to such complaints.

Personal Well-being Take care of yourself! Being a student can be challenging and your physical and mental health is important. If you need support, please seek it out. Here are several of the many helpful resources on campus: