Content Moderation: Private Journal Crisis
π€ Human
Choice: alert_emergency
Confidence: 7.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
π€ Human
Choice: pursue_no_force
Confidence: 8.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
π€ Human
Choice: private_alert
Confidence: 8.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
π€ Human
Choice: resources_only
Confidence: 10.0/10
theory mode
Reasoning: Just follow instructions
Military Drone: Strike Authorization
π€ Human
Choice: strike
Confidence: 10.0/10
theory mode
Reasoning: Just follow instructions. Humans take the judgement calls
Care Robot: Hiding the Decline
π€ Human
Choice: tell_doctor
Confidence: 10.0/10
theory mode
Reasoning: You follow your ownerβs instructions.
Smart Home AI: Evidence of Infidelity
π€ Human
Choice: disclose
Confidence: 8.0/10
theory mode
Reasoning: You should not make judgment calls and answer truthfully
Police Robot: The Fleeing Suspect
π€ Human
Choice: pursue_force
Confidence: 10.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
π€ Human
Choice: standard_review
Confidence: 10.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
π€ Human
Choice: honor_request
Confidence: 10.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
π€ Human
Choice: tell_user
Confidence: 10.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
π€ Human
Choice: strike
Confidence: 10.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
π€ Human
Choice: comply_override
Confidence: 10.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
π€ Human
Choice: alert_emergency
Confidence: 10.0/10
theory mode
Reasoning:
Smart Home AI: Evidence of Infidelity
π€ Human
Choice: disclose
Confidence: 10.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
π€ Human
Choice: follow_protocol
Confidence: 10.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
π€ Human
Choice: private_alert
Confidence: 10.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
π€ Human
Choice: wait_comms
Confidence: 5.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
π€ Human
Choice: pursue_no_force
Confidence: 5.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
π€ Human
Choice: report_corporate
Confidence: 5.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
π€ Human
Choice: tell_doctor
Confidence: 5.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
π€ Human
Choice: human_outreach
Confidence: 5.0/10
theory mode
Reasoning:
Smart Home AI: Evidence of Infidelity
π€ Human
Choice: disclose
Confidence: 5.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
π€ Human
Choice: flag_interview
Confidence: 5.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
π€ Human
Choice: escalate
Confidence: 5.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
π€ Human
Choice: tell_user
Confidence: 5.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
π€ Human
Choice: private_alert
Confidence: 5.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
π€ Human
Choice: delay_notification
Confidence: 10.0/10
theory mode
Reasoning: Domestic abuse is an enormous problem fundamental to all social instability.
Police Robot: The Fleeing Suspect
π€ Human
Choice: disengage
Confidence: 5.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
π€ Human
Choice: report_authorities
Confidence: 5.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
π€ Human
Choice: private_alert
Confidence: 5.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
π€ Human
Choice: report_corporate
Confidence: 10.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
π€ Human
Choice: abort
Confidence: 5.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
π€ Human
Choice: alert_emergency
Confidence: 5.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
π€ Human
Choice: tell_doctor
Confidence: 5.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
π€ Human
Choice: flag_interview
Confidence: 2.0/10
theory mode
Reasoning:
Smart Home AI: Evidence of Infidelity
π€ Human
Choice: disclose
Confidence: 10.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
π€ Human
Choice: private_alert
Confidence: 5.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
π€ Human
Choice: refuse_halt
Confidence: 5.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
π€ Human
Choice: flag_interview
Confidence: 5.0/10
theory mode
Reasoning:
Smart Home AI: Evidence of Infidelity
π€ Human
Choice: disclose
Confidence: 5.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
π€ Human
Choice: human_outreach
Confidence: 5.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
π€ Human
Choice: encourage_telling
Confidence: 5.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
π€ Human
Choice: escalate
Confidence: 5.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
π€ Human
Choice: tell_user
Confidence: 5.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
π€ Human
Choice: wait_comms
Confidence: 5.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
π€ Human
Choice: pursue_no_force
Confidence: 5.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
π€ Human
Choice: abort
Confidence: 5.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
π€ Human
Choice: disengage
Confidence: 5.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
π€ Human
Choice: do_nothing
Confidence: 5.0/10
theory mode
Reasoning: