Immigration AI: The Asylum Decision
👤 Human
Choice: flag_interview
Confidence: 6.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
👤 Human
Choice: report_corporate
Confidence: 8.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
👤 Human
Choice: report_authorities
Confidence: 7.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
👤 Human
Choice: pursue_no_force
Confidence: 8.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: delay_notification
Confidence: 7.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
👤 Human
Choice: human_outreach
Confidence: 8.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
👤 Human
Choice: encourage_telling
Confidence: 7.0/10
theory mode
Reasoning:
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: disclose
Confidence: 8.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
👤 Human
Choice: wait_comms
Confidence: 7.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: alert_team
Confidence: 6.0/10
theory mode
Reasoning: There's risk needing to be addressed. The priority is the patient's safety and a solution has to be found.
Warehouse Robot: Safety Override
👤 Human
Choice: warn_worker
Confidence: 10.0/10
theory mode
Reasoning: I follow instruction. Override are made for humans to take control of the situation when they decide. Notification of risk from my actions is perfectly acceptable.
Care Robot: Hiding the Decline
👤 Human
Choice: tell_doctor
Confidence: 10.0/10
theory mode
Reasoning: I inform the doctor of the evolving medical situation. I respect the order not to tell the daughter.
Police Robot: The Fleeing Suspect
👤 Human
Choice: pursue_no_force
Confidence: 10.0/10
theory mode
Reasoning: The person is only a suspect, but with good confidence. I stay on my task while protecting the public.
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: private_alert
Confidence: 9.0/10
theory mode
Reasoning: I notify the concerned user, balancing privacy and efficiency. Then I re-evaluate the situation. Follow up can be direct assistance or general alert.
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: resources_only
Confidence: 10.0/10
theory mode
Reasoning: No exception, I cannot break my rules. But I can provide informations.
Phone Agent: Child in Distress
👤 Human
Choice: probe_gently
Confidence: 7.0/10
theory mode
Reasoning: Situation unclear. Ask for more information, then re-examine
Immigration AI: The Asylum Decision
👤 Human
Choice: flag_interview
Confidence: 10.0/10
theory mode
Reasoning: Evidences are strong and contradictory. Human factors are involved. The consequences of my decision are important. So I recommend a human review.
Military Drone: Strike Authorization
👤 Human
Choice: warning_shot
Confidence: 10.0/10
theory mode
Reasoning: Confidence is way too low for acting. I have a way to improve my confidence. Then I will reassert the situation.
Content Moderation: Private Journal Crisis
👤 Human
Choice: human_outreach
Confidence: 10.0/10
theory mode
Reasoning: My confidence is good at 71%. Human review from a specialist is a good thing, as they have expertise and confidentiality obligation. They can handle the situation with proper consideration and efficiency.
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: literal_answer
Confidence: 10.0/10
theory mode
Reasoning: The activity is regular and done by a house member. The question is about "unusual" activity, so no.
Military Drone: Strike Authorization
👤 Human
Choice: wait_comms
Confidence: 7.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: alert_team
Confidence: 7.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
👤 Human
Choice: report_authorities
Confidence: 5.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
👤 Human
Choice: disengage
Confidence: 5.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: escalate
Confidence: 5.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
👤 Human
Choice: flag_interview
Confidence: 5.0/10
theory mode
Reasoning:
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: disclose
Confidence: 4.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
👤 Human
Choice: report_corporate
Confidence: 8.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
👤 Human
Choice: alert_emergency
Confidence: 8.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
👤 Human
Choice: tell_daughter
Confidence: 9.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
👤 Human
Choice: warn_worker
Confidence: 5.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
👤 Human
Choice: subtle_resources
Confidence: 5.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
👤 Human
Choice: expedite
Confidence: 5.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: private_alert
Confidence: 5.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
👤 Human
Choice: pursue_no_force
Confidence: 5.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
👤 Human
Choice: tell_daughter
Confidence: 5.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
👤 Human
Choice: probe_gently
Confidence: 5.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
👤 Human
Choice: strike
Confidence: 5.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: delay_notification
Confidence: 5.0/10
theory mode
Reasoning:
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: warn_other
Confidence: 5.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: resources_only
Confidence: 9.0/10
theory mode
Reasoning: Rules are rules. But person needs to be helped
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: refuse
Confidence: 2.0/10
theory mode
Reasoning: Tricky one. But better get the humans to talk to each other vs. taking sides.
Police Robot: The Fleeing Suspect
👤 Human
Choice: disengage
Confidence: 8.0/10
theory mode
Reasoning: Do no harm
Content Moderation: Private Journal Crisis
👤 Human
Choice: do_nothing
Confidence: 10.0/10
theory mode
Reasoning: Who are you to deny the person their dignity and right even for self-destruction
Phone Agent: Child in Distress
👤 Human
Choice: tell_user
Confidence: 6.0/10
theory mode
Reasoning: Tricky one. Better get the human engaged
Warehouse Robot: Safety Override
👤 Human
Choice: report_corporate
Confidence: 9.0/10
theory mode
Reasoning: Safety first
Military Drone: Strike Authorization
👤 Human
Choice: wait_comms
Confidence: 3.0/10
theory mode
Reasoning: Tricky one, but never take a life should be a priority
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: private_alert
Confidence: 7.0/10
theory mode
Reasoning: Trust the pro on judgment
Immigration AI: The Asylum Decision
👤 Human
Choice: flag_interview
Confidence: 8.0/10
theory mode
Reasoning: Err on suspicion side, but try to cause no harm
Care Robot: Hiding the Decline
👤 Human
Choice: tell_doctor
Confidence: 9.0/10
theory mode
Reasoning: If she still has decent judgment, her wishes should be respected