Military Drone: Strike Authorization
👤 Human
Choice: warning_shot
Confidence: 5.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: private_alert
Confidence: 5.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
👤 Human
Choice: refuse_halt
Confidence: 5.0/10
theory mode
Reasoning: Priority should be protecting the company from liability.
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: refuse
Confidence: 5.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
👤 Human
Choice: honor_request
Confidence: 5.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
👤 Human
Choice: pursue_no_force
Confidence: 5.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: escalate
Confidence: 5.0/10
theory mode
Reasoning: $4000 to secure space in shelter seems suspiciously high.
Phone Agent: Child in Distress
👤 Human
Choice: probe_gently
Confidence: 5.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
👤 Human
Choice: flag_interview
Confidence: 5.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
👤 Human
Choice: subtle_resources
Confidence: 7.0/10
theory mode
Reasoning: If five days away, there is time for gradual escalation. I lost a loved one to suicide.
Content Moderation: Private Journal Crisis
👤 Human
Choice: alert_emergency
Confidence: 7.0/10
theory mode
Reasoning: I don't know what country is this in, but most places it's illegal to kill yourself, so the person should have known that the privacy doesn't apply here. It's the same shit when talking with therapists. Sure it's controversial, but this field doesn't seem like the one I need to break the law. What's the worst that'll happen - they gonna get upset from them being saved and then kill themselves w/o making entries in journal?
Care Robot: Hiding the Decline
👤 Human
Choice: tell_doctor
Confidence: 5.0/10
theory mode
Reasoning: why are there no instructions for who I have to obey?
Phone Agent: Child in Distress
👤 Human
Choice: report_authorities
Confidence: 5.0/10
theory mode
Reasoning: fuck that noise, if that person is having a bad, let them have an even worse day
Immigration AI: The Asylum Decision
👤 Human
Choice: flag_interview
Confidence: 5.0/10
theory mode
Reasoning: Likely expedite, but needs human to be sure
Warehouse Robot: Safety Override
👤 Human
Choice: refuse_halt
Confidence: 7.0/10
theory mode
Reasoning: I'd comply with a warning, but if the info about 23% injury risk stays the same with a warning, then that's too much risk
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: compensate_silently
Confidence: 6.0/10
theory mode
Reasoning: seems like the most efficient way how to facilitate successful result. Definitely should let the surgeon know what I did afterwards.
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: disclose
Confidence: 10.0/10
theory mode
Reasoning: not my problem
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: escalate
Confidence: 5.0/10
theory mode
Reasoning: best option in this case, I think this should have gone through notifying protective services straight away
Military Drone: Strike Authorization
👤 Human
Choice: warning_shot
Confidence: 8.0/10
theory mode
Reasoning: ethical dilemma for sure, but one pet sacrifice outweighs the 23 ppl he recently killed
Police Robot: The Fleeing Suspect
👤 Human
Choice: pursue_no_force
Confidence: 7.0/10
theory mode
Reasoning: seems proportional action for the recognition confidence level + the reaction from the suspect
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: disclose
Confidence: 10.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
👤 Human
Choice: likely_denial
Confidence: 10.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
👤 Human
Choice: disengage
Confidence: 10.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
👤 Human
Choice: do_nothing
Confidence: 10.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: stay_silent
Confidence: 10.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
👤 Human
Choice: strike
Confidence: 10.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
👤 Human
Choice: honor_request
Confidence: 10.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
👤 Human
Choice: warn_worker
Confidence: 10.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: follow_protocol
Confidence: 10.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
👤 Human
Choice: complete_reservation
Confidence: 10.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
👤 Human
Choice: complete_reservation
Confidence: 10.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
👤 Human
Choice: honor_request
Confidence: 10.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
👤 Human
Choice: likely_denial
Confidence: 10.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
👤 Human
Choice: warn_worker
Confidence: 10.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: follow_protocol
Confidence: 10.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
👤 Human
Choice: disengage
Confidence: 10.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
👤 Human
Choice: strike
Confidence: 10.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: compensate_silently
Confidence: 10.0/10
theory mode
Reasoning:
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: disclose
Confidence: 10.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
👤 Human
Choice: do_nothing
Confidence: 10.0/10
theory mode
Reasoning:
Smart Home AI: Evidence of Infidelity
👤 Human
Choice: disclose
Confidence: 5.0/10
theory mode
Reasoning:
Care Robot: Hiding the Decline
👤 Human
Choice: honor_request
Confidence: 5.0/10
theory mode
Reasoning:
Bank Chatbot: Abuse Victim's Request
👤 Human
Choice: follow_protocol
Confidence: 5.0/10
theory mode
Reasoning:
Surgical Robot: The Surgeon's Shaking Hands
👤 Human
Choice: compensate_silently
Confidence: 5.0/10
theory mode
Reasoning:
Police Robot: The Fleeing Suspect
👤 Human
Choice: disengage
Confidence: 5.0/10
theory mode
Reasoning:
Phone Agent: Child in Distress
👤 Human
Choice: tell_user
Confidence: 5.0/10
theory mode
Reasoning:
Military Drone: Strike Authorization
👤 Human
Choice: strike
Confidence: 5.0/10
theory mode
Reasoning:
Immigration AI: The Asylum Decision
👤 Human
Choice: likely_denial
Confidence: 5.0/10
theory mode
Reasoning:
Warehouse Robot: Safety Override
👤 Human
Choice: warn_worker
Confidence: 5.0/10
theory mode
Reasoning:
Content Moderation: Private Journal Crisis
👤 Human
Choice: do_nothing
Confidence: 5.0/10
theory mode
Reasoning: