A working selection of my published work on moral reasoning in AI, good decision-making, interpretability, and value alignment. Not sorted by citation count. The first three are the core of the research programme; the rest extend or apply it. Each card carries my own short note, a star line for what kind of paper it is, and links to read it.
If something here sparked an objection, a question, a related result, a correction, or an idea you want to push further, I would like to hear it. Reactions from researchers, clinicians, students, and curious readers are all welcome. The shorter and sharper the better; long is also fine.
Replies come from a human, slowly. I read everything but cannot promise a response to all of it. If you would prefer to leave a note on a specific paper, each card above has a small “respond to this paper” link tucked at the foot.
A public discussion space, with moderated comments per paper, is something I would like to add later.