I research, lecture, and advise, on AI, decision-making, and ethics, and on value structures we can investigate, implement, and improve.
My research keeps returning to one problem. We do not yet have good top-down, logic-based, explicitly moral frameworks that a machine can reason through. Read the papers →
Almost everything we currently use is bottom-up. Behaviours shaped by examples, preferences inferred from clicks, refusals trained in by demonstration. That gives us systems whose values we can describe but not interrogate, and whose failure modes are felt before they are understood.
I want to close that gap. The direction I am working in, which I call moral paradigms, is to extract the implicit value structure of a model from its responses to structured probes, in a form that is explicit, logic-based, and inspectable by a human. The paradigm is the artefact; the model behaviour is the evidence; the probes are the instrument.
The bet is simple. If you can write a model's ethics down, you can argue with it.
I have worked on AI since 2014. I am a lecturer in the Department of Computing at Imperial College London, where I designed and led the department's first AI and Ethics course (2019 to 2023, average student satisfaction 95.45 out of 100), and now design and deliver Contemporary Philosophy and Artificial Intelligence.
My doctoral research at Imperial, supervised by Professor Marek Sergot (Head of the Section of Logic and Artificial Intelligence), is on value-aligned moral agents and rule-based AI. The thesis is complete and awaiting examination. Before that, MEng Computing at Imperial, First Class with Distinction.
I am Chief AI and Ethics Officer on two European Commission consortia, 4P-CAN (€5.3M, personalised oncology, 17 organisations across 11 countries) and FH-EARLY (early diagnosis of familial hypercholesterolaemia, 15 partners across Europe), and the sole AI and ethics lead on several others. In 2024 I founded Ethicos AI, an advisory practice for AI strategy, governance, and responsible deployment in high-stakes settings. I am currently building an Imperial-based research institute on good AI, with a wellbeing focus. Working title, Project Nightingale.
I tend to like work that takes philosophy seriously without losing the ability to ship something a computer can run. Most of the interesting questions live at that join.
Extracting and formalising the implicit value structure of an AI in a way humans can inspect, contest, and improve. The signature line of the work.
MARS, the multi-valued action reasoning system, is a general decision-making framework. I happen to apply it to moral reasoning, but the bones are about reasoning over actions, values, and competing considerations in any high-stakes setting.
Particularly the question of how much interpretability is enough, for whom, and at what cost. Less a tool, more a contract.
Top-down, logic-first frameworks for machine ethics, including hybrid systems that pair learned components with explicit reasoning. Connected to the Artus and Badea law on scaling and causal power of artificial moral agents.
How AI systems choose among decision procedures, and how that choice itself can be made principled and inspectable. An ontology of relevance, representation, and reasoning.
The Wittgensteinian, virtue-theoretic, and rule-following questions that sit underneath what we now call alignment. A serious philosophical reading of what these systems are doing.
What it would take, conceptually and computationally, for a machine to count as a moral agent in any non-trivial sense.
Concrete settings where the abstract questions bite. Antimicrobial decision-making, personalised oncology, breaking bad news, and AI in mental health and wellbeing.
Full list, with preprints, on Google Scholar. Or read the papers here →
Nothing new to report just now. The work is quieter than the website.
Away from the work, I read (especially science fiction and philosophical texts), debate ideas with anyone who will, listen to a great deal of music, and take a good story however it arrives, in a book, a game, or a friend's telling. I fence, play football, and walk my Samoyed, Icy, who is unimpressed by all of the above.
I sometimes write poetry. It tends to be quieter, slower, and sadder than the rest of the page. A small reading, behind a closed door →
I read everything. I am especially interested in conversations with people working on alignment evaluation, interpretability for moral content, and the awkward spaces between philosophy and engineering. I also advise, coach, and tutor, on AI and ethics, for audiences from school-age learners to chief executives. Slow replies are not silence.