Matilda Gibbons
Applying indicators of pain to LLMs
University of Pennsylvania
Bio
I am a postdoc in Neuroscience at the University of Pennsylvania and a Digital Sentience Consortium Research Fellow researching the neural basis of sentience and consciousness. My research models include insects, mice and, more recently, AI systems. I am mainly interested in how we can best test for sentience across different systems, without having a clear picture of the neural requirements for consciousness. I am particularly interested in using the feeling of pain as a key subjective experience to test for, for three main reasons: 1) pain is well understood in the neuroscience field across many model systems, 2) it is anatomically and behaviorally differentiable from it's unconscious counterpart (nociception), and 3) it is, arguably, the most important subjective experience in terms of welfare risk.
Mentee must-haves/nice-to-haves
The mentee should ideally be interested in testing for sentience in artifical systems or animals. Interests in neuroscience, philosophy and cognition/behavior would be preferable, and within the domain of using research to answer questions about sentience.
Mentee role
Mentees could take on one of the five main tasks listed above dependent on their interests, and I would work alongside the, or they could sub-tasks within these tasks depending on the time committment.
❓ Sample mentee tasks
- compiling a comprehensive list of putative sentience indicators
- providing a detailed explanation of why each indicator is relevant to sentience, using the human neuroscience literature
- collapsing these indicators into their underlying functional rationale. For example, “runs away from something that could cause pain” could be reframed as “motivated to avoid negative stimuli”
- assigning evidential weights to each indicator with certainty ratings
- using LLM architecture as a test-case to test whether the new indicators can be applied
Sub-task examples: Assigning evidential weights to five of the indicators Applying one or two indicators to LLM architecture
Mentor support
For research papers, I can help with evidence gathering, writing, editing, figure creation, data analysis or anything related to the submission process. For grant proposals or project facilitation, I can help with developing and fleshing out aims and timelines, and planning specifics.
Mentor-led project
Applying indicators of pain to LLMs
Currently, no gold-standard method exists for determining whether an agent is sentient, leaving such judgments to subjective and potentially biased interpretation, despite their significant consequences for welfare. In the animal sentience literature, multiple lists of indicators for sentience have been proposed, but such lists are currently not directly applicable to AI systems. This project will aim to convert these indicators into their more basic, functional underpinnings, so they can be used for AI systems. This will involve four main tasks:
- compiling a comprehensive list of putative sentience indicators
- providing a detailed explanation of why each indicator is relevant to sentience, using the human neuroscience literature
- collapsing these indicators into their underlying functional rationale. For example, “runs away from something that could cause pain” could be reframed as “motivated to avoid negative stimuli”
- assigning evidential weights to each indicator with certainty ratings
- using LLM architecture as a test-case to test whether the new indicators can be applied
