MORU: a Benchmark for generalising compassion
If you train a model to care more about pig welfare, does that consideration carry over to other species, digital minds or alien life? MORU (Moral Reasoning Under Uncertainty) is a benchmark to test this.
Declan's task was to compile multiple benchmarks together to form MORU, write the code to put them on to Inspect (the most widely used eval implementation framework), and create a write up describing the benchmark.
