From: Nick Hay (firstname.lastname@example.org)
Date: Wed Mar 12 2008 - 16:40:46 MDT
On Thu, Mar 13, 2008 at 9:26 AM, <email@example.com> wrote:
> Executive Summary
> I've been arguing on the AGI list that a great way to ensure the stability of Friendliness for AGI would be to make it so that it was in an entity's own self-interest to be Friendly (I've been calling this the Attractor Theory of Friendliness).
> It's obvious in hindsight, but while working on it, I had the epiphany that it was only to my own benefit to declare Friendliness myself (duh, if it's in any AGI/entity's self-interest, then it must be in *my* self-interest).
> I have therefore declared Friendliness (codifying it in the declaration below) and am attempting to spread it, since it can be used to derive useful ethical guidelines for AGIs, humans, and all other entities to follow because it is in their own best self-interest <and mine> to do so.
Can you explain exactly what problem you are trying to solve without
using the word "Friendly"? If you have a solution, can you explain
how it works to a computer? I see only words that might explain
things to a human who already understood the situation. Of course I
don't expect all implementation details; it's just that I see none.
-- Nick Hay