
Brian Christian
As artificial intelligence rapidly evolves, we face an urgent and complex challenge to ensure these powerful systems actually share and understand human values.
Machine learning models often inherit and perpetuate the harmful biases present in their training data.
Providing artificial intelligence with rigid reward structures can lead to unexpected and potentially dangerous outcomes.
Curiosity and intrinsic motivation are essential tools for helping algorithms learn in environments with limited external rewards.