Aligning foundation models with human perception