On Manifold Markets, "If Artificial General Intelligence has an okay outcome, what will be the reason? — Hacks like RLHF-ing self-disempowerment into frontier models work long enough to develop better alignment methods, which in turn work long enough to ... etc; we keep ahead of 'alignment escape velocity'" has a probability of 1.5%.
This question is currently tracked on Manifold Markets only. When the same question appears on additional platforms, we compute a cross-platform consensus probability.