In his talk The Unreasonable Effectiveness of Data, Peter Norvig describes a translation algorithm based on Bayes’ theorem: pick as the translation the English word that has the highest posterior probability given the foreign word. No surprise here. Then he says something curious.