Benchmarking Progress in AI: A New Benchmark for Natural Language Understanding