Multi-Hypothesis Machine Translation Evaluation