Data BAD | What Will it Take to Fix Benchmarking for NLU?

Data BAD | What Will it Take to Fix Benchmarking for NLU?

Nov 12, 2021
|
22 views
Details
The Coffee Bean explains and comments the sobering take of the paper "What Will it Take to Fix Benchmarking in Natural Language Understanding?" See more videos from Ms. Coffee Bean about natural language understanding: πŸ“Ί The road to NLU: https://youtube.com/playlist?list=PLpZBeKTZRGPMjF-Ob-NYjaTtewbMNXKcU β–Ί Thanks to our Patrons who support us in Tier 2, 3, 4: πŸ™ donor, Dres. Trost GbR, Yannik Schneider Paper: πŸ“œ Bowman, Samuel R., and George E. Dahl. "What Will it Take to Fix Benchmarking in Natural Language Understanding?." arXiv preprint arXiv:2104.02145 (2021). https://arxiv.org/abs/2104.02145 πŸ”— SuperGLUE: https://super.gluebenchmark.com/tasks πŸ”— WiC: The Word-in-Context Dataset (English): https://pilehvar.github.io/wic/ Outline: 00:00 NLU Benchmarking – Motivation 01:04 How to measure NLU advances? 02:31 Why is NLU benchmarking broken? 04:43 What are the fixes? β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€ πŸ”₯ Optionally, pay us a coffee to help with our Coffee Bean production! β˜• Patreon: https://www.patreon.com/AICoffeeBreak Ko-fi: https://ko-fi.com/aicoffeebreak β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€β–€ πŸ”— Links: AICoffeeBreakQuiz: https://www.youtube.com/c/AICoffeeBreak/community Twitter: https://twitter.com/AICoffeeBreak Reddit: https://www.reddit.com/r/AICoffeeBreak/ YouTube: https://www.youtube.com/AICoffeeBreak #AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research​

Outline: 00:00 NLU Benchmarking – Motivation 01:04 How to measure NLU advances? 02:31 Why is NLU benchmarking broken? 04:43 What are the fixes?
Comments
loading...