Prodigy is a modern annotation tool for collecting training data for machine learning models, developed by the makers of spaCy. In this video, we'll show you how to set up Prodigy to find bad labels in text classification tasks. Although the techniques are demonstrated on text classification, they also apply to classification tasks in general.
[00:00] Bad Labels
[03:03] Google Emotions
[07:46] Heuristics
[09:12] Jupyter
[12:16] Models for Bad Labels
[15:26] Jupyter
[21:43] Embedding Tricks
[25:38] Jupyter
[29:29] Reason of Doubt
[31:20] Setting up Prodigy
[32:56] Annotating in Prodigy
[38:01] Annotator Disagreement
[42:16] Learnings
PRODIGY
● Website & docs: prodi.gy
● Live demo: prodi.gy/demo
● Forum: support.prodi.gy
THIS TUTORIAL
● Google Emotions Paper: arxiv.org/abs/...
● Code & data: github.com/exp...
● Whatlies Project: github.com/koa...
● Doubtlab Project: github.com/koa...
FOLLOW US
● Explosion: / explosion_ai
● We offer new services! spaCy Custom Solutions✨ explosion.ai/c...
Oct 5, 2024