INFO 523 - Spring 2024 - Project Final
Model Configuration | Parameters (Millions) | Training Details | Expected Advantage |
---|---|---|---|
DeBERTa-v3 Extra Small | 70.68 | - smaller batch size - maximize model efficiency |
- lower computational requirements - suitable for limited resources |
DeBERTa-v3 Small | 141.30 | - moderate batch size - balanced computational load and performance |
- better than extra small model with manageable resource use |
DeBERTa-v3 Large | 434.01 | - larger batch size - extended training periods |
- Highest accuracy and performance - suitable for resource-abundant scenarios |